論文Okudo, Takato, Yamada, Seiji (2021). Subgoal-Based Reward Shaping to Improve Efficiency in Reinforcement Learning. IEEE Access. Seiji Yamada / 2021-06-18 Okudo, Takato, Yamad
国際会議Takato Okudo and Seiji Yamada: Reward Shaping with Dynamic Trajectory Aggregation, IJCNN2021 Seiji Yamada / 2021-06-15 Takato Okudo and Sei
山田のブログ, 未分類岡村さんの論文 “Adaptive trust calibration for human-AI collaboration”がPLOS ONEに掲載されました. Seiji Yamada / 2020-02-25 岡村和男さん(総研大大学院生D5)と山田