Takato Okudo and Seiji Yamada: Reward Shaping with Dynamic Trajectory Aggregation, IJCNN2021

Takato Okudo and Seiji Yamada: Reward Shaping with Dynamic Trajectory Aggregation, 2021 International Joint Conference on Neural Networks (IJCNN2021), online, 10.1109/IJCNN52387.2021.9533401

上部へスクロール