 | 2009 |
| 3 |  | Tsuyoshi Ueno,
Shin-ichi Maeda,
Motoaki Kawanabe,
Shin Ishii:
Optimal Online Learning Procedures for Model-Free Policy Evaluation.
ECML/PKDD (2) 2009: 473-488 |
| 2008 |
| 2 |  | Tsuyoshi Ueno,
Motoaki Kawanabe,
Takeshi Mori,
Shin-ichi Maeda,
Shin Ishii:
A semiparametric statistical approach to model-free policy evaluation.
ICML 2008: 1072-1079 |
| 2006 |
| 1 |  | Tsuyoshi Ueno,
Yutaka Nakamura,
Takashi Takuma,
Tomohiro Shibata,
Koh Hosoda,
Shin Ishii:
Fast and Stable Learning of Quasi-Passive Dynamic Walking by an Unstable Biped Robot based on Off-Policy Natural Actor-Critic.
IROS 2006: 5226-5231 |