| 2009 | ||
|---|---|---|
| 3 | Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiyama: Active Policy Iteration: Efficient Exploration through Active Learning for Value Function Approximation in Reinforcement Learning. IJCAI 2009: 980-985 | |
| 2 | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive importance sampling for value function approximation in off-policy reinforcement learning. Neural Networks 22(10): 1399-1410 (2009) | |
| 2008 | ||
| 1 | Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiyama, Jan Peters: Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation. AAAI 2008: 1351-1356 | |
| 1 | Hirotaka Hachiya | [1] [2] [3] |
| 2 | Jan Peters | [1] [2] |
| 3 | Masashi Sugiyama | [1] [2] [3] |