 | 1998 |
| 4 |  | Mohammad A. Al-Ansari,
Ronald J. Williams:
Robust, Efficient, Globally-Optimized Reinforcement Learning with the Parti-Game Algorithm.
NIPS 1998: 961-967 |
| 1996 |
| 3 |  | Jing Peng,
Ronald J. Williams:
Incremental Multi-Step Q-Learning.
Machine Learning 22(1-3): 283-290 (1996) |
| 1994 |
| 2 |  | Jing Peng,
Ronald J. Williams:
Incremental Multi-Step Q-Learning.
ICML 1994: 226-232 |
| 1992 |
| 1 |  | Ronald J. Williams:
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning.
Machine Learning 8: 229-256 (1992) |