| 2009 | ||
|---|---|---|
| c2 | Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195 | |
| 2006 | ||
| c1 | Hengshuai Yao, Diao Dongcui, Zengqi Sun: Historical Temporal Difference Learning: Some Initial Results. IMSCCS (2) 2006: 678-685 | |
| 1 | Shalabh Bhatnagar | |
| 2 | Zengqi Sun | |
| 3 | Richard S. Sutton | |
| 4 | Csaba Szepesvári | |
| 5 | Hengshuai Yao |
Data released under the ODC-BY 1.0 license — See also our legal information page