| 2013 | ||
|---|---|---|
| i3 | ||
| 2012 | ||
| c6 | ||
| i2 | Hengshuai Yao: Discovering and Leveraging the Most Valuable Links for Ranking. CoRR abs/1210.1626 (2012) | |
| 2009 | ||
| c5 | Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári: LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. CDC 2009: 1181-1188 | |
| c4 | Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195 | |
| 2008 | ||
| c3 | ||
| c2 | Hengshuai Yao, Zhi-Qiang Liu: Minimal Residual Approaches for Policy Evaluation in Large Sparse Markov Chains. ISAIM 2008 | |
| 2007 | ||
| i1 | ||
| 2006 | ||
| c1 | Hengshuai Yao, Diao Dongcui, Zengqi Sun: Historical Temporal Difference Learning: Some Initial Results. IMSCCS (2) 2006: 678-685 | |
| 1 | Shalabh Bhatnagar | |
| 2 | Diao Dongcui | |
| 3 | Zhi-Qiang Liu | |
| 4 | Dale Schuurmans | |
| 5 | Zengqi Sun | |
| 6 | Richard S. Sutton | |
| 7 | Csaba Szepesvári | |
Data released under the ODC-BY 1.0 license.