Richard S. Sutton

List of publications from the DBLP Bibliography Server
2013
[j9]
Patrick M. Pilarski, Michael Rory Dawson, Thomas Degris, Jason P. Carey, K. Ming Chan, Jacqueline S. Hebert, Richard S. Sutton: Adaptive Artificial Limbs: A Real-Time Approach to Prediction and Anticipation. IEEE Robot. Automat. Mag. 20(1): 53-64 (2013)
[i5]
Harm van Seijen, Richard S. Sutton: Planning by Prioritized Sweeping with Small Backups. CoRR abs/1301.2343 (2013)
2012
[j8]
David Silver, Richard S. Sutton, Martin Müller: Temporal-difference search in computer Go. Machine Learning 87(2): 183-219 (2012)
[c54]
Ashique Rupam Mahmood, Richard S. Sutton, Thomas Degris, Patrick M. Pilarski: Tuning-free step-size adaptation. ICASSP 2012: 2121-2124
[c53]
Adam White, Joseph Modayil, Richard S. Sutton: Scaling life-long off-policy learning. ICDL-EPIROB 2012: 1-6
[c52]
Thomas Degris, Martha White, Richard S. Sutton: Linear Off-Policy Actor-Critic. ICML 2012
[c51]
Richard S. Sutton: Beyond Reward: The Problem of Knowledge and Data. ILP 2012: 2-6
[c50]
Joseph Modayil, Adam White, Richard S. Sutton: Multi-timescale Nexting in a Reinforcement Learning Robot. SAB 2012: 299-309
[c49]
Joseph Modayil, Adam White, Patrick M. Pilarski, Richard S. Sutton: Acquiring a broad range of empirical knowledge in real time by temporal-difference learning. SMC 2012: 1903-1910
[i4]
Thomas Degris, Martha White, Richard S. Sutton: Off-Policy Actor-Critic. CoRR abs/1205.4839 (2012)
[i3]
Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. CoRR abs/1206.3285 (2012)
[i2]
Adam White, Joseph Modayil, Richard S. Sutton: Scaling Life-long Off-policy Learning. CoRR abs/1206.6262 (2012)
2011
[c48]
Richard S. Sutton, Joseph Modayil, Michael Delp, Thomas Degris, Patrick M. Pilarski, Adam White, Doina Precup: Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. AAMAS 2011: 761-768
[i1]
Joseph Modayil, Adam White, Richard S. Sutton: Multi-timescale Nexting in a Reinforcement Learning Robot. CoRR abs/1112.1133 (2011)
2010
[c47]
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton: Toward Off-Policy Learning Control with Function Approximation. ICML 2010: 719-726
2009
[j7]
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Natural actor-critic algorithms. Automatica 45(11): 2471-2482 (2009)
[c46]
Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125
[c45]
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009: 1204-1212
[c44]
Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195
2008
[j6]
Elliot A. Ludvig, Richard S. Sutton, E. James Kehoe: Stimulus Representation and the Timing of Reward-Prediction Errors in Models of the Dopamine System. Neural Computation 20(12): 3034-3054 (2008)
[c43]
Maria Cutumisu, Duane Szafron, Michael H. Bowling, Richard S. Sutton: Agent Learning using Action-Dependent Learning Rates in Computer Role-Playing Games. AIIDE 2008
[c42]
David Silver, Richard S. Sutton, Martin Müller: Sample-based learning and search with permanent and transient memories. ICML 2008: 968-975
[c41]
Elliot A. Ludvig, Richard S. Sutton, Eric Verbeek, E. James Kehoe: A computational model of hippocampal function in trace conditioning. NIPS 2008: 993-1000
[c40]
Richard S. Sutton, Csaba Szepesvári, Hamid Reza Maei: A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. NIPS 2008: 1609-1616
[c39]
Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. UAI 2008: 528-536
2007
[c38]
Richard S. Sutton, Anna Koop, David Silver: On the role of tracking in stationary environments. ICML 2007: 871-878
[c37]
David Silver, Richard S. Sutton, Martin Müller: Reinforcement Learning of Local Shape in the Game of Go. IJCAI 2007: 1053-1058
[c36]
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Ghavamzadeh, Mark Lee: Incremental Natural Actor-Critic Algorithms. NIPS 2007
2006
[c35]
Alborz Geramifard, Michael H. Bowling, Richard S. Sutton: Incremental Least-Squares Temporal Difference Learning. AAAI 2006: 356-361
[c34]
Alborz Geramifard, Michael H. Bowling, Martin Zinkevich, Richard S. Sutton: iLSTD: Eligibility Traces and Convergence Analysis. NIPS 2006: 441-448
2005
[c33]
Brian Tanner, Richard S. Sutton: TD(lambda) networks: temporal-difference networks with eligibility traces. ICML 2005: 888-895
[c32]
Eddie J. Rafols, Mark B. Ring, Richard S. Sutton, Brian Tanner: Using Predictive Representations to Improve Generalization in Reinforcement Learning. IJCAI 2005: 835-840
[c31]
Brian Tanner, Richard S. Sutton: Temporal-Difference Networks with History. IJCAI 2005: 865-870
[c30]
Doina Precup, Richard S. Sutton, Cosmin Paduraru, Anna Koop, Satinder P. Singh: Off-policy Learning with Options and Recognizers. NIPS 2005
[c29]
Richard S. Sutton, Eddie J. Rafols, Anna Koop: Temporal Abstraction in Temporal-difference Networks. NIPS 2005
2004
[c28]
Richard S. Sutton, Brian Tanner: Temporal-Difference Networks. NIPS 2004
2002
[e1]
Rina Dechter, Richard S. Sutton (Eds.): Proceedings of the Eighteenth National Conference on Artificial Intelligence and Fourteenth Conference on Innovative Applications of Artificial Intelligence, July 28 - August 1, 2002, Edmonton, Alberta, Canada. AAAI Press / The MIT Press 2002
2001
[c27]
Doina Precup, Richard S. Sutton, Sanjoy Dasgupta: Off-Policy Temporal Difference Learning with Function Approximation. ICML 2001: 417-424
[c26]
Peter Stone, Richard S. Sutton: Scaling Reinforcement Learning toward RoboCup Soccer. ICML 2001: 537-544
[c25]
Michael L. Littman, Richard S. Sutton, Satinder P. Singh: Predictive Representations of State. NIPS 2001: 1555-1561
[c24]
Peter Stone, Richard S. Sutton: Keepaway Soccer: A Machine Learning Testbed. RoboCup 2001: 214-223
2000
[c23]
Doina Precup, Richard S. Sutton, Satinder P. Singh: Eligibility Traces for Off-Policy Policy Evaluation. ICML 2000: 759-766
[c22]
Peter Stone, Richard S. Sutton, Satinder P. Singh: Reinforcement Learning for 3 vs. 2 Keepaway. RoboCup 2000: 249-258
1999
[j5]
Richard S. Sutton, Doina Precup, Satinder P. Singh: Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning. Artif. Intell. 112(1-2): 181-211 (1999)
[c21]
Richard S. Sutton: Open Theoretical Questions in Reinforcement Learning. EuroCOLT 1999: 11-17
[c20]
Richard S. Sutton, David A. McAllester, Satinder P. Singh, Yishay Mansour: Policy Gradient Methods for Reinforcement Learning with Function Approximation. NIPS 1999: 1057-1063
1998
[j4]
Richard S. Sutton, Andrew G. Barto: Reinforcement Learning: An Introduction. IEEE Transactions on Neural Networks 9(5): 1054-1054 (1998)
[c19]
Doina Precup, Richard S. Sutton, Satinder P. Singh: Theoretical Results on Reinforcement Learning with Temporally Abstract Options. ECML 1998: 382-393
[c18]
Richard S. Sutton, Doina Precup, Satinder P. Singh: Intra-Option Learning about Temporally Abstract Actions. ICML 1998: 556-564
[c17]
Robert Moll, Andrew G. Barto, Theodore J. Perkins, Richard S. Sutton: Learning Instance-Independent Value Functions to Enhance Local Search. NIPS 1998: 1017-1023
[c16]
Richard S. Sutton, Satinder P. Singh, Doina Precup, Balaraman Ravindran: Improved Switching among Temporally Abstract Actions. NIPS 1998: 1066-1072
[c15]
Richard S. Sutton: Reinforcement Learning: Past, Present and Future. SEAL 1998: 195-197
1997
[c14]
Richard S. Sutton: On the Significance of Markov Decision Processes. ICANN 1997: 273-282
[c13]
Doina Precup, Richard S. Sutton: Exponentiated Gradient Methods for Reinforcement Learning. ICML 1997: 272-277
[c12]
Doina Precup, Richard S. Sutton: Multi-time Models for Temporally Abstract Planning. NIPS 1997
1996
[j3]
Satinder P. Singh, Richard S. Sutton: Reinforcement Learning with Replacing Eligibility Traces. Machine Learning 22(1-3): 123-158 (1996)
1995
[c11]
Richard S. Sutton: TD Models: Modeling the World at a Mixture of Time Scales. ICML 1995: 531-539
[c10]
Richard S. Sutton: Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. NIPS 1995: 1038-1044
1993
[c9]
Richard S. Sutton, Steven D. Whitehead: Online Learning with Random Representations. ICML 1993: 314-321
1992
[c8]
Richard S. Sutton: Adapting Bias by Gradient Descent: An Incremental Version of Delta-Bar-Delta. AAAI 1992: 171-176
1991
[j2]
Richard S. Sutton: Dyna, an Integrated Architecture for Learning, Planning, and Reacting. SIGART Bulletin 2(4): 160-163 (1991)
[c7]
Richard S. Sutton, Christopher J. Matheus: Learning Polynomial Functions by Feature Construction. ML 1991: 208-212
[c6]
Richard S. Sutton: Planning by Incremental Dynamic Programming. ML 1991: 353-357
[c5]
Terence D. Sanger, Richard S. Sutton, Christopher J. Matheus: Iterative Construction of Sparse Polynomial Approximations. NIPS 1991: 1064-1071
1990
[c4]
Richard S. Sutton: Integrated Architectures for Learning, Planning, and Reacting Based on Approximating Dynamic Programming. ML 1990: 216-224
[c3]
Richard S. Sutton: Integrated Modeling and Control Based on Reinforcement Learning. NIPS 1990: 471-478
1989
[c2]
Andrew G. Barto, Richard S. Sutton, Christopher J. C. H. Watkins: Sequential Decision Problems and Neural Networks. NIPS 1989: 686-693
1988
[j1]
Richard S. Sutton: Learning to Predict by the Methods of Temporal Differences. Machine Learning 3: 9-44 (1988)
1985
[c1]
Oliver G. Selfridge, Richard S. Sutton, Andrew G. Barto: Training and Tracking in Robotics. IJCAI 1985: 670-672

Coauthor Index

1. Andrew G. Barto: [j4] [c17] [c2] [c1]
2. Shalabh Bhatnagar: [c47] [j7] [c46] [c45] [c44] [c36]
3. Michael H. Bowling (Michael Bowling): [i3] [c43] [c39] [c35] [c34]
4. Jason P. Carey: [j9]
5. K. Ming Chan: [j9]
6. Maria Cutumisu: [c43]
7. Sanjoy Dasgupta: [c27]
8. Michael Rory Dawson: [j9]
9. Rina Dechter: [e1]
10. Thomas Degris: [j9] [c54] [c52] [i4] [c48]
11. Michael Delp: [c48]
12. Diao Dongcui: [c44]
13. Alborz Geramifard: [i3] [c39] [c35] [c34]
14. Mohammad Ghavamzadeh: [j7] [c36]
15. Jacqueline S. Hebert: [j9]
16. E. James Kehoe: [j6] [c41]
17. Anna Koop: [c38] [c30] [c29]
18. Mark Lee: [j7] [c36]
19. Michael L. Littman: [c25]
20. Elliot A. Ludvig: [j6] [c41]
21. Hamid Reza Maei: [c47] [c46] [c45] [c40]
22. Ashique Rupam Mahmood: [c54]
23. Yishay Mansour: [c20]
24. Christopher J. Matheus: [c7] [c5]
25. David A. McAllester: [c20]
26. Joseph Modayil: [c53] [c50] [c49] [i2] [c48] [i1]
27. Robert Moll (Robert N. Moll): [c17]
28. Martin Müller 0003: [j8] [c42] [c37]
29. Cosmin Paduraru: [c30]
30. Theodore J. Perkins: [c17]
31. Patrick M. Pilarski: [j9] [c54] [c49] [c48]
32. Doina Precup: [c48] [c46] [c45] [c30] [c27] [c23] [j5] [c19] [c18] [c16] [c13] [c12]
33. Eddie J. Rafols: [c32] [c29]
34. Balaraman Ravindran: [c16]
35. Mark B. Ring: [c32]
36. Terence D. Sanger: [c5]
37. Harm van Seijen: [i5]
38. Oliver G. Selfridge: [c1]
39. David Silver: [j8] [c46] [c45] [c42] [c38] [c37]
40. Satinder P. Singh: [c30] [c25] [c23] [c22] [j5] [c20] [c19] [c18] [c16] [j3]
41. Peter Stone: [c26] [c24] [c22]
42. Duane Szafron: [c43]
43. Csaba Szepesvári: [i3] [c47] [c46] [c45] [c44] [c40] [c39]
44. Brian Tanner: [c33] [c32] [c31] [c28]
45. H. M. W. (Eric) Verbeek (H. M. W. Verbeek, Eric Verbeek): [c41]
46. Christopher J. C. H. Watkins: [c2]
47. Adam White: [c53] [c50] [c49] [i2] [c48] [i1]
48. Martha White: [c52] [i4]
49. Steven D. Whitehead: [c9]
50. Eric Wiewiora: [c46]
51. Hengshuai Yao: [c44]
52. Martin Zinkevich: [c34]

Last update Sun May 19 15:17:40 2013 CET by the DBLP Team. This material is Open Data released under the ODC-BY 1.0 license.