Gerald Tesauro

List of publications from the DBLP Bibliography Server - FAQ
Coauthor Index - Ask others: ACM DL/Guide - CiteSeer - CSB - Google - MSN - Yahoo

2007
42EEJeffrey O. Kephart, Hoi Chan, Rajarshi Das, David W. Levine, Gerald Tesauro, Freeman L. Rawson III, Charles Lefurgy: Coordinating Multiple Autonomic Managers to Achieve Specified Power-Performance Tradeoffs. ICAC 2007: 24
41EEIrina Rish, Gerald Tesauro: Estimating End-to-End Performance by Collaborative Prediction with Active Sampling. Integrated Network Management 2007: 294-303
40EEGerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: On the use of hybrid reinforcement learning for autonomic resource allocation. Cluster Computing 10(3): 287-299 (2007)
39EEGerald Tesauro: Reinforcement Learning in Autonomic Computing: A Manifesto and Case Studies. IEEE Internet Computing 11(1): 22-30 (2007)
2006
38EEGerald Tesauro, Nicholas K. Jong, Rajarshi Das, Mohamed N. Bennani: Improvement of Systems Management Policies Using Hybrid Reinforcement Learning. ECML 2006: 783-791
2005
37 Relu Patrascu, Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: New Approaches to Optimization and Utility Elicitation in Autonomic Computing. AAAI 2005: 140-145
36 Gerald Tesauro: Online Resource Allocation Using Decompositional Reinforcement Learning. AAAI 2005: 886-891
35EEGerald Tesauro, Rajarshi Das, William E. Walsh, Jeffrey O. Kephart: Utility-Function-Driven Resource Allocation in Autonomic Systems. ICAC 2005: 342-343
2004
34EEGerald Tesauro, David M. Chess, William E. Walsh, Rajarshi Das, Alla Segal, Ian Whalley, Jeffrey O. Kephart, Steve R. White: A Multi-Agent Systems Approach to Autonomic Computing. AAMAS 2004: 464-471
33EEWilliam E. Walsh, Gerald Tesauro, Jeffrey O. Kephart, Rajarshi Das: Utility Functions in Autonomic Systems. ICAC 2004: 70-77
2003
32EECuihong Li, Gerald Tesauro: A strategic decision model for multi-attribute bilateral negotiation with alternating. ACM Conference on Electronic Commerce 2003: 208-209
31EEJames E. Hanson, Gerald Tesauro, Jeffrey O. Kephart, E. C. Snibl: Multi-agent implementation of asymmetric protocol for bilateral negotiations. ACM Conference on Electronic Commerce 2003: 224-225
30EEGerald Tesauro: Extending Q-Learning to General Adaptive Multi-Agent Systems. NIPS 2003
29 Craig Boutilier, Rajarshi Das, Jeffrey O. Kephart, Gerald Tesauro, William E. Walsh: Cooperative Negotiation in Autonomic Systems using Incremental Utility Elicitation. UAI 2003: 89-97
2002
28EEGerald Tesauro, Jonathan Bredin: Strategic sequential bidding in auctions using dynamic programming. AAMAS 2002: 591-598
27EEGerald Tesauro: Programming backgammon using self-teaching neural nets. Artif. Intell. 134(1-2): 181-199 (2002)
26 Gerald Tesauro, Jeffrey O. Kephart: Pricing in Agent Economies Using Multi-Agent Q-Learning. Autonomous Agents and Multi-Agent Systems 5(3): 289-304 (2002)
2001
25EEGerald Tesauro, Rajarshi Das: High-performance bidding agents for the continuous double auction. ACM Conference on Electronic Commerce 2001: 206-209
24 Rajarshi Das, James E. Hanson, Jeffrey O. Kephart, Gerald Tesauro: Agent-Human Interactions in the Continuous Double Auction. IJCAI 2001: 1169-1187
23EEGerald Tesauro: Pricing in Agent Economies Using Neural Networks and Multi-agent Q-Learning. Sequence Learning 2001: 288-307
2000
22EEManu Sridharan, Gerald Tesauro: Multi-Agent Q-Learning and Regression Trees for Automated Pricing Decisions. ICMAS 2000: 447-448
21 Jeffrey O. Kephart, Gerald Tesauro: Pseudo-convergent Q-Learning by Competitive Pricebots. ICML 2000: 463-470
20 Manu Sridharan, Gerald Tesauro: Multi-agent Q-learning and Regression Trees for Automated Pricing Decisions. ICML 2000: 927-934
19EEGerald Tesauro, Jeffrey O. Kephart: Foresight-based pricing algorithms in agent economies. Decision Support Systems 28(1-2): 49-60 (2000)
1999
18EEAmy R. Greenwald, Jeffrey O. Kephart, Gerald Tesauro: Strategic pricebot dynamics. ACM Conference on Electronic Commerce 1999: 58-67
1998
17 Gerald Tesauro: Comments on ``Co-Evolution in the Successful Learning of Backgammon Strategy''. Machine Learning 32(3): 241-243 (1998)
1996
16EEGerald Tesauro, Gregory R. Galperin: On-line Policy Improvement using Monte-Carlo Search. NIPS 1996: 1068-1074
1995
15 Gerald Tesauro, David S. Touretzky, Todd K. Leen: Advances in Neural Information Processing Systems 7, [NIPS Conference, Denver, Colorado, USA, 1994] MIT Press 1995
14 Jeffrey O. Kephart, Gregory B. Sorkin, William C. Arnold, David M. Chess, Gerald Tesauro, Steve R. White: Biologically Inspired Defenses Against Computer Viruses. IJCAI (1) 1995: 985-996
13 Gerald Tesauro: Temporal Difference Learning and TD-Gammon. Commun. ACM 38(3): 58-68 (1995)
1994
12 Jack D. Cowan, Gerald Tesauro, Joshua Alspector: Advances in Neural Information Processing Systems 6, [7th NIPS Conference, Denver, Colorado, USA, 1993] Morgan Kaufmann 1994
1992
11 Gerald Tesauro: Temporal Difference Learning of Backgammon Strategy. ML 1992: 451-457
10 Gerald Tesauro: Practical Issues in Temporal Difference Learning. Machine Learning 8: 257-277 (1992)
1991
9EEGerald Tesauro: Practical Issues in Temporal Difference Learning. NIPS 1991: 259-266
1990
8EEDavid A. Cohn, Gerald Tesauro: Can Neural Networks Do Better Than the Vapnik-Chervonenkis Bounds? NIPS 1990: 911-917
1989
7EEJakub Wejchert, Gerald Tesauro: Neural Network Visualization. NIPS 1989: 465-472
6EESubutai Ahmad, Gerald Tesauro, Yu He: Asymptotic Convergence of Backpropagation: Numerical Experiments. NIPS 1989: 606-613
5 Gerald Tesauro, Terrence J. Sejnowski: A Parallel Network that Learns to Play Backgammon. Artif. Intell. 39(3): 357-390 (1989)
1988
4 Gerald Tesauro: Connectionist Learning of Expert Backgammon Evaluations. ML 1988: 200-206
3EESubutai Ahmad, Gerald Tesauro: Scaling and Generalization in Neural Networks: A Case Study. NIPS 1988: 160-168
2EEGerald Tesauro: Connectionist Learning of Expert Preferences by Comparison Training. NIPS 1988: 99-106
1987
1EEGerald Tesauro, Terrence J. Sejnowski: A 'Neural' Network that Learns to Play Backgammon. NIPS 1987: 794-803

Coauthor Index

1Subutai Ahmad [3] [6]
2Joshua Alspector [12]
3William C. Arnold [14]
4Mohamed N. Bennani [38] [40]
5Craig Boutilier [29] [37]
6Jonathan Bredin [28]
7Hoi Chan [42]
8David M. Chess [14] [34]
9David A. Cohn [8]
10Jack D. Cowan [12]
11Rajarshi Das [24] [25] [29] [33] [34] [35] [37] [38] [40] [42]
12Gregory R. Galperin [16]
13Amy R. Greenwald [18]
14James E. Hanson [24] [31]
15Yu He [6]
16Nicholas K. Jong [38] [40]
17Jeffrey O. Kephart [14] [18] [19] [21] [24] [26] [29] [31] [33] [34] [35] [37] [42]
18Todd K. Leen [15]
19Charles Lefurgy [42]
20David W. Levine [42]
21Cuihong Li [32]
22Relu Patrascu [37]
23Freeman L. Rawson III [42]
24Irina Rish [41]
25Alla Segal [34]
26Terrence J. Sejnowski [1] [5]
27E. C. Snibl [31]
28Gregory B. Sorkin [14]
29Manu Sridharan [20] [22]
30David S. Touretzky [15]
31William E. Walsh [29] [33] [34] [35] [37]
32Jakub Wejchert [7]
33Ian Whalley [34]
34Steve R. White [14] [34]

Colors in the list of coauthors

Copyright © Wed Jul 23 13:04:14 2008 by Michael Ley (ley@uni-trier.de)