Volume 49, Number 1, October 2002
- Fredrik A. Dahl:
The Lagging Anchor Algorithm: Reinforcement Learning in Two-Player Zero-Sum Games with Imperfect Information.
5-37
- Michael D. Lee:
A Simple Method for Generating Additive Clustering Models with Limited Complexity.
39-58
- Shaul Markovitch, Dan Rosenstein:
Feature Generation Using General Constructor Functions.
59-98
Volume 49, Number 2-3, November-December 2002
- Satinder P. Singh:
Introduction.
107-109
- Hui Tong, Timothy X. Brown:
Reinforcement Learning for Call Admission Control and Routing under Quality of Service Constraints in Multimedia Networks.
111-139
- Amy McGovern, J. Eliot B. Moss, Andrew G. Barto:
Building a Basic Block Instruction Scheduler with Reinforcement Learning and Rollouts.
141-160
- Dirk Ormoneit, Saunak Sen:
Kernel-Based Reinforcement Learning.
161-178
- John N. Tsitsiklis, Benjamin Van Roy:
On Average Versus Discounted Reward Temporal-Difference Learning.
179-191
- Michael J. Kearns, Yishay Mansour, Andrew Y. Ng:
A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes.
193-208
- Michael J. Kearns, Satinder P. Singh:
Near-Optimal Reinforcement Learning in Polynomial Time.
209-232
- Justin A. Boyan:
Technical Update: Least-Squares Temporal Difference Learning.
233-246
- José del R. Millán, Daniele Posenato, Eric Dedieu:
Continuous-Action Q-Learning.
247-265
- Oliver Mihatsch, Ralph Neuneier:
Risk-Sensitive Reinforcement Learning.
267-290
- Rémi Munos, Andrew W. Moore:
Variable Resolution Discretization in Optimal Control.
291-323
- David J. Foster, Peter Dayan:
Structure in the Space of Value Functions.
325-346
Copyright © Wed Nov 11 05:28:56 2009 by Michael Ley (ley@uni-trier.de)