Volume 22, Numbers 1-3, January 1996
: Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results.
: The Loss from Imperfect Value Functions in Expectation-Based and Minimax-Based Tasks.
, Reid G. Simmons
: The Effect of Representation and Knowledge on Goal-Directed Exploration with Reinforcement-Learning Algorithms.