 | 2009 |
| 20 |  | Istvan Szita,
András Lörincz:
Optimistic initialization and greediness lead to polynomial time learning in factored MDPs.
ICML 2009: 126 |
| 19 |  | Istvan Szita,
András Lörincz:
Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version
CoRR abs/0904.3352: (2009) |
| 2008 |
| 18 |  | Guillaume Chaslot,
Sander Bakkes,
Istvan Szita,
Pieter Spronck:
Monte-Carlo Tree Search: A New Framework for Game AI.
AIIDE 2008 |
| 17 |  | Istvan Szita,
András Lörincz:
The many faces of optimism: a unifying approach.
ICML 2008: 1048-1055 |
| 16 |  | Istvan Szita,
András Lörincz:
Factored Value Iteration Converges.
Acta Cybern. 18(4): 615-635 (2008) |
| 15 |  | Istvan Szita,
András Lörincz:
Online variants of the cross-entropy method
CoRR abs/0801.1988: (2008) |
| 14 |  | Istvan Szita,
András Lörincz:
Factored Value Iteration Converges
CoRR abs/0801.2069: (2008) |
| 13 |  | Istvan Szita,
András Lörincz:
The many faces of optimism - Extended version
CoRR abs/0810.3451: (2008) |
| 2007 |
| 12 |  | Istvan Szita,
András Lörincz:
Learning to Play Using Low-Complexity Rule-Based Policies: Illustrations through Ms. Pac-Man.
J. Artif. Intell. Res. (JAIR) 30: 659-684 (2007) |
| 2006 |
| 11 |  | Istvan Szita,
Viktor Gyenes,
András Lörincz:
Reinforcement Learning with Echo State Networks.
ICANN (1) 2006: 830-839 |
| 10 |  | Istvan Szita,
András Lörincz:
Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs
CoRR abs/cs/0610170: (2006) |
| 9 |  | Istvan Szita,
András Lörincz:
Learning Tetris Using the Noisy Cross-Entropy Method.
Neural Computation 18(12): 2936-2941 (2006) |
| 2004 |
| 8 |  | Istvan Szita,
András Lörincz:
Applying Policy Iteration for Training Recurrent Neural Networks
CoRR cs.AI/0410004: (2004) |
| 7 |  | Istvan Szita,
András Lörincz:
Kalman Filter Control Embedded into the Reinforcement Learning Framework.
Neural Computation 16(3): 491-499 (2004) |
| 2003 |
| 6 |  | Bálint Takács,
Istvan Szita,
András Lörincz:
Temporal plannability by variance of the episode length
CoRR cs.AI/0301006: (2003) |
| 5 |  | Istvan Szita,
András Lörincz:
Kalman filter control in the reinforcement learning framework
CoRR cs.LG/0301007: (2003) |
| 4 |  | Istvan Szita,
András Lörincz:
Reinforcement Learning with Linear Function Approximation and LQ control Converges
CoRR cs.LG/0306120: (2003) |
| 2002 |
| 3 |  | Istvan Szita,
Bálint Takács,
András Lörincz:
Reinforcement Learning Integrated with a Non-Markovian Controller.
ECAI 2002: 365-369 |
| 2 |  | Istvan Szita,
Bálint Takács,
András Lörincz:
Searching for Plannable Domains can Speed up Reinforcement Learning
CoRR cs.AI/0212025: (2002) |
| 1 |  | Istvan Szita,
Bálint Takács,
András Lörincz:
MDPs: Learning in Varying Environments.
Journal of Machine Learning Research 3: 145-174 (2002) |