Csaba Szepesvári Coauthor index pubzone.org

List of publications from the DBLP Bibliography Server - FAQ
Ask others: ACM DL/Guide - CiteSeerX - CSB - MetaPress - Google - Bing - Yahoo

DBLP keys2012
101Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMahdi Milani Fard, Joelle Pineau, Csaba Szepesvári: PAC-Bayesian Policy Evaluation for Reinforcement Learning CoRR abs/1202.3717: (2012)
100Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSylvain Gelly, Levente Kocsis, Marc Schoenauer, Michèle Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud: The grand challenge of computer Go: Monte Carlo tree search and extensions. Commun. ACM 55(3): 106-113 (2012)
99Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits. Journal of Machine Learning Research - Proceedings Track 22: 1-9 (2012)
98Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGergely Neu, András György, Csaba Szepesvári: The adversarial stochastic shortest path problem with unknown transition probabilities. Journal of Machine Learning Research - Proceedings Track 22: 805-813 (2012)
2011
97Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann: Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings Springer 2011
96Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann: Editors' Introduction. ALT 2011: 1-13
95Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Invited Talk: Towards Robust Reinforcement Learning Algorithms. EWRL 2011: 4
94Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLPallavi Arora, Csaba Szepesvári, Rong Zheng: Sequential learning for optimal monitoring of multi-channel wireless networks. INFOCOM 2011: 1152-1160
93Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Improved Algorithms for Linear Stochastic Bandits. NIPS 2011: 2312-2320
92Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMahdi Milani Fard, Joelle Pineau, Csaba Szepesvári: PAC-Bayesian Policy Evaluation for Reinforcement Learning. UAI 2011: 195-202
91Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Gábor Bartók, Dávid Pál, Csaba Szepesvári: Toward a Classification of Finite Partial-Monitoring Games CoRR abs/1102.2041: (2011)
90Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems CoRR abs/1102.2670: (2011)
89Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Gábor Bartók, Csaba Szepesvári: Non-trivial two-armed partial-monitoring games are bandits CoRR abs/1108.4961: (2011)
88Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLArash Afkanpour, Csaba Szepesvári, Michael H. Bowling: Alignment Based Kernel Learning with a Continuous Set of Base Kernels CoRR abs/1112.4607: (2011)
87Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: X-Armed Bandits. Journal of Machine Learning Research 12: 1655-1695 (2011)
86Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYasin Abbasi-Yadkori, Csaba Szepesvári: Regret Bounds for the Adaptive Control of Linear Quadratic Systems. Journal of Machine Learning Research - Proceedings Track 19: 1-26 (2011)
85Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGábor Bartók, Dávid Pál, Csaba Szepesvári: Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments. Journal of Machine Learning Research - Proceedings Track 19: 133-154 (2011)
84Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLIstván Szita, Csaba Szepesvári: Agnostic KWIK learning and efficient approximate reinforcement learning. Journal of Machine Learning Research - Proceedings Track 19: 739-772 (2011)
83Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Csaba Szepesvári: Model selection in reinforcement learning. Machine Learning 85(3): 299-332 (2011)
2010
82Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Algorithms for Reinforcement Learning Morgan & Claypool Publishers 2010
81Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGábor Bartók, Dávid Pál, Csaba Szepesvári: Toward a Classification of Finite Partial-Monitoring Games. ALT 2010: 224-238
80Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGergely Neu, András György, Csaba Szepesvári: The Online Loop-free Stochastic Shortest-Path Problem. COLT 2010: 231-243
79Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLIstvan Szita, Csaba Szepesvári: Model-based reinforcement learning with nearly tight exploration complexity bounds. ICML 2010: 1031-1038
78Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLHamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton: Toward Off-Policy Learning Control with Function Approximation. ICML 2010: 719-726
77Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLLiuyang Li, Barnabás Póczos, Csaba Szepesvári, Russell Greiner: Budgeted Distribution Learning of Belief Net Parameters. ICML 2010: 879-886
76Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYasin Abbasi-Yadkori, Joseph Modayil, Csaba Szepesvári: Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning. IROS 2010: 127-132
75Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGergely Neu, András György, Csaba Szepesvári, András Antos: Online Markov Decision Processes under Bandit Feedback. NIPS 2010: 1804-1812
74Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLDávid Pál, Barnabás Póczos, Csaba Szepesvári: Estimation of Renyi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs. NIPS 2010: 1849-1857
73Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Rémi Munos, Csaba Szepesvári: Error Propagation for Approximate Policy and Value Iteration. NIPS 2010: 568-576
72Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSarah Filippi, Olivier Cappé, Aurélien Garivier, Csaba Szepesvári: Parametric Bandits: The Generalized Linear Case. NIPS 2010: 586-594
71Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: X-Armed Bandits CoRR abs/1001.4475: (2010)
70Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLDávid Pál, Barnabás Póczos, Csaba Szepesvári: Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs CoRR abs/1003.1954: (2010)
69Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGábor Bartók, Csaba Szepesvári, Sandra Zilles: Models of active learning in group-structured state spaces. Inf. Comput. 208(4): 364-384 (2010)
68Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLBarnabás Póczos, Sergey Kirshner, Csaba Szepesvári: REGO: Rank-based Estimation of Renyi Information using Euclidean Graph Optimization. Journal of Machine Learning Research - Proceedings Track 9: 605-612 (2010)
67Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLPéter Torma, András György, Csaba Szepesvári: A Markov-Chain Monte Carlo Approach to Simultaneous Localization and Mapping. Journal of Machine Learning Research - Proceedings Track 9: 852-859 (2010)
66Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Varun Grover, Csaba Szepesvári: Active learning in heteroscedastic noise. Theor. Comput. Sci. 411(29-30): 2712-2728 (2010)
2009
65Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLHengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári: LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. CDC 2009: 1181-1188
64Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLBarnabás Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner, Nathan R. Sturtevant: Learning when to stop thinking and do something! ICML 2009: 104
63Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLRichard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125
62Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko, Csaba Szepesvári: Workshop summary: On-line learning with limited feedback. ICML 2009: 168
61Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlireza Farhangfar, Russell Greiner, Csaba Szepesvári: Learning to segment from a few well-selected training images. ICML 2009: 39
60Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Azad Shademan, Martin Jägersand, Csaba Szepesvári: Model-based and model-free reinforcement learning for visual servoing. ICRA 2009: 2917-2924
59Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLHamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009: 1204-1212
58Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLHengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195
57Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYaoliang Yu, Yuxi Li, Dale Schuurmans, Csaba Szepesvári: A General Projection Property for Distribution Families. NIPS 2009: 2232-2240
56Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLYuxi Li, Csaba Szepesvári, Dale Schuurmans: Learning Exercise Policies for American Options. Journal of Machine Learning Research - Proceedings Track 5: 352-359 (2009)
55Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGergely Neu, Csaba Szepesvári: Training parsers by inverse reinforcement learning. Machine Learning 77(2-3): 303-337 (2009)
54Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theor. Comput. Sci. 410(19): 1876-1902 (2009)
2008
53Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Varun Grover, Csaba Szepesvári: Active Learning in Multi-armed Bandits. ALT 2008: 287-302
52Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGábor Bartók, Csaba Szepesvári, Sandra Zilles: Active Learning of Group-Structured Environments. ALT 2008: 329-343
51Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Fitted Q-Iteration: Application to Planning. EWRL 2008: 55-68
50Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLVolodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert: Empirical Bernstein stopping. ICML 2008: 672-679
49Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLRichard S. Sutton, Csaba Szepesvári, Hamid Reza Maei: A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. NIPS 2008: 1609-1616
48Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: Online Optimization in X-Armed Bandits. NIPS 2008: 201-208
47Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Policy Iteration. NIPS 2008: 441-448
46Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAlejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner: Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstraction. UAI 2008: 306-314
45Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLRichard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. UAI 2008: 528-536
44Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLRémi Munos, Csaba Szepesvári: Finite-Time Bounds for Fitted Value Iteration. Journal of Machine Learning Research 9: 815-857 (2008)
43Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Csaba Szepesvári, Rémi Munos: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning 71(1): 89-129 (2008)
2007
42Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Tuning Bandit Algorithms in Stochastic Environments. ALT 2007: 150-165
41Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLPeter Auer, Ronald Ortner, Csaba Szepesvári: Improved Rates for the Stochastic Continuum-Armed Bandit Problem. COLT 2007: 454-468
40Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAmir Massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert: Manifold-adaptive dimension estimation. ICML 2007: 265-272
39Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLIstván Bíró, Zoltán Szamonek, Csaba Szepesvári: Sequence Prediction Exploiting Similary Information. IJCAI 2007: 1576-1581
38Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás György, Levente Kocsis, Ivett Szabó, Csaba Szepesvári: Continuous Time Associative Bandit Problems. IJCAI 2007: 830-835
37Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Rémi Munos, Csaba Szepesvári: Fitted Q-iteration in continuous action-space MDPs. NIPS 2007
36Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGergely Neu, Csaba Szepesvári: Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods. UAI 2007: 295-302
2006
35Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLLevente Kocsis, Csaba Szepesvári, Mark H. M. Winands: RSPSA: Enhanced Parameter Optimization in Games. ACG 2006: 39-56
34Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Antos, Csaba Szepesvári, Rémi Munos: Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path. COLT 2006: 574-588
33Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLLevente Kocsis, Csaba Szepesvári: Bandit Based Monte-Carlo Planning. ECML 2006: 282-293
32Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLPéter Torma, Csaba Szepesvári: Local Importance Sampling: A Novel Technique to Enhance Particle Filtering. Journal of Multimedia 1(1): 32-43 (2006)
31Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLLevente Kocsis, Csaba Szepesvári: Universal parameter optimisation in games based on SPSA. Machine Learning 63(3): 249-286 (2006)
2005
30Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZoltán Szamonek, Csaba Szepesvári: X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown. ICDM 2005: 434-441
29Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, Rémi Munos: Finite time bounds for sampling based fitted value iteration. ICML 2005: 880-887
2004
28no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results. AAAI 2004: 550-555
27no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, András Kocsor, Kornél Kovács: Kernel Machine Based Feature Extraction Algorithms for Regression Problems. ECAI 2004: 1091-1092
26Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLPéter Torma, Csaba Szepesvári: Enhancing Particle Filters Using Local Likelihood Sampling. ECCV (1) 2004: 16-27
25Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Kocsor, Kornél Kovács, Csaba Szepesvári: Margin Maximizing Discriminant Analysis. ECML 2004: 227-238
24Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, William D. Smart: Interpolation-based Q-learning. ICML 2004
2002
23Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLM. French, Csaba Szepesvári, Eric Rogers: LQ performance bounds for adaptive output feedback controllers for functionally uncertain nonlinear systems. Automatica 38(4): 683-693 (2002)
22Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLM. French, Csaba Szepesvári, Eric Rogers: An Asymptotic Scaling Analysis of LQ Performance for an Approximate Adaptive Control Design. MCSS 15(2): 145-176 (2002)
2001
21Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Efficient approximate planning in continuous space Markovian Decision Problems. AI Commun. 14(3): 163-176 (2001)
20Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLAndrás Lörincz, György Hévízi, Csaba Szepesvári: Ockham's Razor Modeling of the Matrisome Channels of the Basal Ganglia Thalamocortical Loops. Int. J. Neural Syst. 11(2): 125-143 (2001)
2000
19Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLGyörgy Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári: FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. TSD 2000: 189-194
18Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZsolt Kalmár, Csaba Szepesvári, András Lörincz: Modular Reinforcement Learning: A Case Study in a Robot Domain. Acta Cybern. 14(3): 507-522 (2000)
17Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLSatinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári: Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. Machine Learning 38(3): 287-308 (2000)
1999
16Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, Michael L. Littman: A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms. Neural Computation 11(8): 2017-2060 (1999)
15Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZsolt Kalmár, Zsolt Marczell, Csaba Szepesvári, András Lörincz: Parallel and robust skeletonization built on self-organizing elements. Neural Networks 12(1): 163-173 (1999)
14Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLJános Murvai, Kristian Vlahovicek, Endre Barta, Csaba Szepesvári, Cristina Acatrinei, Sándor Pongor: The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments. Nucleic Acids Research 27(1): 257-259 (1999)
1998
13no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZoltán Gábor, Zsolt Kalmár, Csaba Szepesvári: Multi-criteria Reinforcement Learning. ICML 1998: 197-205
12Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Non-Markovian Policies in Sequential Decision Problems. Acta Cybern. 13(3): 305-318 (1998)
11Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Auton. Robots 5(3-4): 273-295 (1998)
10Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Machine Learning 31(1-3): 55-85 (1998)
1997
9Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: Learning and Exploitation Do Not Conflict Under Minimax Optimality. ECML 1997: 242-249
8Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLZsolt Kalmár, Csaba Szepesvári, András Lörincz: Module Based Reinforcement Learning: An Application to a Real Robot. EWLR 1997: 29-45
7no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári: The Asymptotic Convergence-Rate of Q-learning. NIPS 1997
6Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, Szabolcs Cimmer, András Lörincz: Neurocontroller using dynamic state feedback for compensatory control. Neural Networks 10(9): 1691-1708 (1997)
1996
5Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, András Lörincz: Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers. ICANN 1996: 791-796
4no EE pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLMichael L. Littman, Csaba Szepesvári: A Generalized Reinforcement-Learning Model: Convergence and Applications. ICML 1996: 310-318
3Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLTibor Fomin, Tamás Rozgonyi, Csaba Szepesvári, András Lörincz: Self-Organizing Multi-Resolution Grid for Motion Planning and Control. Int. J. Neural Syst. 7(6): 757- (1996)
2Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, András Lörincz: Approximate geometry representations and sensory fusion. Neurocomputing 12(2-3): 267-287 (1996)
1994
1Electronic Edition pubzone.org CiteSeerX Google scholar BibTeX bibliographical record in XMLCsaba Szepesvári, László Balázs, András Lörincz: Topology Learning Solved by Extended Objects: A Neural Network Model. Neural Computation 6(3): 441-458 (1994)

Coauthor Index

1Yasin Abbasi-Yadkori [64] [76] [86] [90] [93] [99]
2Cristina Acatrinei [14]
3Arash Afkanpour [88]
4András Antos [34] [37] [43] [53] [66] [75] [89] [91]
5Pallavi Arora [94]
6Jean-Yves Audibert [40] [42] [50] [54] [62]
7Peter Auer [41] [62]
8László Balázs [1]
9György Balogh [19]
10Endre Barta [14]
11Gábor Bartók [52] [69] [81] [85] [89] [91]
12Shalabh Bhatnagar [58] [59] [63] [65] [78]
13István Bíró [39]
14Michael H. Bowling [45] [88]
15Sébastien Bubeck [48] [71] [87]
16Vadim Bulitko [46]
17Olivier Cappé [72]
18Szabolcs Cimmer [6]
19Ervin Dobler [19]
20Diao Dongcui [58]
21Amir Massoud Farahmand [40] [47] [51] [60] [73] [83]
22Mahdi Milani Fard [92] [101]
23Alireza Farhangfar [61]
24Sarah Filippi [72]
25Tibor Fomin [3]
26M. French [22] [23]
27Zoltán Gábor [13]
28Aurélien Garivier [72]
29Sylvain Gelly [100]
30Alborz Geramifard [45]
31Mohammad Ghavamzadeh [47] [51]
32Russell Greiner [46] [61] [64] [77]
33Tamás Gröbler [19]
34Varun Grover [53] [66]
35András György [38] [67] [75] [80] [98]
36György Hévízi [20]
37Alejandro Isaza [46]
38Tommi Jaakkola [17]
39Martin Jägersand [60]
40Zsolt Kalmár [8] [10] [11] [13] [15] [18]
41Sergey Kirshner [68]
42Jyrki Kivinen [96] [97]
43Levente Kocsis [31] [33] [35] [38] [100]
44András Kocsor [25] [27]
45Kornél Kovács [25] [27]
46Alessandro Lazaric [62]
47Liuyang Li [77]
48Yuxi Li [56] [57]
49Michael L. Littman [4] [16] [17]
50András Lörincz [1] [2] [3] [5] [6] [8] [10] [11] [15] [18] [20]
51Hamid Reza Maei [49] [59] [63] [78]
52Shie Mannor [47] [51]
53Zsolt Marczell [15]
54Volodymyr Mnih [50]
55Joseph Modayil [76]
56Rémi Munos [29] [34] [37] [42] [43] [44] [48] [54] [62] [71] [73] [87]
57János Murvai [14]
58Gergely Neu [36] [55] [75] [80] [98]
59Ronald Ortner [41]
60Dávid Pál [70] [74] [81] [85] [90] [91] [93] [99]
61Joelle Pineau [92] [101]
62Barnabás Póczos [64] [68] [70] [74] [77]
63Sándor Pongor [14]
64Doina Precup [59] [63]
65Eric Rogers [22] [23]
66Tamás Rozgonyi [3]
67Daniil Ryabko [62]
68Marc Schoenauer [100]
69Dale Schuurmans [56] [57]
70Michèle Sebag [100]
71Azad Shademan [60]
72David Silver [59] [63] [100]
73Satinder P. Singh [17]
74William D. Smart [24]
75Béla Smodics [19]
76Gilles Stoltz [48] [71] [87]
77Nathan R. Sturtevant [64]
78Richard S. Sutton [45] [49] [58] [59] [63] [78]
79Ivett Szabó [38]
80Zoltán Szamonek [30] [39]
81István Szita (Istvan Szita) [79] [84]
82Olivier Teytaud [100]
83Péter Torma [26] [32] [67]
84Esko Ukkonen [96] [97]
85Kristian Vlahovicek [14]
86Eric Wiewiora [63]
87Mark H. M. Winands [35]
88Hengshuai Yao [58] [65]
89Yaoliang Yu [57]
90Thomas Zeugmann [96] [97]
91Rong Zheng [94]
92Sandra Zilles [52] [69]

Colors in the list of coauthors

Last update Fri May 25 01:42:58 2012 CET by the DBLP TeamThis material is Open Data Data released under the ODC-BY 1.0 license — See also our legal information page