Csaba Szepesvári

List of publications from the DBLP Bibliography Server
2013
[j37]
András Antos, Gábor Bartók, Dávid Pál, Csaba Szepesvári: Toward a classification of finite partial-monitoring games. Theor. Comput. Sci. 473: 77-99 (2013)
[i13]
Yasin Abbasi-Yadkori, Peter L. Bartlett, Csaba Szepesvári: Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions. CoRR abs/1303.3055 (2013)
2012
[j36]
Sylvain Gelly, Levente Kocsis, Marc Schoenauer, Michèle Sebag, David Silver, Csaba Szepesvári, Olivier Teytaud: The grand challenge of computer Go: Monte Carlo tree search and extensions. Commun. ACM 55(3): 106-113 (2012)
[j35]
Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits. Journal of Machine Learning Research - Proceedings Track 22: 1-9 (2012)
[j34]
Gergely Neu, András György, Csaba Szepesvári: The adversarial stochastic shortest path problem with unknown transition probabilities. Journal of Machine Learning Research - Proceedings Track 22: 805-813 (2012)
[c63]
Hengshuai Yao, Csaba Szepesvári: Approximate Policy Iteration with Linear Action Models. AAAI 2012
[c62]
Gábor Bartók, Csaba Szepesvári: Partial Monitoring with Side Information. ALT 2012: 305-319
[c61]
Gábor Bartók, Navid Zolghadr, Csaba Szepesvári: An adaptive algorithm for finite stochastic partial monitoring. ICML 2012
[c60]
Bernardo Avila Pires, Csaba Szepesvári: Statistical linear estimation with penalized estimators: an application to reinforcement learning. ICML 2012
[c59]
Yaoliang Yu, Csaba Szepesvári: Analysis of Kernel Mean Matching under Covariate Shift. ICML 2012
[c58]
Ryan Kiros, Csaba Szepesvári: Deep Representations and Codes for Image Auto-Annotation. NIPS 2012: 917-925
[i12]
Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvári: PAC-Bayesian Policy Evaluation for Reinforcement Learning. CoRR abs/1202.3717 (2012)
[i11]
Arash Afkanpour, András György, Csaba Szepesvári, Michael H. Bowling: A Randomized Strategy for Learning to Combine Many Features. CoRR abs/1205.0288 (2012)
[i10]
Alejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner: Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions. CoRR abs/1206.3233 (2012)
[i9]
Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. CoRR abs/1206.3285 (2012)
[i8]
Yaoliang Yu, Csaba Szepesvári: Analysis of Kernel Mean Matching under Covariate Shift. CoRR abs/1206.4650 (2012)
[i7]
Gergely Neu, Csaba Szepesvári: Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods. CoRR abs/1206.5264 (2012)
2011
[j33]
Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: X-Armed Bandits. Journal of Machine Learning Research 12: 1655-1695 (2011)
[j32]
Yasin Abbasi-Yadkori, Csaba Szepesvári: Regret Bounds for the Adaptive Control of Linear Quadratic Systems. Journal of Machine Learning Research - Proceedings Track 19: 1-26 (2011)
[j31]
Gábor Bartók, Dávid Pál, Csaba Szepesvári: Minimax Regret of Finite Partial-Monitoring Games in Stochastic Environments. Journal of Machine Learning Research - Proceedings Track 19: 133-154 (2011)
[j30]
István Szita, Csaba Szepesvári: Agnostic KWIK learning and efficient approximate reinforcement learning. Journal of Machine Learning Research - Proceedings Track 19: 739-772 (2011)
[j29]
Amir Massoud Farahmand, Csaba Szepesvári: Model selection in reinforcement learning. Machine Learning 85(3): 299-332 (2011)
[c57]
Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann: Editors' Introduction. ALT 2011: 1-13
[c56]
Csaba Szepesvári: Invited Talk: Towards Robust Reinforcement Learning Algorithms. EWRL 2011: 4
[c55]
Pallavi Arora, Csaba Szepesvári, Rong Zheng: Sequential learning for optimal monitoring of multi-channel wireless networks. INFOCOM 2011: 1152-1160
[c54]
Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Improved Algorithms for Linear Stochastic Bandits. NIPS 2011: 2312-2320
[c53]
Mahdi Milani Fard, Joelle Pineau, Csaba Szepesvári: PAC-Bayesian Policy Evaluation for Reinforcement Learning. UAI 2011: 195-202
[e1]
Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann (Eds.): Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings. Lecture Notes in Computer Science 6925, Springer 2011, isbn 978-3-642-24411-7
[i6]
András Antos, Gábor Bartók, Dávid Pál, Csaba Szepesvári: Toward a Classification of Finite Partial-Monitoring Games. CoRR abs/1102.2041 (2011)
[i5]
Yasin Abbasi-Yadkori, Dávid Pál, Csaba Szepesvári: Online Least Squares Estimation with Self-Normalized Processes: An Application to Bandit Problems. CoRR abs/1102.2670 (2011)
[i4]
András Antos, Gábor Bartók, Csaba Szepesvári: Non-trivial two-armed partial-monitoring games are bandits. CoRR abs/1108.4961 (2011)
[i3]
Arash Afkanpour, Csaba Szepesvári, Michael H. Bowling: Alignment Based Kernel Learning with a Continuous Set of Base Kernels. CoRR abs/1112.4607 (2011)
2010
[b1]
[j28]
Gábor Bartók, Csaba Szepesvári, Sandra Zilles: Models of active learning in group-structured state spaces. Inf. Comput. 208(4): 364-384 (2010)
[j27]
Barnabás Póczos, Sergey Kirshner, Csaba Szepesvári: REGO: Rank-based Estimation of Renyi Information using Euclidean Graph Optimization. Journal of Machine Learning Research - Proceedings Track 9: 605-612 (2010)
[j26]
Péter Torma, András György, Csaba Szepesvári: A Markov-Chain Monte Carlo Approach to Simultaneous Localization and Mapping. Journal of Machine Learning Research - Proceedings Track 9: 852-859 (2010)
[j25]
András Antos, Varun Grover, Csaba Szepesvári: Active learning in heteroscedastic noise. Theor. Comput. Sci. 411(29-30): 2712-2728 (2010)
[c52]
Gábor Bartók, Dávid Pál, Csaba Szepesvári: Toward a Classification of Finite Partial-Monitoring Games. ALT 2010: 224-238
[c51]
Gergely Neu, András György, Csaba Szepesvári: The Online Loop-free Stochastic Shortest-Path Problem. COLT 2010: 231-243
[c50]
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Richard S. Sutton: Toward Off-Policy Learning Control with Function Approximation. ICML 2010: 719-726
[c49]
Liuyang Li, Barnabás Póczos, Csaba Szepesvári, Russell Greiner: Budgeted Distribution Learning of Belief Net Parameters. ICML 2010: 879-886
[c48]
Istvan Szita, Csaba Szepesvári: Model-based reinforcement learning with nearly tight exploration complexity bounds. ICML 2010: 1031-1038
[c47]
Yasin Abbasi-Yadkori, Joseph Modayil, Csaba Szepesvári: Extending rapidly-exploring random trees for asymptotically optimal anytime motion planning. IROS 2010: 127-132
[c46]
Amir Massoud Farahmand, Rémi Munos, Csaba Szepesvári: Error Propagation for Approximate Policy and Value Iteration. NIPS 2010: 568-576
[c45]
Sarah Filippi, Olivier Cappé, Aurélien Garivier, Csaba Szepesvári: Parametric Bandits: The Generalized Linear Case. NIPS 2010: 586-594
[c44]
Gergely Neu, András György, Csaba Szepesvári, András Antos: Online Markov Decision Processes under Bandit Feedback. NIPS 2010: 1804-1812
[c43]
Dávid Pál, Barnabás Póczos, Csaba Szepesvári: Estimation of Renyi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs. NIPS 2010: 1849-1857
[i2]
Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: X-Armed Bandits. CoRR abs/1001.4475 (2010)
[i1]
Dávid Pál, Barnabás Póczos, Csaba Szepesvári: Estimation of Rényi Entropy and Mutual Information Based on Generalized Nearest-Neighbor Graphs. CoRR abs/1003.1954 (2010)
2009
[j24]
Yuxi Li, Csaba Szepesvári, Dale Schuurmans: Learning Exercise Policies for American Options. Journal of Machine Learning Research - Proceedings Track 5: 352-359 (2009)
[j23]
Gergely Neu, Csaba Szepesvári: Training parsers by inverse reinforcement learning. Machine Learning 77(2-3): 303-337 (2009)
[j22]
Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Exploration-exploitation tradeoff using variance estimates in multi-armed bandits. Theor. Comput. Sci. 410(19): 1876-1902 (2009)
[c42]
Hengshuai Yao, Shalabh Bhatnagar, Csaba Szepesvári: LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS. CDC 2009: 1181-1188
[c41]
Alireza Farhangfar, Russell Greiner, Csaba Szepesvári: Learning to segment from a few well-selected training images. ICML 2009: 39
[c40]
Barnabás Póczos, Yasin Abbasi-Yadkori, Csaba Szepesvári, Russell Greiner, Nathan R. Sturtevant: Learning when to stop thinking and do something! ICML 2009: 104
[c39]
Richard S. Sutton, Hamid Reza Maei, Doina Precup, Shalabh Bhatnagar, David Silver, Csaba Szepesvári, Eric Wiewiora: Fast gradient-descent methods for temporal-difference learning with linear function approximation. ICML 2009: 125
[c38]
Jean-Yves Audibert, Peter Auer, Alessandro Lazaric, Rémi Munos, Daniil Ryabko, Csaba Szepesvári: Workshop summary: On-line learning with limited feedback. ICML 2009: 168
[c37]
Amir Massoud Farahmand, Azad Shademan, Martin Jägersand, Csaba Szepesvári: Model-based and model-free reinforcement learning for visual servoing. ICRA 2009: 2917-2924
[c36]
Hamid Reza Maei, Csaba Szepesvári, Shalabh Bhatnagar, Doina Precup, David Silver, Richard S. Sutton: Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation. NIPS 2009: 1204-1212
[c35]
Hengshuai Yao, Richard S. Sutton, Shalabh Bhatnagar, Diao Dongcui, Csaba Szepesvári: Multi-Step Dyna Planning for Policy Evaluation and Control. NIPS 2009: 2187-2195
[c34]
Yaoliang Yu, Yuxi Li, Dale Schuurmans, Csaba Szepesvári: A General Projection Property for Distribution Families. NIPS 2009: 2232-2240
2008
[j21]
Rémi Munos, Csaba Szepesvári: Finite-Time Bounds for Fitted Value Iteration. Journal of Machine Learning Research 9: 815-857 (2008)
[j20]
András Antos, Csaba Szepesvári, Rémi Munos: Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path. Machine Learning 71(1): 89-129 (2008)
[c33]
András Antos, Varun Grover, Csaba Szepesvári: Active Learning in Multi-armed Bandits. ALT 2008: 287-302
[c32]
Gábor Bartók, Csaba Szepesvári, Sandra Zilles: Active Learning of Group-Structured Environments. ALT 2008: 329-343
[c31]
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Fitted Q-Iteration: Application to Planning. EWRL 2008: 55-68
[c30]
Volodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert: Empirical Bernstein stopping. ICML 2008: 672-679
[c29]
Sébastien Bubeck, Rémi Munos, Gilles Stoltz, Csaba Szepesvári: Online Optimization in X-Armed Bandits. NIPS 2008: 201-208
[c28]
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor: Regularized Policy Iteration. NIPS 2008: 441-448
[c27]
Richard S. Sutton, Csaba Szepesvári, Hamid Reza Maei: A Convergent O(n) Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation. NIPS 2008: 1609-1616
[c26]
Alejandro Isaza, Csaba Szepesvári, Vadim Bulitko, Russell Greiner: Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstraction. UAI 2008: 306-314
[c25]
Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling: Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping. UAI 2008: 528-536
2007
[c24]
Jean-Yves Audibert, Rémi Munos, Csaba Szepesvári: Tuning Bandit Algorithms in Stochastic Environments. ALT 2007: 150-165
[c23]
Peter Auer, Ronald Ortner, Csaba Szepesvári: Improved Rates for the Stochastic Continuum-Armed Bandit Problem. COLT 2007: 454-468
[c22]
Amir Massoud Farahmand, Csaba Szepesvári, Jean-Yves Audibert: Manifold-adaptive dimension estimation. ICML 2007: 265-272
[c21]
András György, Levente Kocsis, Ivett Szabó, Csaba Szepesvári: Continuous Time Associative Bandit Problems. IJCAI 2007: 830-835
[c20]
István Bíró, Zoltán Szamonek, Csaba Szepesvári: Sequence Prediction Exploiting Similarity Information. IJCAI 2007: 1576-1581
[c19]
András Antos, Rémi Munos, Csaba Szepesvári: Fitted Q-iteration in continuous action-space MDPs. NIPS 2007
[c18]
Gergely Neu, Csaba Szepesvári: Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods. UAI 2007: 295-302
2006
[j19]
Péter Torma, Csaba Szepesvári: Local Importance Sampling: A Novel Technique to Enhance Particle Filtering. Journal of Multimedia 1(1): 32-43 (2006)
[j18]
Levente Kocsis, Csaba Szepesvári: Universal parameter optimisation in games based on SPSA. Machine Learning 63(3): 249-286 (2006)
[c17]
Levente Kocsis, Csaba Szepesvári, Mark H. M. Winands: RSPSA: Enhanced Parameter Optimization in Games. ACG 2006: 39-56
[c16]
András Antos, Csaba Szepesvári, Rémi Munos: Learning Near-Optimal Policies with Bellman-Residual Minimization Based Fitted Policy Iteration and a Single Sample Path. COLT 2006: 574-588
[c15]
Levente Kocsis, Csaba Szepesvári: Bandit Based Monte-Carlo Planning. ECML 2006: 282-293
2005
[c14]
Zoltán Szamonek, Csaba Szepesvári: X-mHMM: An Efficient Algorithm for Training Mixtures of HMMs When the Number of Mixtures Is Unknown. ICDM 2005: 434-441
[c13]
Csaba Szepesvári, Rémi Munos: Finite time bounds for sampling based fitted value iteration. ICML 2005: 880-887
2004
[c12]
Csaba Szepesvári: Shortest Path Discovery Problems: A Framework, Algorithms and Experimental Results. AAAI 2004: 550-555
[c11]
Csaba Szepesvári, András Kocsor, Kornél Kovács: Kernel Machine Based Feature Extraction Algorithms for Regression Problems. ECAI 2004: 1091-1092
[c10]
Péter Torma, Csaba Szepesvári: Enhancing Particle Filters Using Local Likelihood Sampling. ECCV (1) 2004: 16-27
[c9]
András Kocsor, Kornél Kovács, Csaba Szepesvári: Margin Maximizing Discriminant Analysis. ECML 2004: 227-238
[c8]
Csaba Szepesvári, William D. Smart: Interpolation-based Q-learning. ICML 2004
2002
[j17]
Mark French, Csaba Szepesvári, Eric Rogers: LQ performance bounds for adaptive output feedback controllers for functionally uncertain nonlinear systems. Automatica 38(4): 683-693 (2002)
[j16]
Mark French, Csaba Szepesvári, Eric Rogers: An Asymptotic Scaling Analysis of LQ Performance for an Approximate Adaptive Control Design. MCSS 15(2): 145-176 (2002)
2001
[j15]
Csaba Szepesvári: Efficient approximate planning in continuous space Markovian Decision Problems. AI Commun. 14(3): 163-176 (2001)
[j14]
András Lörincz, György Hévízi, Csaba Szepesvári: Ockham's Razor Modeling of the Matrisome Channels of the Basal Ganglia Thalamocortical Loops. Int. J. Neural Syst. 11(2): 125-143 (2001)
2000
[j13]
Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Modular Reinforcement Learning: A Case Study in a Robot Domain. Acta Cybern. 14(3): 507-522 (2000)
[j12]
Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, Csaba Szepesvári: Convergence Results for Single-Step On-Policy Reinforcement-Learning Algorithms. Machine Learning 38(3): 287-308 (2000)
[c7]
György Balogh, Ervin Dobler, Tamás Gröbler, Béla Smodics, Csaba Szepesvári: FlexVoice: A Parametric Approach to High-Quality Speech Synthesis. TSD 2000: 189-194
1999
[j11]
János Murvai, Kristian Vlahovicek, Endre Barta, Csaba Szepesvári, Cristina Acatrinei, Sándor Pongor: The SBASE protein domain library, release 6.0: a collection of annotated protein sequence segments. Nucleic Acids Research 27(1): 257-259 (1999)
[j10]
Csaba Szepesvári, Michael L. Littman: A Unified Analysis of Value-Function-Based Reinforcement Learning Algorithms. Neural Computation 11(8): 2017-2060 (1999)
[j9]
Zsolt Kalmár, Zsolt Marczell, Csaba Szepesvári, András Lörincz: Parallel and robust skeletonization built on self-organizing elements. Neural Networks 12(1): 163-173 (1999)
1998
[j8]
Csaba Szepesvári: Non-Markovian Policies in Sequential Decision Problems. Acta Cybern. 13(3): 305-318 (1998)
[j7]
Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Auton. Robots 5(3-4): 273-295 (1998)
[j6]
Csaba Szepesvári, András Lörincz: An integrated architecture for motion-control and path-planning. J. Field Robotics 15(1): 1-15 (1998)
[j5]
Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Module-Based Reinforcement Learning: Experiments with a Real Robot. Machine Learning 31(1-3): 55-85 (1998)
[c6]
Zoltán Gábor, Zsolt Kalmár, Csaba Szepesvári: Multi-criteria Reinforcement Learning. ICML 1998: 197-205
1997
[j4]
Csaba Szepesvári, Szabolcs Cimmer, András Lörincz: Neurocontroller using dynamic state feedback for compensatory control. Neural Networks 10(9): 1691-1708 (1997)
[c5]
Csaba Szepesvári: Learning and Exploitation Do Not Conflict Under Minimax Optimality. ECML 1997: 242-249
[c4]
Zsolt Kalmár, Csaba Szepesvári, András Lörincz: Module Based Reinforcement Learning: An Application to a Real Robot. EWLR 1997: 29-45
[c3]
Csaba Szepesvári: The Asymptotic Convergence-Rate of Q-learning. NIPS 1997
1996
[j3]
Tibor Fomin, Tamás Rozgonyi, Csaba Szepesvári, András Lörincz: Self-Organizing Multi-Resolution Grid for Motion Planning and Control. Int. J. Neural Syst. 7(6): 757- (1996)
[j2]
Csaba Szepesvári, András Lörincz: Approximate geometry representations and sensory fusion. Neurocomputing 12(2-3): 267-287 (1996)
[c2]
Csaba Szepesvári, András Lörincz: Inverse Dynamics Controllers for Robust Control: Consequences for Neurocontrollers. ICANN 1996: 791-796
[c1]
Michael L. Littman, Csaba Szepesvári: A Generalized Reinforcement-Learning Model: Convergence and Applications. ICML 1996: 310-318
1994
[j1]
Csaba Szepesvári, László Balázs, András Lörincz: Topology Learning Solved by Extended Objects: A Neural Network Model. Neural Computation 6(3): 441-458 (1994)

Coauthor Index

1. Yasin Abbasi-Yadkori
[i13] [j35] [j32] [c54] [i5] [c47] [c40]
2. Cristina Acatrinei
[j11]
3. Arash Afkanpour
[i11] [i3]
4. András Antos
[j37] [i6] [i4] [j25] [c44] [j20] [c33] [c19] [c16]
5. Pallavi Arora
[c55]
6. Jean-Yves Audibert
[j22] [c38] [c30] [c24] [c22]
7. Peter Auer
[c38] [c23]
8. György Balogh
[c7]
9. László Balázs
[j1]
10. Endre Barta
[j11]
11. Peter L. Bartlett
[i13]
12. Gábor Bartók
[j37] [c62] [c61] [j31] [i6] [i4] [j28] [c52] [c32]
13. Shalabh Bhatnagar
[c50] [c42] [c39] [c36] [c35]
14. Michael H. Bowling (Michael Bowling)
[i11] [i9] [i3] [c25]
15. Sébastien Bubeck
[j33] [i2] [c29]
16. Vadim Bulitko
[i10] [c26]
17. István Bíró
[c20]
18. Olivier Cappé
[c45]
19. Szabolcs Cimmer
[j4]
20. Ervin Dobler
[c7]
21. Diao Dongcui
[c35]
22. Amir Massoud Farahmand
[j29] [c46] [c37] [c31] [c28] [c22]
23. Mahdi Milani Fard
[i12] [c53]
24. Alireza Farhangfar
[c41]
25. Sarah Filippi
[c45]
26. Tibor Fomin
[j3]
27. Mark French
[j17] [j16]
28. Aurélien Garivier
[c45]
29. Sylvain Gelly
[j36]
30. Alborz Geramifard
[i9] [c25]
31. Mohammad Ghavamzadeh
[c31] [c28]
32. Russell Greiner
[i10] [c49] [c41] [c40] [c26]
33. Varun Grover
[j25] [c33]
34. Tamás Gröbler
[c7]
35. András György
[j34] [i11] [j26] [c51] [c44] [c21]
36. Zoltán Gábor
[c6]
37. György Hévízi
[j14]
38. Alejandro Isaza
[i10] [c26]
39. Tommi Jaakkola
[j12]
40. Martin Jägersand
[c37]
41. Zsolt Kalmár
[j13] [j9] [j7] [j5] [c6] [c4]
42. Ryan Kiros
[c58]
43. Sergey Kirshner
[j27]
44. Jyrki Kivinen
[c57] [e1]
45. Levente Kocsis
[j36] [c21] [j18] [c17] [c15]
46. András Kocsor
[c11] [c9]
47. Kornél Kovács
[c11] [c9]
48. Alessandro Lazaric
[c38]
49. Liuyang Li
[c49]
50. Yuxi Li
[j24] [c34]
51. Michael L. Littman
[j12] [j10] [c1]
52. András Lörincz
[j14] [j13] [j9] [j7] [j6] [j5] [j4] [c4] [j3] [j2] [c2] [j1]
53. Hamid Reza Maei
[c50] [c39] [c36] [c27]
54. Shie Mannor
[c31] [c28]
55. Zsolt Marczell
[j9]
56. Volodymyr Mnih
[c30]
57. Joseph Modayil
[c47]
58. Rémi Munos
[j33] [c46] [i2] [j22] [c38] [j21] [j20] [c29] [c24] [c19] [c16] [c13]
59. János Murvai
[j11]
60. Gergely Neu
[j34] [i7] [c51] [c44] [j23] [c18]
61. Ronald Ortner
[c23]
62. Joelle Pineau
[i12] [c53]
63. Bernardo Avila Pires
[c60]
64. Sándor Pongor
[j11]
65. Doina Precup
[c39] [c36]
66. Dávid Pál
[j37] [j35] [j31] [c54] [i6] [i5] [c52] [c43] [i1]
67. Barnabás Póczos
[j27] [c49] [c43] [i1] [c40]
68. Eric Rogers
[j17] [j16]
69. Tamás Rozgonyi
[j3]
70. Daniil Ryabko
[c38]
71. Marc Schoenauer
[j36]
72. Dale Schuurmans
[j24] [c34]
73. Michèle Sebag
[j36]
74. Azad Shademan
[c37]
75. David Silver
[j36] [c39] [c36]
76. Satinder P. Singh
[j12]
77. William D. Smart
[c8]
78. Béla Smodics
[c7]
79. Gilles Stoltz
[j33] [i2] [c29]
80. Nathan R. Sturtevant
[c40]
81. Richard S. Sutton
[i9] [c50] [c39] [c36] [c35] [c27] [c25]
82. Ivett Szabó
[c21]
83. Zoltán Szamonek
[c20] [c14]
84. István Szita (Istvan Szita)
[j30] [c48]
85. Olivier Teytaud
[j36]
86. Péter Torma
[j26] [j19] [c10]
87. Esko Ukkonen
[c57] [e1]
88. Kristian Vlahovicek
[j11]
89. Eric Wiewiora
[c39]
90. Mark H. M. Winands
[c17]
91. Hengshuai Yao
[c63] [c42] [c35]
92. Yaoliang Yu
[c59] [i8] [c34]
93. Thomas Zeugmann
[c57] [e1]
94. Rong Zheng
[c55]
95. Sandra Zilles
[j28] [c32]
96. Navid Zolghadr
[c61]


Last update Thu May 23 21:06:07 2013 CET by the DBLP Team. This material is Open Data released under the ODC-BY 1.0 license.