| 2013 | ||
|---|---|---|
| j6 | Matthieu Geist, Olivier Pietquin: Algorithmic Survey of Parametric Value Function Approximation. IEEE Trans. Neural Netw. Learning Syst. 24(6): 845-867 (2013) | |
| i2 | Matthieu Geist, Bruno Scherrer: Off-policy Learning with Eligibility Traces: A Survey. CoRR abs/1304.3999 (2013) | |
| 2012 | ||
| j5 | Lucie Daubigney, Matthieu Geist, Senthilkumar Chandramohan, Olivier Pietquin: A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization. J. Sel. Topics Signal Processing 6(8): 891-902 (2012) | |
| c22 | Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin: Behavior Specific User Simulation in Spoken Dialogue Systems. ITG Conference on Speech Communication 2012: 1-4 | |
| c21 | ||
| c20 | Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin: Clustering behaviors of Spoken Dialogue Systems users. ICASSP 2012: 4981-4984 | |
| c19 | Lucie Daubigney, Matthieu Geist, Olivier Pietquin: Off-policy learning in large-scale POMDP-based dialogue systems. ICASSP 2012: 4989-4992 | |
| c18 | Matthieu Geist, Bruno Scherrer, Alessandro Lazaric, Mohammad Ghavamzadeh: A Dantzig Selector Approach to Temporal Difference Learning. ICML 2012 | |
| c17 | Bruno Scherrer, Victor Gabillon, Mohammad Ghavamzadeh, Matthieu Geist: Approximate Modified Policy Iteration. ICML 2012 | |
| c16 | Edouard Klein, Matthieu Geist, Bilal Piot, Olivier Pietquin: Inverse Reinforcement Learning through Structured Classification. NIPS 2012: 1016-1024 | |
| i1 | Bruno Scherrer, Victor Gabillon, Mohammad Ghavamzadeh, Matthieu Geist: Approximate Modified Policy Iteration. CoRR abs/1205.3054 (2012) | |
| 2011 | ||
| j4 | Matthieu Geist, Olivier Pietquin: Managing Uncertainty within KTD. Journal of Machine Learning Research - Proceedings Track 16: 157-168 (2011) | |
| j3 | Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan, Hervé Frezza-Buet: Sample-efficient batch reinforcement learning for dialogue management optimization. TSLP 7(3): 7 (2011) | |
| c15 | ||
| c14 | Bruno Scherrer, Matthieu Geist: Recursive Least-Squares Learning with Eligibility Traces. EWRL 2011: 115-127 | |
| c13 | Edouard Klein, Matthieu Geist, Olivier Pietquin: Batch, Off-Policy and Model-Free Apprenticeship Learning. EWRL 2011: 285-296 | |
| c12 | Hadrien Glaude, Fadi Akrimi, Matthieu Geist, Olivier Pietquin: A Non-parametric Approach to Approximate Dynamic Programming. ICMLA (1) 2011: 317-322 | |
| c11 | Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan: Sample Efficient On-Line Learning of Optimal Dialogue Policies with Kalman Temporal Differences. IJCAI 2011: 1878-1883 | |
| c10 | Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin: User Simulation in Dialogue Systems Using Inverse Reinforcement Learning. INTERSPEECH 2011: 1025-1028 | |
| c9 | Lucie Daubigney, Milica Gasic, Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin, Steve Young: Uncertainty Management for On-Line Optimisation of a POMDP-Based Large-Scale Spoken Dialogue System. INTERSPEECH 2011: 1301-1304 | |
| 2010 | ||
| j2 | Matthieu Geist, Olivier Pietquin: Kalman Temporal Differences. J. Artif. Intell. Res. (JAIR) 39: 483-532 (2010) | |
| j1 | Matthieu Geist, Olivier Pietquin, Gabriel Fricout: Différences temporelles de Kalman. Cas déterministe. Revue d'Intelligence Artificielle 24(4): 423-443 (2010) | |
| c8 | Matthieu Geist, Olivier Pietquin: Statistically linearized least-squares temporal differences. ICUMT 2010: 450-457 | |
| c7 | ||
| c6 | Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin: Optimizing spoken dialogue management with fitted value iteration. INTERSPEECH 2010: 86-89 | |
| c5 | Matthieu Geist, Olivier Pietquin: Revisiting Natural Actor-Critics with Value Function Approximation. MDAI 2010: 207-218 | |
| c4 | Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin: Sparse Approximate Dynamic Programming for Dialog Management. SIGDIAL Conference 2010: 107-115 | |
| 2009 | ||
| c3 | Matthieu Geist, Olivier Pietquin, Gabriel Fricout: Kernelizing Vector Quantization Algorithms. ESANN 2009 | |
| c2 | Matthieu Geist, Olivier Pietquin, Gabriel Fricout: Tracking in Reinforcement Learning. ICONIP (1) 2009: 502-511 | |
| 2008 | ||
| c1 | ||
Last update Thu May 23 07:15:24 2013 CET by the DBLP Team. Data released under the ODC-BY 1.0 license.