Number of documents

18

Matthieu Geist


2013   

Journal articles3 documents

  • Matthieu Geist, Olivier Pietquin. An algorithmic Survey of Parametric Value Function Approximation. IEEE Transactions on Neural Networks and Learning Systems, IEEE, 2013, 24 (6), pp.845-867. ⟨10.1109/TNNLS.2013.2247418⟩. ⟨hal-00869725⟩
  • Edouard Klein, Bilal Piot, Matthieu Geist, Olivier Pietquin. Classification structurée pour l'apprentissage par renforcement inverse. Revue des Sciences et Technologies de l'Information - Série RIA : Revue d'Intelligence Artificielle, Lavoisier, 2013, 27 (2), pp.155-169. ⟨10.3166/ria.27.155-169⟩. ⟨hal-00869723⟩
  • Hervé Frezza-Buet, Matthieu Geist. A C++ Template-Based Reinforcement Learning Library: Fitting the Code to the Mathematics. Journal of Machine Learning Research, Microtome Publishing, 2013, 14 (1), pp.625-628. ⟨hal-00914768⟩

Conference papers12 documents

  • Bilal Piot, Matthieu Geist, Olivier Pietquin. Apprentissage par démonstrations : vaut-il la peine d'estimer une fonction de récompense?. Journées Francophones de Plannification, Décision et Apprentissage (JFPDA), Jul 2013, Lille, France. ⟨hal-00916941⟩
  • Bilal Piot, Matthieu Geist, Olivier Pietquin. Learning from demonstrations: Is it worth estimating a reward function?. 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Oct 2013, Princeton, New Jersey, United States. ⟨hal-00916938⟩
  • Edouard Klein, Bilal Piot, Matthieu Geist, Olivier Pietquin. A cascaded supervised learning approach to inverse reinforcement learning. Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD 2013), Sep 2013, Prague, Czech Republic. pp.1-16, ⟨10.1007/978-3-642-40988-2_1⟩. ⟨hal-00869804⟩
  • Radoslaw Niewiadomski, Jennifer Hofmann, Jérôme Urbain, Tracey Platt, Johannes Wagner, et al.. Laugh-aware virtual agent and its impact on user amusement. AAMAS '13, May 2013, Saint Paul, Minnesota, United States. pp.619-626. ⟨hal-00869751⟩
  • Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Model-free POMDP optimisation of tutoring systems with echo-state networks. SIGDial 2013, Aug 2013, Metz, France. pp.102-106. ⟨hal-00869773⟩
  • Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Optimisation par essaims particulaires de stratégies de dialogue. Journées Francophones de Plannification, Décision et Apprentissage (JFPDA), Jul 2013, Lille, France. ⟨hal-00918425⟩
  • Edouard Klein, Bilal Piot, Matthieu Geist, Olivier Pietquin. Apprentissage par renforcement inverse en cascadant classification et régression. Journées Francophones de Plannification, Décision et Apprentissage (JFPDA), Jul 2013, Lille, France. ⟨hal-00916942⟩
  • Matthieu Geist, Edouard Klein, Bilal Piot, Yann Guermeur, Olivier Pietquin. Around Inverse Reinforcement Learning and Score-based Classification. 1st Multidisciplinary Conference on Reinforcement Learning and Decision Making (RLDM 2013), Oct 2013, Princeton, New Jersey, United States. ⟨hal-00916936⟩
  • Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Random Projections: a Remedy for Overfitting Issues in Time Series Prediction with Echo State Networks. ICASSP 2013, May 2013, Vancouver, Canada. pp.3253-3257, ⟨10.1109/ICASSP.2013.6638259⟩. ⟨hal-00869814⟩
  • Bilal Piot, Matthieu Geist, Olivier Pietquin. Classification régularisée par la récompense pour l'Apprentissage par Imitation. Journées Francophones de Plannification, Décision et Apprentissage (JFPDA), Jul 2013, Lille, France. ⟨hal-00916940⟩
  • Bilal Piot, Matthieu Geist, Olivier Pietquin. Learning from Demonstrations: Is It Worth Estimating a Reward Function?. Joint European Conference on Machine Learning and Knowledge Discovery in Databases (ECML/PKDD 2013), Sep 2013, Prague, Czech Republic. pp.17-32, ⟨10.1007/978-3-642-40988-2_2⟩. ⟨hal-00869801⟩
  • Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Particle Swarm Optimisation of Spoken Dialogue System Strategies. Interspeech 2013, Aug 2013, Lyon, France. pp.1-5. ⟨hal-00916935⟩

Patents1 document

  • Gari Clifford, Julien Oster, Olivier Pietquin, Matthieu Geist. PERIODIC ARTIFACT REDUCTION FROM BIOMEDICAL SIGNALS. France, Patent n° : WO/2013/052944. 2013. ⟨hal-00869739⟩

Preprints, Working Papers, ...1 document

  • Bruno Scherrer, Matthieu Geist. Policy Search: Any Local Optimum Enjoys a Global Performance Guarantee. 2013. ⟨hal-00829548⟩

Reports1 document

  • Matthieu Geist, Bruno Scherrer. Off-policy Learning with Eligibility Traces: A Survey. [Research Report] 2013, pp.43. ⟨hal-00644516v2⟩