Number of documents

21

Matthieu Geist


2011   

Journal articles1 document

  • Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan, Hervé Frezza-Buet. Sample-Efficient Batch Reinforcement Learning for Dialogue Management Optimization. ACM - Transactions on Speech and Language Processing, Association for Computing Machinery, 2011, 7 (3), pp.art. 7 (1-21). ⟨10.1145/1966407.1966412⟩. ⟨hal-00617517⟩

Conference papers19 documents

  • Remi Chou, Yvo Boers, Martin Podt, Matthieu Geist. Performance evaluation for particle filters. FUSION 2011, Jul 2011, Chicago, United States. pp.1-7. ⟨hal-00652168⟩
  • Olivier Pietquin, Matthieu Geist, Senthilkumar Chandramohan. Sample Efficient On-line Learning of Optimal Dialogue Policies with Kalman Temporal Differences. IJCAI 2011, Jul 2011, Barcelona, Spain. pp.1878-1883. ⟨hal-00618252⟩
  • Lucie Daubigney, Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin. Gestion de l'incertitude pour l'optimisation en ligne d'un gestionnaire de dialogues parlés à grande échelle basé sur les POMDP. JFPDA 2011, Jun 2011, Rouen, France. pp.1-7. ⟨hal-00652511⟩
  • Senthilkumar Chandramohan, Matthieu Geist, Fabrice Lefèvre, Olivier Pietquin. User Simulation in Dialogue Systems using Inverse Reinforcement Learning. Interspeech 2011, Aug 2011, Florence, Italy. pp.1025-1028. ⟨hal-00652446⟩
  • Hadrien Glaude, Fadi Akrimi, Matthieu Geist, Olivier Pietquin. A Non-Parametric Approach to Approximate Dynamic Programming. ICMLA 2011, Dec 2011, Honolulu, Hawaii, United States. pp.1-6. ⟨hal-00652438⟩
  • Edouard Klein, Matthieu Geist, Olivier Pietquin. Batch, Off-policy and Model-Free Apprenticeship Learning. IJCAI Workshop on Agents Learning Interactively from Human Teachers (ALIHT 2011), Jun 2011, Barcelona, Spain. 6 p. ⟨hal-00596370⟩
  • Matthieu Geist, Olivier Pietquin. Parametric value function approximation: A unified view. ADPRL 2011, Apr 2011, Paris, France. pp.9-16, ⟨10.1109/ADPRL.2011.5967355⟩. ⟨hal-00618112⟩
  • Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin. Apprentissage par Renforcement Inverse pour la Simulation d'Utilisateurs dans les Systèmes de Dialogue. JFPDA 2011, Jun 2011, Rouen, France. pp.1-7. ⟨hal-00652753⟩
  • Lucie Daubigney, Milica Gašić, Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin, et al.. Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system. Interspeech 2011, Aug 2011, Florence, Italy. pp.1301-1304. ⟨hal-00652194⟩
  • Olivier Pietquin, Lucie Daubigney, Matthieu Geist. Optimization of a Tutoring System from a Fixed Set of Data. SLaTE 2011, Aug 2011, Venice, Italy. pp.1-4. ⟨hal-00652324⟩
  • Lucie Daubigney, Matthieu Geist, Olivier Pietquin. Apprentissage par renforcement pour la personnalisation d'un logiciel d'enseignement des langues. EIAH 2011, May 2011, Mons, Belgique. pp.1-5. ⟨hal-00652516⟩
  • Matthieu Geist, Olivier Pietquin. Kalman filtering & colored noises: the (autoregressive) moving-average case. MLASA 2011, Dec 2011, Honolulu, United States. pp.1-4. ⟨hal-00660607⟩
  • Edouard Klein, Matthieu Geist, Olivier Pietquin. Reducing the dimentionality of the reward space in the Inverse Reinforcement Learning problem. MLASA 2011, Dec 2011, Honolulu, United States. pp.1-4. ⟨hal-00660612⟩
  • Jérémy Fix, Matthieu Geist, Olivier Pietquin, Hervé Frezza-Buet. Dynamic Neural Field Optimization using the Unscented Kalman Filter. CCMB 2011, Apr 2011, Paris, France. 7 p., ⟨10.1109/CCMB.2011.5952113⟩. ⟨hal-00618117⟩
  • Edouard Klein, Matthieu Geist, Olivier Pietquin. Apprentissage par imitation dans un cadre batch, off-policy et sans modèle. JFPDA 2011, Jun 2011, Rouen, France. pp.1-9. ⟨hal-00652762⟩
  • Edouard Klein, Matthieu Geist, Olivier Pietquin. Batch, Off-policy and Model-free Apprenticeship Learning. EWRL 2011, Sep 2011, Athens, Greece. pp.1-12. ⟨hal-00660623⟩
  • Bruno Scherrer, Matthieu Geist. Moindres carrés récursifs pour l'évaluation off-policy d'une politique avec traces d'éligibilité. 6ème Journées Francophones de Planification, Décision et Apprentissage pour la conduite de systèmes - JFPDA 2011, Jun 2011, Rouen, France. ⟨hal-00644874⟩
  • Bruno Scherrer, Matthieu Geist. Recursive Least-Squares Learning with Eligibility Traces. European Wrokshop on Reinforcement Learning (EWRL 11), Sep 2011, Athens, Greece. ⟨hal-00644511⟩
  • Matthieu Geist, Bruno Scherrer. l1-penalized projected Bellman residual. European Wrokshop on Reinforcement Learning (EWRL 11), Sep 2011, Athens, Greece. ⟨hal-00644507⟩

Reports1 document

  • Filip Jurcicek, Milica Gašić, Steve Young, Ghislain Putois, Romain Laroche, et al.. Online adaptation of dialogue systems. 2011. ⟨hal-00652841⟩