Number of documents: 16

Matthieu Geist


2010   

Journal articles (2 documents)

  • Matthieu Geist, Olivier Pietquin. Kalman Temporal Differences. Journal of Artificial Intelligence Research, Association for the Advancement of Artificial Intelligence, 2010, 39, pp.483-532. ⟨hal-00858687⟩
  • Matthieu Geist, Olivier Pietquin, Gabriel Fricout. Différences temporelles de Kalman: Cas déterministe. Revue des Sciences et Technologies de l'Information - Série RIA : Revue d'Intelligence Artificielle, Lavoisier, 2010, 24 (4), pp.423-443, ⟨10.3166/ria.24.423-443⟩. ⟨hal-00512093⟩

Conference papers (14 documents)

  • Matthieu Geist, Olivier Pietquin, Gabriel Fricout. Astuce du Noyau & Quantification Vectorielle. RFIA'10, Jan 2010, Caen, France. 8 p. ⟨hal-00553114⟩
  • Matthieu Geist, Olivier Pietquin. Revisiting natural actor-critics with value function approximation. BNAIC 2010, Oct 2010, Luxembourg, Luxembourg. 1 page. ⟨hal-00553175⟩
  • Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin. Optimizing Spoken Dialogue Management with Fitted Value Iteration. Interspeech 2010, Sep 2010, Makuhari, Japan. pp.86-89. ⟨hal-00553184⟩
  • Matthieu Geist, Olivier Pietquin. Managing Uncertainty within the KTD Framework. Active Learning and Experimental Design workshop in conjunction with AISTATS 2010, May 2010, Sardinia, Italy. pp.157-168. ⟨hal-00599636⟩
  • Matthieu Geist, Olivier Pietquin. Statistically Linearized Recursive Least Squares. MLSP 2010, Aug 2010, Kittilä, Finland. pp.272-276, ⟨10.1109/MLSP.2010.5589236⟩. ⟨hal-00553168⟩
  • Senthilkumar Chandramohan, Matthieu Geist, Olivier Pietquin. Sparse Approximate Dynamic Programming for Dialog Management. SIGDial 2010, Sep 2010, Tokyo, Japan. pp.107-115. ⟨hal-00553180⟩
  • Matthieu Geist, Olivier Pietquin. Managing Uncertainty within Value Function Approximation in Reinforcement Learning. Active Learning and Experimental Design workshop (collocated with AISTATS 2010), May 2010, Sardinia, Italy. ⟨hal-00554398⟩
  • Matthieu Geist, Olivier Pietquin. Revisiting natural actor-critics with value function approximation. 5èmes Journées Francophones de Planification, Décision et Apprentissage pour la conduite de systèmes (JFPDA'10), Jun 2010, Besançon, France. ⟨hal-00554346⟩
  • Matthieu Geist, Olivier Pietquin. Statistically Linearized Least-Squares Temporal Differences. 5èmes Journées Francophones de Planification, Décision et Apprentissage pour la conduite de systèmes (JFPDA'10), Jun 2010, Besançon, France. ⟨hal-00554338⟩
  • Matthieu Geist. Statistical Linearization for Value Function Approximation in Reinforcement Learning. NIPS Workshop on Learning and Planning from Batch Time Series Data (OPT 2010), Dec 2010, Vancouver, Canada. pp.1-6. ⟨hal-00554324⟩
  • Matthieu Geist, Olivier Pietquin. Revisiting Natural Actor-Critics with Value Function Approximation. MDAI 2010, Oct 2010, Perpignan, France. pp.207-218, ⟨10.1007/978-3-642-16292-3_21⟩. ⟨hal-00553870⟩
  • Matthieu Geist, Olivier Pietquin. Gestion de l'incertitude dans le cadre de l'approximation de la fonction de valeur pour l'apprentissage par renforcement. CAP 2010, May 2010, Clermont-Ferrand, France. pp.101-112. ⟨hal-00553895⟩
  • Matthieu Geist, Olivier Pietquin. Eligibility Traces through Colored Noises. ICUMT 2010, Oct 2010, Moscow, Russia. pp.458-465, ⟨10.1109/ICUMT.2010.5676597⟩. ⟨hal-00553910⟩
  • Matthieu Geist, Olivier Pietquin. Statistically linearized least-squares temporal differences. ICUMT 2010, Oct 2010, Moscow, Russia. pp.450-457, ⟨10.1109/ICUMT.2010.5676598⟩. ⟨hal-00553913⟩