Recherche - Archive ouverte HAL Accéder directement au contenu

Filtrer vos résultats

9 résultats
Image document

Learning Nash Equilibrium for General-Sum Markov Games from Batch Data

Julien Pérolat , Florian Strub , Bilal Piot , Olivier Pietquin
AISTATS 2017 - The 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. pp.1-14
Communication dans un congrès hal-01648489v1
Image document

Softened approximate policy iteration for Markov games

Julien Pérolat , Bilal Piot , Matthieu Geist , Bruno Scherrer , Olivier Pietquin
ICML 2016 - 33rd International Conference on Machine Learning, Jun 2016, New York City, United States
Communication dans un congrès hal-01393328v1

Generalizing the Wilcoxon rank-sum test for interval data

Julien Perolat , Ines Couso , Kevin Loquin , Olivier Strauss
International Journal of Approximate Reasoning, 2015, 56, pp.108-121. ⟨10.1016/j.ijar.2014.08.001⟩
Article dans une revue lirmm-01278071v1
Image document

Reinforcement Learning: The Multi-Player Case

Julien Pérolat
Artificial Intelligence [cs.AI]. Université de Lille 1 - Sciences et Technologies, 2017. English. ⟨NNT : ⟩
Thèse tel-01820700v1
Image document

Actor-Critic Fictitious Play in Simultaneous Move Multistage Games

Julien Pérolat , Bilal Piot , Olivier Pietquin
AISTATS 2018 - 21st International Conference on Artificial Intelligence and Statistics, Apr 2018, Playa Blanca, Lanzarote, Canary Islands, Spain
Communication dans un congrès hal-01724227v1
Image document

Approximate dynamic programming for two-player zero-sum Markov games

Julien Perolat , Bruno Scherrer , Bilal Piot , Olivier Pietquin
International Conference on Machine Learning (ICML 2015), Jul 2015, Lille, France
Communication dans un congrès hal-01153270v3

Fictitious Play for Mean Field Games: Continuous Time Analysis and Applications

Sarah Perrin , Julien Pérolat , Mathieu Laurière , Matthieu Geist , Romuald Elie , et al.
2020
Pré-publication, Document de travail hal-02931977v1
Image document

Human-Machine Dialogue as a Stochastic Game

Merwan Barlier , Julien Perolat , Romain Laroche , Olivier Pietquin
16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), Sep 2015, Prague, Czech Republic
Communication dans un congrès hal-01225848v1

On the Use of Non-Stationary Strategies for Solving Two-Player Zero-Sum Markov Games

Julien Pérolat , Bilal Piot , Bruno Scherrer , Olivier Pietquin
19th International Conference on Artificial Intelligence and Statistics (AISTATS 2016), May 2016, Cadiz, Spain
Communication dans un congrès hal-01291495v1