Accéder directement au contenu

Odalric-Ambrym Maillard

25
Documents

Publications

42232
Image document

From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits

Dorian Baudry , Patrick Saux , Odalric-Ambrym Maillard
NeurIPS 2021 - 35th International Conference on Neural Information Processing Systems, Dec 2021, Sydney, Australia
Communication dans un congrès hal-03421252v2
Image document

Indexed Minimum Empirical Divergence for Unimodal Bandits

Hassan Saber , Pierre Ménard , Odalric-Ambrym Maillard
NeurIPS 2021 - International Conference on Neural Information Processing Systems, Dec 2021, Virtual-only Conference, United States
Communication dans un congrès hal-03446617v1
Image document

Reinforcement Learning in Parametric MDPs with Exponential Families

Sayak Ray Chowdhury , Aditya Gopalan , Odalric-Ambrym Maillard
International Conference on Artificial Intelligence and Statistics, 2021, San diego, United States. pp.1855-1863
Communication dans un congrès hal-03472116v1
Image document

Stochastic bandits with groups of similar arms

Fabien Pesquerel , Hassan Saber , Odalric-Ambrym Maillard
NeurIPS 2021 - Thirty-fifth Conference on Neural Information Processing Systems, Dec 2021, Sydney, Australia
Communication dans un congrès hal-03427597v1
Image document

Optimal Thompson Sampling strategies for support-aware CVaR bandits

Dorian Baudry , Romain Gautron , Emilie Kaufmann , Odalric-Ambrym Maillard
38th International Conference on Machine Learning, Jul 2021, Virtual, United States
Communication dans un congrès hal-03447244v1
Image document

Sub-sampling for Efficient Non-Parametric Bandit Exploration

Dorian Baudry , Emilie Kaufmann , Odalric-Ambrym Maillard
NeurIPS 2020, Dec 2020, Vancouver, Canada
Communication dans un congrès hal-02977552v1
Image document

Restarted Bayesian Online Change-point Detector achieves Optimal Detection Delay

Réda Alami , Odalric-Ambrym Maillard , Raphael Féraud
International Conference on Machine Learning, Jul 2020, Wien, Austria
Communication dans un congrès hal-03021712v1
Image document

Tightening Exploration in Upper Confidence Reinforcement Learning

Hippolyte Bourel , Odalric-Ambrym Maillard , Mohammad Sadegh Talebi
International Conference on Machine Learning, Jul 2020, Vienna, Austria
Communication dans un congrès hal-03000664v1
Image document

Model-Based Reinforcement Learning Exploiting State-Action Equivalence

Mahsa Asadi , Mohammad Sadegh Talebi , Hippolyte Bourel , Odalric-Ambrym Maillard
ACML 2019, Proceedings of Machine Learning Research, Nov 2019, Nagoya, Japan. pp.204 - 219
Communication dans un congrès hal-02378887v1
Image document

Learning Multiple Markov Chains via Adaptive Allocation

Mohammad Sadegh Talebi , Odalric-Ambrym Maillard
Advances in Neural Information Processing Systems 32 (NIPS 2019), Dec 2019, Vancouver, Canada
Communication dans un congrès hal-02387345v1
Image document

Regret Bounds for Learning State Representations in Reinforcement Learning

Ronald Ortner , Matteo Pirotta , Ronan Fruit , Alessandro Lazaric , Odalric-Ambrym Maillard
Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada
Communication dans un congrès hal-02375715v1
Image document

Budgeted Reinforcement Learning in Continuous State Space

Nicolas Carrara , Edouard Leurent , Romain Laroche , Tanguy Urvoy , Odalric-Ambrym Maillard
Conference on Neural Information Processing Systems, Dec 2019, Vancouver, Canada
Communication dans un congrès hal-02375727v1
Image document

Sequential change-point detection: Laplace concentration of scan statistics and non-asymptotic delay bounds

Odalric-Ambrym Maillard
Algorithmic Learning Theory, 2019, Chicago, United States. pp.1 - 23
Communication dans un congrès hal-02351665v1
Image document

Practical Open-Loop Optimistic Planning

Edouard Leurent , Odalric-Ambrym Maillard
European Conference on Machine Learning, Sep 2019, Würzburg, Germany
Communication dans un congrès hal-02375697v1
Image document

Boundary Crossing for General Exponential Families

Odalric-Ambrym Maillard
Algorithmic Learning Theory, Oct 2017, Kyoto, Japan. pp.1 - 34
Communication dans un congrès hal-01615427v1
Image document

Spectral Learning from a Single Trajectory under Finite-State Policies

Borja Balle , Odalric-Ambrym Maillard
International conference on Machine Learning, Jul 2017, Sidney, France
Communication dans un congrès hal-01590940v1
Image document

Efficient tracking of a growing number of experts

Jaouad Mourtada , Odalric-Ambrym Maillard
Algorithmic Learning Theory, Oct 2017, Tokyo, Japan. pp.1 - 23
Communication dans un congrès hal-01615424v1