Nombre de documents

30

CV de Aurélien Garivier


Document associé à des manifestations scientifiques3 documents

  • Aurelien Garivier. Choix de modèle pour les chaînes de Markov d'ordre variable. Journées MAS et Journée en l'honneur de Jacques Neveu, Aug 2010, Talence, France. <inria-00510200>
  • Aurelien Garivier. Apprentissage par renforcement. Journées MAS et Journée en l'honneur de Jacques Neveu, Aug 2010, Talence, France. <inria-00496719>
  • Sarah Filippi, Olivier Cappé, Aurelien Garivier. Optimisme en apprentissage par renforcement et divergence de Kullback-Leibler. Journées MAS et Journée en l'honneur de Jacques Neveu, Aug 2010, Talence, France. <inria-00510327>

Pré-publication, Document de travail7 documents

  • Emilie Kaufmann, Aurélien Garivier. Learning the distribution with largest mean: two bandit frameworks. 2017. <hal-01449822v2>
  • Pierre Ménard, Aurélien Garivier. A minimax and asymptotically optimal algorithm for stochastic bandits. 2017. <hal-01475078>
  • Aurélien Garivier, Pierre Ménard, Gilles Stoltz. Explore First, Exploit Next: The True Shape of Regret in Bandit Problems. 2016. <hal-01276324v2>
  • T Labopin-Richard, F Gamboa, A Garivier. CONDITIONAL QUANTILE SEQUENTIAL ESTIMATION FOR STOCHASTIC CODES. 2015. <hal-01187329v5>
  • Tatiana Labopin-Richard, Fabrice Gamboa, Aurélien Garivier, Bertrand Iooss. Bregman superquantiles. Estimation methods and applications.. 2014. <hal-00996440v8>
  • Randal Douc, Aurelien Garivier, Eric Moulines, Jimmy Olsson. On the Forward Filtering Backward Smoothing particle approximations of the smoothing distribution in general state spaces models. 2009. <hal-00370685>
  • Aurélien Garivier, Eric Moulines. On Upper-Confidence Bound Policies for Non-Stationary Bandit Problems. 24 pages. 2008. <hal-00281392>

Article dans une revue11 documents

  • Emilie Kaufmann, Olivier Cappé, Aurélien Garivier. On the Complexity of Best Arm Identification in Multi-Armed Bandit Models. Journal of Machine Learning Research, Journal of Machine Learning Research, 2016, 17, pp.1-42. <hal-01024894v2>
  • Aurélien Garivier. Perfect Simulation Of Processes With Long Memory: a 'Coupling Into And From The Past' Algorithm. Random Structures and Algorithms, Wiley, 2015, 46 (2), pp.300-319. <hal-00798388v2>
  • Philippe Besse, Aurélien Garivier, Jean-Michel Loubes. Big Data Analytics - Retour vers le Futur 3; De Statisticien à Data Scientist. Ingéniérie des Systèmes d'Information, Lavoisier, 2014, Systèmes d'information et big data, 19 (3), pp.93. <10.3166/isi.19.3.93-105>. <hal-00959267v3>
  • Olivier Cappé, Aurélien Garivier, Odalric-Ambrym Maillard, Rémi Munos, Gilles Stoltz. Kullback-Leibler Upper Confidence Bounds for Optimal Sequential Allocation. Annals of Statistics, Institute of Mathematical Statistics, 2013, 41 (3), pp.1516-1541. <hal-00738209v2>
  • Sébastien Bubeck, Damien Ernst, Aurélien Garivier. Optimal Discovery with Probabilistic Expert Advice: Finite Time Analysis and Macroscopic Optimality. Journal of Machine Learning Research, Journal of Machine Learning Research, 2013, 14, pp.601−623. <hal-00811860>
  • Antonio Galves, Aurélien Garivier, Elisabeth Gassiat. Joint estimation of intersecting context tree models. Scandinavian Journal of Statistics, Wiley, 2012, pp.early view. <hal-00738202>
  • Randal Douc, Aurelien Garivier, Eric Moulines, Jimmy Olsson. Sequential Monte Carlo smoothing for general state space hidden Markov models. Annals of Applied Probability, Institute of Mathematical Statistics (IMS), 2011, 21 (6), pp.2109-2145. <10.1214/10-AAP735>. <hal-00839311>
  • Sarah Filippi, Olivier Cappé, Aurélien Garivier. Optimally Sensing a Single Channel Without Prior Information: The Tiling Algorithm and Regret Bounds. IEEE Journal of Selected Topics in Signal Processing, IEEE, 2010, 5 (1), pp.68 - 76. <10.1109/JSTSP.2010.2058091>. <hal-00408867v2>
  • Stéphane Boucheron, Aurélien Garivier, Elisabeth Gassiat. Coding on countably infinite alphabets. IEEE Transactions on Information Theory, Institute of Electrical and Electronics Engineers, 2009, 55 (1), pp.358-373. <hal-00121892v2>
  • S. Boucheron, A. Garivier, E. Gassiat. Coding over infinite alphabets. IEEE Transactions on Information Theory, Institute of Electrical and Electronics Engineers, 2009, 55 (1), pp.358-373. <hal-00356674>
  • A. Garivier. A Lower-Bound for the Maximin Redundancy in Pattern Coding. Entropy, MDPI, 2009, 11 (4), pp.634-642. <hal-00479585>

Communication dans un congrès9 documents

  • Aurélien Garivier, Emilie Kaufmann. Optimal Best Arm Identification with Fixed Confidence. 29th Annual Conference on Learning Theory (COLT), Jun 2016, New York, United States. 49, 2016, JMLR Workshop and Conference Proceedings. <http://www.learningtheory.org/colt2016/>. <hal-01273838v2>
  • Aurélien Garivier, Emilie Kaufmann, Wouter Koolen. Maximin Action Identification: A New Bandit Framework for Games. 29th Annual Conference on Learning Theory (COLT), Jun 2016, New-York, United States. 49, JMLR Workshop and Conference Proceedings. <hal-01273842v2>
  • Aurélien Garivier, Emilie Kaufmann, Tor Lattimore. On Explore-Then-Commit Strategies. NIPS, Dec 2016, Barcelona, Spain. 29, 2016, Advances in Neural Information Processing Systems (NIPS). <hal-01322906v2>
  • Cecile Chouquet, Aurélien Garivier. Poursuite d'étude après un IUT STID : l'exemple du Cursus de Master en Ingénierie Statistique et Informatique Décisionnelle de Toulouse. Journées de Statistiques de la SFDS, Jun 2014, Rennes, France. 2014. <hal-00977409>
  • Emilie Kaufmann, Olivier Cappé, Aurélien Garivier. On the Complexity of A/B Testing. Conference on Learning Theory, Jun 2014, Barcelona, Spain. JMLR: Workshop and Conference Proceedings, 35, pp.461-481, 2014, Proceedings of The 27th Conference on Learning Theory. <http://jmlr.org/proceedings/papers/v35/>. <hal-00990254v2>
  • Céline Abraham, Jérémie Bettinelli, Gwendal Collet, Igor Kortchemski, Aurélien Garivier. Random maps. Journées MAS 2014, Aug 2014, Toulouse, France. Modélisation aléatoire et statistique - Journées MAS 2014, 51, pp.133-149, 2015, ESAIM : proceedings and surveys. <10.1051/proc/201551008>. <hal-01090666>
  • Aurélien Garivier. Informational Confidence Bounds for Self-Normalized Averages and Applications. IEEE Information Theory Workshop, Sep 2013, Seville, Spain. pp.489-493, 2013. <hal-00862062>
  • Sarah Filippi, Olivier Cappé, Aurélien Garivier. Optimism in Reinforcement Learning and Kullback-Leibler Divergence. Communication, Control, and Computing (Allerton), 2010 48th Annual Allerton Conference on, Sep 2010, Monticello (Illinois), United States. pp.115 - 122, 2010, <10.1109/ALLERTON.2010.5706896>. <hal-00476116v3>
  • Randal Douc, Aurélien Garivier, Eric Moulines, Jimmy Olsson. Approximation particulaire par FFBS de la loi de lissage pour des HMM dans des espaces d'états généraux. 41èmes Journées de Statistique, SFdS, Bordeaux, 2009, Bordeaux, France, France. 2009. <inria-00386750>