Recherche - Archive ouverte HAL Accéder directement au contenu

Filtrer vos résultats

66 résultats
Image document

Second-Order Kernel Online Convex Optimization with Adaptive Sketching

Daniele Calandriello , Alessandro Lazaric , Michal Valko
International Conference on Machine Learning, 2017, Sydney, Australia
Communication dans un congrès hal-01537799v1
Image document

Regret Minimization in MDPs with Options without Prior Knowledge

Ronan Fruit , Matteo Pirotta , Alessandro Lazaric , Emma Brunskill
NIPS 2017 - Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.1-36
Communication dans un congrès hal-01649082v1
Image document

Bayesian Multi-Task Reinforcement Learning

Alessandro Lazaric , Mohammad Ghavamzadeh
ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. pp.599-606
Communication dans un congrès inria-00475214v1
Image document

Risk-Aversion in Multi-armed Bandits

Amir Sani , Alessandro Lazaric , Rémi Munos
NIPS - Twenty-Sixth Annual Conference on Neural Information Processing Systems, Dec 2012, Lake Tahoe, United States
Communication dans un congrès hal-00772609v1
Image document

Risk-Aversion in Multi-armed Bandits

Amir Sani , Alessandro Lazaric , Rémi Munos
[Research Report] 2012
Rapport hal-00750298v1
Image document

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

Daniele Calandriello , Alessandro Lazaric , Michal Valko
Adaptive and Scalable Nonparametric Methods in Machine Learning at Neural Information Processing Systems, 2016, Barcelona, Spain
Communication dans un congrès hal-01482756v1
Image document

Active Learning for Accurate Estimation of Linear Models

Carlos Riquelme , Mohammad Ghavamzadeh , Alessandro Lazaric
ICML 2017 - 34th International Conference on Machine Learning, Aug 2017, Sydney, Australia. pp.36
Communication dans un congrès hal-01538762v1
Image document

Thompson Sampling for Linear-Quadratic Control Problems

Marc Abeille , Alessandro Lazaric
AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States
Communication dans un congrès hal-01493564v1
Image document

Finite-Sample Analysis of Least-Squares Policy Iteration

Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos
[Technical Report] 2010
Rapport inria-00528596v1
Image document

Analysis of a Classification-based Policy Iteration Algorithm

Alessandro Lazaric , Mohammad Ghavamzadeh , Remi Munos
ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. pp.607-614
Communication dans un congrès inria-00482065v3
Image document

Transfer in Reinforcement Learning: a Framework and a Survey

Alessandro Lazaric
Marco Wiering, Martijn van Otterlo. Reinforcement Learning - State of the art, 12, Springer, pp.143-173, 2012, ⟨10.1007/978-3-642-27645-3_5⟩
Chapitre d'ouvrage hal-00772626v1
Image document

Regret Bounds for Reinforcement Learning with Policy Advice

Mohammad Gheshlaghi Azar , Alessandro Lazaric , Emma Brunskill
ECML/PKDD - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2013, Prague, Czech Republic
Communication dans un congrès hal-00924021v1
Image document

Un sélecteur de Dantzig pour l'apprentissage par différences temporelles

Matthieu Geist , Bruno Scherrer , Alessandro Lazaric , Mohammad Ghavamzadeh
Journées Francophones sur la planification, la décision et l'apprentissage pour le contrôle des systèmes - JFPDA 2012, May 2012, Villers-lès-Nancy, France. 13 p
Communication dans un congrès hal-00736229v1
Image document

Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning

Ronan Fruit , Matteo Pirotta , Alessandro Lazaric , Ronald Ortner
ICML 2018 - The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.1578-1586
Communication dans un congrès hal-01941206v1
Image document

Least-squares methods for policy iteration

Lucian Busoniu , Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos , Robert Babuska , et al.
Reinforcement Learning: State of the Art, Springer, pp.75-109, 2011
Chapitre d'ouvrage hal-00830122v1
Image document

Distributed adaptive sampling for kernel matrix approximation

Daniele Calandriello , Alessandro Lazaric , Michal Valko
International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States
Communication dans un congrès hal-01482760v1
Image document

Classification-based Policy Iteration with a Critic

Victor Gabillon , Alessandro Lazaric , Mohammad Ghavamzadeh , Bruno Scherrer
2011
Rapport hal-00590972v1
Image document

Sparse Multi-task Reinforcement Learning

Daniele Calandriello , Alessandro Lazaric , Marcello Restelli
NIPS - Advances in Neural Information Processing Systems 26, Dec 2014, Montreal, Canada
Communication dans un congrès hal-01073513v1
Image document

Improved Learning Complexity in Combinatorial Pure Exploration Bandits

Victor Gabillon , Alessandro Lazaric , Mohammad Ghavamzadeh , Ronald Ortner , Peter Bartlett
Proceedings of the 19th International Conference on Artificial Intelligence (AISTATS), May 2016, Cadiz, Spain
Communication dans un congrès hal-01322198v1
Image document

Exploration–Exploitation in MDPs with Options

Ronan Fruit , Alessandro Lazaric
AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States
Communication dans un congrès hal-01493567v2
Image document

Rotting bandits are not harder than stochastic ones

Julien Seznec , Andrea Locatelli , Alexandra Carpentier , Alessandro Lazaric , Michal Valko
International Conference on Artificial Intelligence and Statistics, 2019, Naha, Japan
Communication dans un congrès hal-01936894v2
Image document

Truthful Learning Mechanisms for Multi–Slot Sponsored Search Auctions with Externalities

Nicola Gatti , Alessandro Lazaric , Marco Rocco , Francesco Trovò
Artificial Intelligence, 2015, 227, pp.93-139. ⟨10.1016/j.artint.2015.05.012⟩
Article dans une revue hal-01237670v1
Image document

Learning with stochastic inputs and adversarial outputs

Alessandro Lazaric , Rémi Munos
Journal of Computer and System Sciences, 2012, 78 (5), pp.1516-1537. ⟨10.1016/j.jcss.2011.12.027⟩
Article dans une revue hal-00772046v1
Image document

A Truthful Learning Mechanism for Contextual Multi--Slot Sponsored Search Auctions with Externalities

Nicola Gatti , Alessandro Lazaric , Francesco Trov'{o}
EC - 13th ACM Conference on Electronic Commerce, Jun 2012, Valencia, Spain
Communication dans un congrès hal-00772624v1
Image document

Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

Julien Audiffren , Michal Valko , Alessandro Lazaric , Mohammad Ghavamzadeh
International Joint Conference on Artificial Intelligence, Jul 2015, Bueons Aires, Argentina
Communication dans un congrès hal-01146187v1
Image document

Word-order biases in deep-agent emergent communication

Rahma Chaabouni , Eugene Kharitonov , Alessandro Lazaric , Emmanuel Dupoux , Marco Baroni
ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy
Communication dans un congrès hal-02274157v1
Image document

Linear Thompson Sampling Revisited

Marc Abeille , Alessandro Lazaric
AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States
Communication dans un congrès hal-01493561v1

Parallel Higher Order Alternating Least Square for Tensor Recommender System

Romain Warlop , Alessandro Lazaric , Jérémie Mary
AAAI 2017 - Thirty-First AAAI Conference on Artificial Intelligence, Feb 2017, San Francisco, United States
Communication dans un congrès hal-01628298v1
Image document

Efficient second-order online kernel learning with adaptive embedding

Daniele Calandriello , Alessandro Lazaric , Michal Valko
Neural Information Processing Systems, 2017, Long Beach, United States
Communication dans un congrès hal-01643961v1
Image document

Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits

Alexandra Carpentier , Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos , Peter Auer
ALT - the 22nd conference on Algorithmic Learning Theory, Oct 2011, Espoo, Finland
Communication dans un congrès hal-00659696v1