Recherche - Archive ouverte HAL

66 résultats

	Pour les 66 documents Envoyer sur ORCID RSS ATOM Exporter BibTeX XML-TEI CSV RTF EndNote PDF HTML Export avancé	Page : Page précédente 1 2 3 Page suivante	triés par Pertinence Auteur A→Z Auteur Z→A Titre A→Z Titre Z→A Date de publication croissante Date de publication décroissante Date de dépôt croissante Date de dépôt décroissante

		Second-Order Kernel Online Convex Optimization with Adaptive Sketching Daniele Calandriello , Alessandro Lazaric , Michal Valko International Conference on Machine Learning, 2017, Sydney, Australia Communication dans un congrès hal-01537799v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Regret Minimization in MDPs with Options without Prior Knowledge Ronan Fruit , Matteo Pirotta , Alessandro Lazaric , Emma Brunskill NIPS 2017 - Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.1-36 Communication dans un congrès hal-01649082v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Bayesian Multi-Task Reinforcement Learning Alessandro Lazaric , Mohammad Ghavamzadeh ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. pp.599-606 Communication dans un congrès inria-00475214v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Risk-Aversion in Multi-armed Bandits Amir Sani , Alessandro Lazaric , Rémi Munos NIPS - Twenty-Sixth Annual Conference on Neural Information Processing Systems, Dec 2012, Lake Tahoe, United States Communication dans un congrès hal-00772609v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning Ronan Fruit , Matteo Pirotta , Alessandro Lazaric , Ronald Ortner ICML 2018 - The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.1578-1586 Communication dans un congrès hal-01941206v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Least-squares methods for policy iteration Lucian Busoniu , Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos , Robert Babuska , et al. Reinforcement Learning: State of the Art, Springer, pp.75-109, 2011 Chapitre d'ouvrage hal-00830122v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Distributed adaptive sampling for kernel matrix approximation Daniele Calandriello , Alessandro Lazaric , Michal Valko International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States Communication dans un congrès hal-01482760v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Classification-based Policy Iteration with a Critic Victor Gabillon , Alessandro Lazaric , Mohammad Ghavamzadeh , Bruno Scherrer 2011 Rapport hal-00590972v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Sparse Multi-task Reinforcement Learning Daniele Calandriello , Alessandro Lazaric , Marcello Restelli NIPS - Advances in Neural Information Processing Systems 26, Dec 2014, Montreal, Canada Communication dans un congrès hal-01073513v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Improved Learning Complexity in Combinatorial Pure Exploration Bandits Victor Gabillon , Alessandro Lazaric , Mohammad Ghavamzadeh , Ronald Ortner , Peter Bartlett Proceedings of the 19th International Conference on Artificial Intelligence (AISTATS), May 2016, Cadiz, Spain Communication dans un congrès hal-01322198v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Exploration–Exploitation in MDPs with Options Ronan Fruit , Alessandro Lazaric AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States Communication dans un congrès hal-01493567v2	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Rotting bandits are not harder than stochastic ones Julien Seznec , Andrea Locatelli , Alexandra Carpentier , Alessandro Lazaric , Michal Valko International Conference on Artificial Intelligence and Statistics, 2019, Naha, Japan Communication dans un congrès hal-01936894v2	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Truthful Learning Mechanisms for Multi–Slot Sponsored Search Auctions with Externalities Nicola Gatti , Alessandro Lazaric , Marco Rocco , Francesco Trovò Artificial Intelligence, 2015, 227, pp.93-139. ⟨10.1016/j.artint.2015.05.012⟩ Article dans une revue hal-01237670v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Learning with stochastic inputs and adversarial outputs Alessandro Lazaric , Rémi Munos Journal of Computer and System Sciences, 2012, 78 (5), pp.1516-1537. ⟨10.1016/j.jcss.2011.12.027⟩ Article dans une revue hal-00772046v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		A Truthful Learning Mechanism for Contextual Multi--Slot Sponsored Search Auctions with Externalities Nicola Gatti , Alessandro Lazaric , Francesco Trov'{o} EC - 13th ACM Conference on Electronic Commerce, Jun 2012, Valencia, Spain Communication dans un congrès hal-00772624v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Maximum Entropy Semi-Supervised Inverse Reinforcement Learning Julien Audiffren , Michal Valko , Alessandro Lazaric , Mohammad Ghavamzadeh International Joint Conference on Artificial Intelligence, Jul 2015, Bueons Aires, Argentina Communication dans un congrès hal-01146187v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Word-order biases in deep-agent emergent communication Rahma Chaabouni , Eugene Kharitonov , Alessandro Lazaric , Emmanuel Dupoux , Marco Baroni ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy Communication dans un congrès hal-02274157v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Sequential Transfer in Multi-armed Bandit with Finite Set of Models Mohammad Gheshlaghi Azar , Alessandro Lazaric , Emma Brunskill NIPS - Advances in Neural Information Processing Systems 25 - 2013, Dec 2013, Lake Tahoe, United States Communication dans un congrès hal-00924025v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Gaussian process optimization with adaptive sketching: Scalable and no regret Daniele Calandriello , Luigi Carratino , Alessandro Lazaric , Michal Valko , Lorenzo Rosasco Conference on Learning Theory, 2019, Phoenix, United States Communication dans un congrès hal-02144311v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Finite-Sample Analysis of Least-Squares Policy Iteration Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos Journal of Machine Learning Research, 2012, 13, pp.3041-3074 Article dans une revue hal-00772060v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence Victor Gabillon , Mohammad Ghavamzadeh , Alessandro Lazaric NIPS - Twenty-Sixth Annual Conference on Neural Information Processing Systems, Dec 2012, Lake Tahoe, United States Communication dans un congrès hal-00772615v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence Victor Gabillon , Mohammad Ghavamzadeh , Alessandro Lazaric [Research Report] 2012 Rapport hal-00747005v2	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Fighting Boredom in Recommender Systems with Linear Reinforcement Learning Romain Warlop , Alessandro Lazaric , Jérémie Mary Neural Information Processing Systems, Dec 2018, Montreal, Canada. ⟨10.5555/3326943.3327105⟩ Communication dans un congrès hal-01915468v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes Ronan Fruit , Matteo Pirotta , Alessandro Lazaric 32nd Conference on Neural Information Processing Systems, Dec 2018, Montréal, Canada Communication dans un congrès hal-01941220v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Adversarial Attacks on Linear Contextual Bandits Evrard Garcelon , Baptiste Roziere , Laurent Meunier , Jean Tarbouriech , Olivier Teytaud , et al. 2020 Pré-publication, Document de travail hal-02979184v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Risk-Aversion in Multi-armed Bandits Amir Sani , Alessandro Lazaric , Rémi Munos [Research Report] 2012 Rapport hal-00750298v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Pack only the essentials: Adaptive dictionary learning for kernel ridge regression Daniele Calandriello , Alessandro Lazaric , Michal Valko Adaptive and Scalable Nonparametric Methods in Machine Learning at Neural Information Processing Systems, 2016, Barcelona, Spain Communication dans un congrès hal-01482756v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Active Learning for Accurate Estimation of Linear Models Carlos Riquelme , Mohammad Ghavamzadeh , Alessandro Lazaric ICML 2017 - 34th International Conference on Machine Learning, Aug 2017, Sydney, Australia. pp.36 Communication dans un congrès hal-01538762v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Thompson Sampling for Linear-Quadratic Control Problems Marc Abeille , Alessandro Lazaric AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States Communication dans un congrès hal-01493564v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Finite-Sample Analysis of Least-Squares Policy Iteration Alessandro Lazaric , Mohammad Ghavamzadeh , Rémi Munos [Technical Report] 2010 Rapport inria-00528596v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More

Filtrer vos résultats

Second-Order Kernel Online Convex Optimization with Adaptive Sketching

Regret Minimization in MDPs with Options without Prior Knowledge

Bayesian Multi-Task Reinforcement Learning

Risk-Aversion in Multi-armed Bandits

Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning

Least-squares methods for policy iteration

Distributed adaptive sampling for kernel matrix approximation

Classification-based Policy Iteration with a Critic

Sparse Multi-task Reinforcement Learning

Improved Learning Complexity in Combinatorial Pure Exploration Bandits

Exploration–Exploitation in MDPs with Options

Rotting bandits are not harder than stochastic ones

Truthful Learning Mechanisms for Multi–Slot Sponsored Search Auctions with Externalities

Learning with stochastic inputs and adversarial outputs

A Truthful Learning Mechanism for Contextual Multi--Slot Sponsored Search Auctions with Externalities

Maximum Entropy Semi-Supervised Inverse Reinforcement Learning

Word-order biases in deep-agent emergent communication

Sequential Transfer in Multi-armed Bandit with Finite Set of Models

Gaussian process optimization with adaptive sketching: Scalable and no regret

Finite-Sample Analysis of Least-Squares Policy Iteration

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence

Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence

Fighting Boredom in Recommender Systems with Linear Reinforcement Learning

Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes

Adversarial Attacks on Linear Contextual Bandits

Risk-Aversion in Multi-armed Bandits

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

Active Learning for Accurate Estimation of Linear Models

Thompson Sampling for Linear-Quadratic Control Problems

Finite-Sample Analysis of Least-Squares Policy Iteration