Search - HAL open archive

66 results


		Second-Order Kernel Online Convex Optimization with Adaptive Sketching. Daniele Calandriello, Alessandro Lazaric, Michal Valko. International Conference on Machine Learning, 2017, Sydney, Australia. Conference paper. hal-01537799v1
		Regret Minimization in MDPs with Options without Prior Knowledge. Ronan Fruit, Matteo Pirotta, Alessandro Lazaric, Emma Brunskill. NIPS 2017 - Neural Information Processing Systems, Dec 2017, Long Beach, United States. pp.1-36. Conference paper. hal-01649082v1
		Bayesian Multi-Task Reinforcement Learning. Alessandro Lazaric, Mohammad Ghavamzadeh. ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. pp.599-606. Conference paper. inria-00475214v1
		Risk-Aversion in Multi-armed Bandits. Amir Sani, Alessandro Lazaric, Rémi Munos. NIPS - Twenty-Sixth Annual Conference on Neural Information Processing Systems, Dec 2012, Lake Tahoe, United States. Conference paper. hal-00772609v1
		Risk-Aversion in Multi-armed Bandits. Amir Sani, Alessandro Lazaric, Rémi Munos. [Research Report] 2012. Report. hal-00750298v1
		Pack only the essentials: Adaptive dictionary learning for kernel ridge regression. Daniele Calandriello, Alessandro Lazaric, Michal Valko. Adaptive and Scalable Nonparametric Methods in Machine Learning at Neural Information Processing Systems, 2016, Barcelona, Spain. Conference paper. hal-01482756v1
		Active Learning for Accurate Estimation of Linear Models. Carlos Riquelme, Mohammad Ghavamzadeh, Alessandro Lazaric. ICML 2017 - 34th International Conference on Machine Learning, Aug 2017, Sydney, Australia. pp.36. Conference paper. hal-01538762v1
		Thompson Sampling for Linear-Quadratic Control Problems. Marc Abeille, Alessandro Lazaric. AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. Conference paper. hal-01493564v1
		Finite-Sample Analysis of Least-Squares Policy Iteration. Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos. [Technical Report] 2010. Report. inria-00528596v1
		Analysis of a Classification-based Policy Iteration Algorithm. Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos. ICML - 27th International Conference on Machine Learning, Jun 2010, Haifa, Israel. pp.607-614. Conference paper. inria-00482065v3
		Transfer in Reinforcement Learning: a Framework and a Survey. Alessandro Lazaric. In: Marco Wiering, Martijn van Otterlo (eds.), Reinforcement Learning - State of the art, 12, Springer, pp.143-173, 2012, ⟨10.1007/978-3-642-27645-3_5⟩. Book chapter. hal-00772626v1
		Regret Bounds for Reinforcement Learning with Policy Advice. Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill. ECML/PKDD - European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, Sep 2013, Prague, Czech Republic. Conference paper. hal-00924021v1
		Un sélecteur de Dantzig pour l'apprentissage par différences temporelles. Matthieu Geist, Bruno Scherrer, Alessandro Lazaric, Mohammad Ghavamzadeh. Journées Francophones sur la planification, la décision et l'apprentissage pour le contrôle des systèmes - JFPDA 2012, May 2012, Villers-lès-Nancy, France. 13 p. Conference paper. hal-00736229v1
		Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning. Ronan Fruit, Matteo Pirotta, Alessandro Lazaric, Ronald Ortner. ICML 2018 - The 35th International Conference on Machine Learning, Jul 2018, Stockholm, Sweden. pp.1578-1586. Conference paper. hal-01941206v1
		Least-squares methods for policy iteration. Lucian Busoniu, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Robert Babuska, et al. Reinforcement Learning: State of the Art, Springer, pp.75-109, 2011. Book chapter. hal-00830122v1
		Distributed adaptive sampling for kernel matrix approximation. Daniele Calandriello, Alessandro Lazaric, Michal Valko. International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States. Conference paper. hal-01482760v1
		Classification-based Policy Iteration with a Critic. Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Bruno Scherrer. 2011. Report. hal-00590972v1
		Sparse Multi-task Reinforcement Learning. Daniele Calandriello, Alessandro Lazaric, Marcello Restelli. NIPS - Advances in Neural Information Processing Systems 26, Dec 2014, Montreal, Canada. Conference paper. hal-01073513v1
		Improved Learning Complexity in Combinatorial Pure Exploration Bandits. Victor Gabillon, Alessandro Lazaric, Mohammad Ghavamzadeh, Ronald Ortner, Peter Bartlett. Proceedings of the 19th International Conference on Artificial Intelligence and Statistics (AISTATS), May 2016, Cadiz, Spain. Conference paper. hal-01322198v1
		Exploration–Exploitation in MDPs with Options. Ronan Fruit, Alessandro Lazaric. AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. Conference paper. hal-01493567v2
		Rotting bandits are not harder than stochastic ones. Julien Seznec, Andrea Locatelli, Alexandra Carpentier, Alessandro Lazaric, Michal Valko. International Conference on Artificial Intelligence and Statistics, 2019, Naha, Japan. Conference paper. hal-01936894v2
		Truthful Learning Mechanisms for Multi–Slot Sponsored Search Auctions with Externalities. Nicola Gatti, Alessandro Lazaric, Marco Rocco, Francesco Trovò. Artificial Intelligence, 2015, 227, pp.93-139. ⟨10.1016/j.artint.2015.05.012⟩. Journal article. hal-01237670v1
		Learning with stochastic inputs and adversarial outputs. Alessandro Lazaric, Rémi Munos. Journal of Computer and System Sciences, 2012, 78 (5), pp.1516-1537. ⟨10.1016/j.jcss.2011.12.027⟩. Journal article. hal-00772046v1
		A Truthful Learning Mechanism for Contextual Multi-Slot Sponsored Search Auctions with Externalities. Nicola Gatti, Alessandro Lazaric, Francesco Trovò. EC - 13th ACM Conference on Electronic Commerce, Jun 2012, Valencia, Spain. Conference paper. hal-00772624v1
		Maximum Entropy Semi-Supervised Inverse Reinforcement Learning. Julien Audiffren, Michal Valko, Alessandro Lazaric, Mohammad Ghavamzadeh. International Joint Conference on Artificial Intelligence, Jul 2015, Buenos Aires, Argentina. Conference paper. hal-01146187v1
		Word-order biases in deep-agent emergent communication. Rahma Chaabouni, Eugene Kharitonov, Alessandro Lazaric, Emmanuel Dupoux, Marco Baroni. ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Jul 2019, Florence, Italy. Conference paper. hal-02274157v1
		Linear Thompson Sampling Revisited. Marc Abeille, Alessandro Lazaric. AISTATS 2017 - 20th International Conference on Artificial Intelligence and Statistics, Apr 2017, Fort Lauderdale, United States. Conference paper. hal-01493561v1
		Parallel Higher Order Alternating Least Square for Tensor Recommender System. Romain Warlop, Alessandro Lazaric, Jérémie Mary. AAAI 2017 - Thirty-First AAAI Conference on Artificial Intelligence, Feb 2017, San Francisco, United States. Conference paper. hal-01628298v1
		Efficient second-order online kernel learning with adaptive embedding. Daniele Calandriello, Alessandro Lazaric, Michal Valko. Neural Information Processing Systems, 2017, Long Beach, United States. Conference paper. hal-01643961v1
		Upper-Confidence-Bound Algorithms for Active Learning in Multi-Armed Bandits. Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Peter Auer. ALT - the 22nd conference on Algorithmic Learning Theory, Oct 2011, Espoo, Finland. Conference paper. hal-00659696v1
