Recherche - Archive ouverte HAL

93 résultats

	Pour les 93 documents Envoyer sur ORCID RSS ATOM Exporter BibTeX XML-TEI CSV RTF EndNote PDF HTML Export avancé	Page : Page précédente 1 2 3 4 Page suivante	triés par Pertinence Auteur A→Z Auteur Z→A Titre A→Z Titre Z→A Date de publication croissante Date de publication décroissante Date de dépôt croissante Date de dépôt décroissante

		Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques Olivier Nicol , Jérémie Mary , Philippe Preux International Conference on Machine Learning, Jun 2014, Beijing, China Communication dans un congrès hal-00990840v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Compromis exploration-exploitation pour système de recommandation à grande échelle Frédéric Guillou , Romaric Gaudel , Philippe Preux Conférence francophone sur l'Apprentissage Automatique (CAp'16), Jul 2016, Marseille, France Communication dans un congrès hal-01406439v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Large-scale Bandit Recommender System Frédéric Guillou , Romaric Gaudel , Philippe Preux Proc. of the Second International Workshop on Machine Learning, Optimization and Big Data (MOD), Sep 2016, Volterra, Italy. pp.11, ⟨10.1007/978-3-319-51469-7_17⟩ Communication dans un congrès hal-01406389v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		gym-DSSAT: a crop model turned into a Reinforcement Learning environment Romain Gautron , Emilio J. Padrón , Philippe Preux , Julien Bigot , Odalric-Ambrym Maillard , et al. [Research Report] RR-9460, Inria Lille. 2022, pp.31 Rapport hal-03711132v4	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		User Engagement as Evaluation: a Ranking or a Regression Problem? Frédéric Guillou , Romaric Gaudel , Jérémie Mary , Philippe Preux 2014, ⟨10.1145/2668067.2668073⟩ Autre publication scientifique hal-01077986v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Sparse Temporal Difference Learning using LASSO Manuel Loth , Manuel Davy , Philippe Preux IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, Apr 2007, Hawaï, USA, United States Communication dans un congrès inria-00117075v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		The challenge of controlling microgrids in the presence of rare events with Deep Reinforcement Learning Tanguy Levent , Philippe Preux , Gonzague Henri , Réda Alami , Philippe Cordier , et al. IET Smart Grid, In press, ⟨10.1049/stg2.12003⟩ Article dans une revue hal-02971554v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Consistent Algorithms for Clustering Time Series Azadeh Khaleghi , Daniil Ryabko , Jérémie Mary , Philippe Preux Journal of Machine Learning Research, 2016, 17 (3), pp.1 - 32 Article dans une revue hal-01399613v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Scalable explore-exploit Collaborative Filtering Frédéric Guillou , Romaric Gaudel , Philippe Preux Pacific Asia Conference on Information Systems (PACIS'16), 2016, Chiayi, Taiwan Communication dans un congrès hal-01406418v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Collaborative Filtering as a Multi-Armed Bandit Frédéric Guillou , Romaric Gaudel , Philippe Preux NIPS'15 Workshop: Machine Learning for eCommerce, Dec 2015, Montréal, Canada Communication dans un congrès hal-01256254v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Advertising Campaigns Management: Should We Be Greedy? Sertan Girgin , Jérémie Mary , Philippe Preux , Olivier Nicol [Research Report] RR-7388, INRIA. 2010, pp.27 Rapport inria-00519694v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Equi-Gradient Temporal Difference Learning Manuel Loth , Manuel Davy , Rémi Coulom , Philippe Preux Kernel Methods and Reinforcement Learning, workshop of ICML 2006, Jun 2006, Pittsburgh, USA, United States Communication dans un congrès inria-00117178v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		MERL: Multi-Head Reinforcement Learning Yannis Flet-Berliac , Philippe Preux Deep Reinforcement Learning Workshop, NeurIPS, Dec 2019, Vancouver, Canada Communication dans un congrès hal-02305105v3	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Classification Localement Parcimonieuse par Méthodes Séquentielles Gabriel Dulac-Arnold , Ludovic Denoyer , Philippe Preux , Patrick Gallinari CAP 2012 - Conférence Francophone sur l'Apprentissage Automatique, May 2012, Nancy, France Communication dans un congrès hal-01357567v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Planning-based Approach for Optimizing the Display of Online Advertising Campaigns Sertan Girgin , Jérémie Mary , Philippe Preux , Olivier Nicol NIPS workshop on Machine Learning in Online ADvertising, Dec 2010, Whistler, Canada Communication dans un congrès hal-00772512v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		A comparison of two machine learning approaches for Photometric Solids Compression Delepoulle Samuel , François Rouselle , Renaud Christophe , Philippe Preux Plemenos, Dimitri; Miaoulis, Georgios. Intelligent Computer Graphics, 321, Springer, pp.145-164, 2010, Studies in Computational Intelligence Chapitre d'ouvrage hal-00826051v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Correctness Attraction: A Study of Stability of Software Behavior Under Runtime Perturbation Benjamin Danglot , Philippe Preux , Benoit Baudry , Martin Monperrus Empirical Software Engineering, 2018, 23 (4), pp.2086-2119. ⟨10.1007/s10664-017-9571-8⟩ Article dans une revue hal-01378523v3	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Feature Discovery in Approximate Dynamic Programming Philippe Preux , Sertan Girgin , Manuel Loth Approximate Dynamic Programming and Reinforcement Learning, Mar 2009, Nashville, United States Communication dans un congrès hal-00351144v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Simultaneous Optimistic Optimization on the Noiseless BBOB Testbed Bilel Derbel , Philippe Preux The 17th IEEE Congress on Evolutionary Computation (CEC), May 2015, Sendai, Japan Communication dans un congrès hal-01246420v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		A Generative Model of Software Dependency Graphs to Better Understand Software Evolution Vincenzo Musco , Martin Monperrus , Philippe Preux [Technical Report] hal-01078716, Inria. 2014 Rapport hal-01078716v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Learning crop management by reinforcement: gym-DSSAT Romain Gautron , Emilio J Padrón , Philippe Preux , Julien Bigot , Odalric-Ambrym Maillard , et al. AIAFS 2023 - 2nd AAAI Workshop on AI for Agriculture and Food Systems, Feb 2023, Washignton DC, United States Communication dans un congrès hal-03976393v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Better state exploration using action sequence equivalence Nathan Grinsztajn , Toby Johnstone , Johan Ferret , Philippe Preux NeurIPS 2022 - Deep Reinforcement Learning Workshop, Dec 2022, Virtual, United States Communication dans un congrès hal-03920349v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		L’apprentissage automatique : le diable n’est pas dans l’algorithme Philippe Preux , Marc Tommasi , Thierry Viéville , Colin de La Higuera 2015 Autre publication scientifique hal-01246178v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		A Multi-Armed Bandit Model Selection for Cold-Start User Recommendation Crícia Z Felício , Klérisson V R Paixão , Celia a Z Barcelos , Philippe Preux 25th ACM Conference on User Modelling, Adaptation and Personalization (UMAP), Jul 2017, Bratislava, Slovakia Communication dans un congrès hal-01517967v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		The Iso-regularization Descent Algorithm for the LASSO Manuel Loth , Philippe Preux 17th International Conference on Neural Information Processing, Nov 2010, Sidney, Australia Communication dans un congrès inria-00508257v2	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		General Framework for Nonlinear Functional Regression with Reproducing Kernel Hilbert Spaces Hachem Kadri , Emmanuel Duflos , Manuel Davy , Philippe Preux , Stephane Canu [Research Report] RR-6908, INRIA. 2009 Rapport inria-00378381v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Multiple functional regression with both discrete and continuous covariates Hachem Kadri , Philippe Preux , Emmanuel Duflos , Stéphane Canu 2nd International Workshop on Functional and Operatorial Statistics (IWFOS), Jun 2011, Santander, Spain. pp.189-195 Communication dans un congrès hal-00772425v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Functional Regularized Least Squares Classi cation with Operator-valued Kernels Hachem Kadri , Asma Rabaoui , Philippe Preux , Emmanuel Duflos , Alain Rakotomamonjy 28th International Conference on Machine Learning (ICML), Jun 2011, Seattle, United States. pp.993--1000 Communication dans un congrès hal-00772406v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Recent Advances in Reinforcement Learning Sertan Girgin , Manuel Loth , Rémi Munos , Philippe Preux , Daniil Ryabko Springer, Lectures Notes in Artificial Intelligence (LNAI), vol. 5323, pp.281, 2009 Ouvrages hal-00351128v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More
		Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning Alena Shilova , Thomas Delliaux , Philippe Preux , Bruno Raffin RR-9541, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189; Univ. Lille, CNRS, Centrale Lille, Inria UMR 9189 - CRIStAL,INRIA Lille Nord Europe, Villeneuve d’Ascq, France; Univ. Grenoble Alps, CNRS, Inria, Grenoble INP, LIG, 38000 Grenoble, France. 2024, pp.1-30 Rapport hal-04445160v1	Envoyer sur ORCID Exporter BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite Export avancé Partager Gmail Facebook X LinkedIn More

Filtrer vos résultats

Improving offline evaluation of contextual bandit algorithms via bootstrapping techniques

Compromis exploration-exploitation pour système de recommandation à grande échelle

Large-scale Bandit Recommender System

gym-DSSAT: a crop model turned into a Reinforcement Learning environment

User Engagement as Evaluation: a Ranking or a Regression Problem?

Sparse Temporal Difference Learning using LASSO

The challenge of controlling microgrids in the presence of rare events with Deep Reinforcement Learning

Consistent Algorithms for Clustering Time Series

Scalable explore-exploit Collaborative Filtering

Collaborative Filtering as a Multi-Armed Bandit

Advertising Campaigns Management: Should We Be Greedy?

Equi-Gradient Temporal Difference Learning

MERL: Multi-Head Reinforcement Learning

Classification Localement Parcimonieuse par Méthodes Séquentielles

Planning-based Approach for Optimizing the Display of Online Advertising Campaigns

A comparison of two machine learning approaches for Photometric Solids Compression

Correctness Attraction: A Study of Stability of Software Behavior Under Runtime Perturbation

Feature Discovery in Approximate Dynamic Programming

Simultaneous Optimistic Optimization on the Noiseless BBOB Testbed

A Generative Model of Software Dependency Graphs to Better Understand Software Evolution

Learning crop management by reinforcement: gym-DSSAT

Better state exploration using action sequence equivalence

L’apprentissage automatique : le diable n’est pas dans l’algorithme

A Multi-Armed Bandit Model Selection for Cold-Start User Recommendation

The Iso-regularization Descent Algorithm for the LASSO

General Framework for Nonlinear Functional Regression with Reproducing Kernel Hilbert Spaces

Multiple functional regression with both discrete and continuous covariates

Functional Regularized Least Squares Classi cation with Operator-valued Kernels

Recent Advances in Reinforcement Learning

Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning