Filtrer vos résultats
- 81
- 12
- 60
- 12
- 12
- 3
- 3
- 2
- 1
- 8
- 1
- 84
- 28
- 4
- 4
- 2
- 2
- 1
- 9
- 7
- 4
- 5
- 3
- 3
- 3
- 11
- 6
- 5
- 1
- 8
- 5
- 7
- 5
- 6
- 3
- 1
- 89
- 4
- 91
- 50
- 43
- 43
- 8
- 7
- 5
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 93
- 15
- 9
- 8
- 8
- 5
- 5
- 5
- 5
- 5
- 5
- 5
- 4
- 4
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
93 résultats
|
Improving offline evaluation of contextual bandit algorithms via bootstrapping techniquesInternational Conference on Machine Learning, Jun 2014, Beijing, China
Communication dans un congrès
hal-00990840v1
|
||
Compromis exploration-exploitation pour système de recommandation à grande échelleConférence francophone sur l'Apprentissage Automatique (CAp'16), Jul 2016, Marseille, France
Communication dans un congrès
hal-01406439v1
|
|||
|
Large-scale Bandit Recommender SystemProc. of the Second International Workshop on Machine Learning, Optimization and Big Data (MOD), Sep 2016, Volterra, Italy. pp.11, ⟨10.1007/978-3-319-51469-7_17⟩
Communication dans un congrès
hal-01406389v1
|
||
|
gym-DSSAT: a crop model turned into a Reinforcement Learning environment[Research Report] RR-9460, Inria Lille. 2022, pp.31
Rapport
hal-03711132v4
|
||
|
User Engagement as Evaluation: a Ranking or a Regression Problem?Autre publication scientifique hal-01077986v1 |
||
|
Sparse Temporal Difference Learning using LASSOIEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, Apr 2007, Hawaï, USA, United States
Communication dans un congrès
inria-00117075v1
|
||
|
The challenge of controlling microgrids in the presence of rare events with Deep Reinforcement LearningIET Smart Grid, In press, ⟨10.1049/stg2.12003⟩
Article dans une revue
hal-02971554v1
|
||
|
Consistent Algorithms for Clustering Time SeriesJournal of Machine Learning Research, 2016, 17 (3), pp.1 - 32
Article dans une revue
hal-01399613v1
|
||
Scalable explore-exploit Collaborative FilteringPacific Asia Conference on Information Systems (PACIS'16), 2016, Chiayi, Taiwan
Communication dans un congrès
hal-01406418v1
|
|||
|
Collaborative Filtering as a Multi-Armed BanditNIPS'15 Workshop: Machine Learning for eCommerce, Dec 2015, Montréal, Canada
Communication dans un congrès
hal-01256254v1
|
||
|
Advertising Campaigns Management: Should We Be Greedy?[Research Report] RR-7388, INRIA. 2010, pp.27
Rapport
inria-00519694v1
|
||
|
Equi-Gradient Temporal Difference LearningKernel Methods and Reinforcement Learning, workshop of ICML 2006, Jun 2006, Pittsburgh, USA, United States
Communication dans un congrès
inria-00117178v1
|
||
|
MERL: Multi-Head Reinforcement LearningDeep Reinforcement Learning Workshop, NeurIPS, Dec 2019, Vancouver, Canada
Communication dans un congrès
hal-02305105v3
|
||
Classification Localement Parcimonieuse par Méthodes SéquentiellesCAP 2012 - Conférence Francophone sur l'Apprentissage Automatique, May 2012, Nancy, France
Communication dans un congrès
hal-01357567v1
|
|||
|
Planning-based Approach for Optimizing the Display of Online Advertising CampaignsNIPS workshop on Machine Learning in Online ADvertising, Dec 2010, Whistler, Canada
Communication dans un congrès
hal-00772512v1
|
||
A comparison of two machine learning approaches for Photometric Solids CompressionPlemenos, Dimitri; Miaoulis, Georgios. Intelligent Computer Graphics, 321, Springer, pp.145-164, 2010, Studies in Computational Intelligence
Chapitre d'ouvrage
hal-00826051v1
|
|||
|
Correctness Attraction: A Study of Stability of Software Behavior Under Runtime PerturbationEmpirical Software Engineering, 2018, 23 (4), pp.2086-2119. ⟨10.1007/s10664-017-9571-8⟩
Article dans une revue
hal-01378523v3
|
||
Feature Discovery in Approximate Dynamic ProgrammingApproximate Dynamic Programming and Reinforcement Learning, Mar 2009, Nashville, United States
Communication dans un congrès
hal-00351144v1
|
|||
Simultaneous Optimistic Optimization on the Noiseless BBOB TestbedThe 17th IEEE Congress on Evolutionary Computation (CEC), May 2015, Sendai, Japan
Communication dans un congrès
hal-01246420v1
|
|||
|
A Generative Model of Software Dependency Graphs to Better Understand Software Evolution[Technical Report] hal-01078716, Inria. 2014
Rapport
hal-01078716v1
|
||
|
Learning crop management by reinforcement: gym-DSSATAIAFS 2023 - 2nd AAAI Workshop on AI for Agriculture and Food Systems, Feb 2023, Washignton DC, United States
Communication dans un congrès
hal-03976393v1
|
||
|
Better state exploration using action sequence equivalenceNeurIPS 2022 - Deep Reinforcement Learning Workshop, Dec 2022, Virtual, United States
Communication dans un congrès
hal-03920349v1
|
||
L’apprentissage automatique : le diable n’est pas dans l’algorithme2015
Autre publication scientifique
hal-01246178v1
|
|||
|
A Multi-Armed Bandit Model Selection for Cold-Start User Recommendation 25th ACM Conference on User Modelling, Adaptation and Personalization (UMAP), Jul 2017, Bratislava, Slovakia
Communication dans un congrès
hal-01517967v1
|
||
|
The Iso-regularization Descent Algorithm for the LASSO17th International Conference on Neural Information Processing, Nov 2010, Sidney, Australia
Communication dans un congrès
inria-00508257v2
|
||
|
General Framework for Nonlinear Functional Regression with Reproducing Kernel Hilbert Spaces[Research Report] RR-6908, INRIA. 2009
Rapport
inria-00378381v1
|
||
|
Multiple functional regression with both discrete and continuous covariates2nd International Workshop on Functional and Operatorial Statistics (IWFOS), Jun 2011, Santander, Spain. pp.189-195
Communication dans un congrès
hal-00772425v1
|
||
|
Functional Regularized Least Squares Classi cation with Operator-valued Kernels28th International Conference on Machine Learning (ICML), Jun 2011, Seattle, United States. pp.993--1000
Communication dans un congrès
hal-00772406v1
|
||
Recent Advances in Reinforcement LearningSpringer, Lectures Notes in Artificial Intelligence (LNAI), vol. 5323, pp.281, 2009
Ouvrages
hal-00351128v1
|
|||
|
Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement LearningRR-9541, Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189; Univ. Lille, CNRS, Centrale Lille, Inria UMR 9189 - CRIStAL,INRIA Lille Nord Europe, Villeneuve d’Ascq, France; Univ. Grenoble Alps, CNRS, Inria, Grenoble INP, LIG, 38000 Grenoble, France. 2024, pp.1-30
Rapport
hal-04445160v1
|