- 63
- 22
- 4
- 4
- 3
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
DC
David Causeur
111
Documents
Identifiants chercheurs
- david-causeur
- 0000-0001-6910-9440
- Google Scholar : https://scholar.google.com/citations?user=kCws61IAAAAJ&hl=fr
Présentation
Research topics
===============
My research interests are in statistical methodology for various issues motivated by biological applications. More particularly, the focus of my recent papers is on the handling of dependence in high-dimensional statistical inference.
**Statistical genomics**
Dependence within high-dimensional gene expression profiles generates a strong instability of gene selection in large scale significance analysis. A proper handling of this dependence by latent factor models or more general whitening techniques improves stability of multiple testing procedures and power (see for example Friguet *et al*, 2009 \[JASA\], Friguet and Causeur, 2010 \[CSDA\], Causeur *et al*, 2011 \[JSS\], Hornung *et al*, 2016 \[BMC Bioinf.\], Hornung *et al*, 2017 \[Bioinf.\], Hébert *et al*, 2021 \[CSDA\]).
**Functional data analysis**
Functional data are discretized observations of curves. Such data are generated by various technologies such as spectroscopy or electroencephalography (EEG). More and more study designs, such as Event-Related Potentials studies in neuroscience, aim at assessing the relationship between functional data and experimental covariates. Both for signal detection (global testing) and signal identification (search for significant intervals), dependence handling strategies can be designed to be efficient for a given pattern of association signal (see for example Causeur *et al*, 2012 \[BRM\], Sheu *et al*, 2016 \[AoAS\], Causeur *et al*, 2020 \[Biometrics\]).
**High-dimensional regression and classification modeling**
Both in genomic and functional data analysis, estimation of regression and classification models has to deal with a possibly strong dependence within high-dimensional profiles of explanatory variables. Whitening procedures can help stabilizing model selection methods and improve prediction performance (see for example Perthame *et al*, 2015 \[StatCo\], Hébert *et al*, 2021 \[Under revision\]).
**Biostatistics**
I am involved in various research projects with a diversity of partners in biology, recently for multi-omic data integration issues (see Gondret *et al*., 2017 \[BMC Gen.\], Désert *et al*., 2018 \[BMC Gen.\]), peptidomics (see Suwareh *et al*., 2021 \[Food Ch.\]), electromyographic data analysis (see Comfort *et al*., 2021 \[PACA\]), etc.
Publications
- 6
- 5
- 5
- 5
- 4
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 25
- 22
- 15
- 13
- 12
- 9
- 7
- 6
- 6
- 5
- 5
- 5
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 5
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 3
- 6
- 3
- 2
- 2
- 3
- 8
- 2
- 8
- 4
- 8
- 10
- 7
- 15
- 7
- 5
- 5
- 5
- 1
- 1
- 2
- 2
- 1
Statistical modelling of in vitro pepsinolysis using peptidomic dataJournées Scientifiques de l'ED EGAAL 2021, L’école doctorale « Écologie, Géosciences, Agronomie, ALimentation, Jun 2021, Rennes, France
Communication dans un congrès
hal-03318607v1
|
|
Statistical modeling of in vitro pepsin specificityVirtual International Conference on Food Digestion, Cost Infogest, May 2021, Virtual International Conference on Food Digestion (#VICFD2021) on 6-7th May 2021., France
Communication dans un congrès
hal-03220832v1
|
|
Prediction of fatty acids in the rainbow trout Oncorhynchus mykiss: a Raman scattering spectroscopy approach22. International Conference on Transparent Optical Networks (ICTON), Jul 2020, Bari, Italy. pp.1-4, ⟨10.1109/ICTON51198.2020.9203514⟩
Communication dans un congrès
hal-02948337v1
|
|
|
Décorrélation adaptative pour la prédiction en grande dimension51es Journées de Statistique 2019, Société Française de Statistique, Jun 2019, Nancy, France
Communication dans un congrès
hal-02361735v1
|
|
Identification of the physicochemical characteristics of peptides that influence their hydrolysis by pepsinWorkshop Infogest statistical treatment of peptidomic data, Institut National de Recherche Agronomique (INRA). UMR UMR INRA / AgroCampus Rennes : Science et Technologie du Lait et de l'?uf (1253)., Dec 2019, Lyon, France
Communication dans un congrès
hal-02737436v1
|
|
PROPENSITY WEIGHTING FOR SURVEY NONRESPONSE THROUGH MACHINE LEARNING13es Journées de méthodologie statistique de l'Insee (JMS), Jun 2018, Paris, France
Communication dans un congrès
hal-02076739v1
|
Identification des fonctions biologiques, spécifiques d’un tissu ou partagées entre tissus, associées aux différences d’efficacité alimentaire chez le porc en croissance49. Journées de la Recherche Porcine, Jan 2017, Paris, France. pp.13-18
Communication dans un congrès
hal-01602699v1
|
|
|
Variable selection for correlated data in high dimension using decorrelation methodsStatlearn: Challenging problems in statistical learning, Apr 2016, Vannes, France
Communication dans un congrès
hal-01310571v1
|
Data integrationMeeting INRA-ISU, Mar 2015, Ames, United States. 11 diapositives
Communication dans un congrès
hal-01210940v1
|
|
ERP: an R package for Event-Related Potentials data analysisuseR!2014, UCLA Statistics Department; Foundation for Open Access Statistics; Los Angels R user group, Jun 2014, Los Angeles, United States
Communication dans un congrès
hal-01167321v1
|
|
Dealing with long-time range dependence in large-scale multiple testing of Event-Related Potentials data46e Journées de Statistique, Société Française de Statistique, Jun 2014, Rennes, France
Communication dans un congrès
hal-01167092v1
|
|
|
Sparse factor model for gene co-expression networksBioNetVisA Workshop, Sep 2014, Strasbourg, France
Communication dans un congrès
hal-01210988v1
|
FADA: an R package for variable selection in supervised classification of strongly dependent datauseR!2014, UCLA Statistics Department; Foundation for Open Access Statistics; Los Angels R user group, Jun 2014, Los Angeles, United States
Communication dans un congrès
hal-01167304v1
|
|
|
Stabilité de la sélection de variables pour la classification de données en grande dimension45 èmes Journées de Statistique, May 2013, Toulouse, France
Communication dans un congrès
hal-00913047v1
|
Generalized linear factor modeling for dependence between SNPs in GWASStatistical fort Genomic Data (SMPGD'13), Jan 2013, Amsterdam, Netherlands
Communication dans un congrès
hal-01451794v1
|
|
|
Inferring gene networks using a sparse factor model approach, Statistical Learning and Data ScienceLearning and Data Science, May 2012, Florence (IT), Italy
Communication dans un congrès
hal-00841017v1
|
Sparse factor models for gene co-expression networks.Séminaire de Biostatistiques, Dec 2012, Toulouse (FR), France. non paginé
Communication dans un congrès
hal-00841019v1
|
|
Genetic analysis of a complex trait using transcriptomic data: contribution of gene regulatory network modelingXX. Conference of the plant & animal genome (PAG), Jan 2012, San Diego, United States. 1 p
Communication dans un congrès
hal-00841103v1
|
|
Inférence de réseaux de co-expression génique10ème Journée Biogenouest, Nov 2012, Rennes (FR), France. non paginé
Communication dans un congrès
hal-00841018v1
|
|
|
Inferring gene networks using a sparse factor model approachStatistical learning and data science, May 2012, Florence, Italy. 12 p
Communication dans un congrès
hal-02806777v1
|
Selection stability for supervised classification of heterogeneous data.Selection stability for supervised classification of heterogeneous data., Oct 2011, Rennes (FR), France. non paginé
Communication dans un congrès
hal-00842502v1
|
|
|
Integrating QTL controlling fatness, lipid metabolites and gene expressions to genetically dissect the adiposity complex trait in a meat chicken cross62. Annual Meeting of the European Association for Animal Production (EAAP), Aug 2011, Stavanger, Norway
Communication dans un congrès
hal-02748764v1
|
Effect of the gene diacylglycerol-O-transferase 1 (DGAT1) polymorphism on the global expression pattern of genes in the mammary gland tissue of dairy cows62. Annual Meeting of the European Federation of Animal Science (EAAP), Aug 2011, Stavanger, Norway
Communication dans un congrès
hal-02747109v1
|
|
Integrating biological knowledge in gene expression data analysisISI 2011, Aug 2011, Dublin (IE), Ireland. non paginé
Communication dans un congrès
hal-00842500v1
|
|
|
Large-scale significance testing of high thoroughput Data with FAMT14. Conference of the Applied Stochastic Models and Data Analysis International Society, Jun 2011, Rome, Italy
Communication dans un congrès
hal-02746752v1
|
|
Transcriptome profiling reveals interaction between two QTL for fatness in chicken15th European workshop on QTL mapping and marker assisted selection (QTLMAS), May 2011, Rennes (FR), France. 1 p
Communication dans un congrès
hal-00841016v1
|
Feature selection stability in high-dimensional heterogeneous data.Cladag 2011, Sep 2011, Pavia (IT), Italy. non paginé
Communication dans un congrès
hal-00842499v1
|
|
A factor model to analyze heterogeneity in gene expression in a context of QTL mapping8th workshop " Statistical Methods for Post-Genomic Data ", Jan 2010, Marseille, France
Communication dans un congrès
hal-00459362v1
|
|
|
Inférence sur réseaux géniques par Analyse en Facteurs42èmes Journées de Statistique, 2010, Marseille, France, France
Communication dans un congrès
inria-00494802v1
|
Estimation conditionnelle de la proportion d'hypotheses nulles en grande dimension41èmes Journées de Statistiques, Société Française de Statistiques, May 2009, Bordeaux, France
Communication dans un congrès
hal-00461645v1
|
|
Factor Analysis for Multiple Testing (FAMT) : an R package for simultaneous tests under dependence in high-dimensional dataUseR!2009. Agrocampus, Jul 2009, Rennes, France
Communication dans un congrès
hal-00461700v1
|
|
|
Estimation conditionnelle de la proportion d'hypothèses nulles en grande dimension41èmes Journées de Statistique, SFdS, Bordeaux, 2009, Bordeaux, France, France
Communication dans un congrès
inria-00386591v1
|
Estimation of the proportion of null p-values among dependent testsPetersburg Workshop on Simulation, Jun 2009, St Petersburg, Russia
Communication dans un congrès
hal-00461648v1
|
|
Factor Analysis for Multiple Testing: a general approach for differential analysis of genome-scale dependent dataWorkshop on Statistical advances in Genome-scale Data Analysis, May 2009, Ascona, Switzerland
Communication dans un congrès
hal-00461632v1
|
|
Conditional Fdr estimation based on a factor analytic approach of multiple testingWorkshop on Simulation, Jun 2009, St Petersburg, Russia
Communication dans un congrès
hal-00461697v1
|
|
Factor Analysis for Multiple Testing: an R-package to analyze a genome-scale datasetWorkshop « Statistical advances in Genome-scale Data Analysis », May 2009, Ascona, Switzerland
Communication dans un congrès
hal-00459370v1
|
|
Multiple tests for high-throughput data assuming a factor modeling of dependence40th Journées de Statistiques (Société Française de Statistiques) / joint with Société Statistique, May 2008, Ottawa, Canada
Communication dans un congrès
hal-00461705v1
|
|
Approche conditionnelle des tests multiples pour données biologiques à haut-débit7ème Journée Jeunes Chercheurs (Société Française de Biométrie). INSERM, Dec 2008, Villejuif, France
Communication dans un congrès
hal-00461708v1
|
|
Accounting for a factor structure in high-dimensional data to improve multiple testing procedures6th workshop on Statistical Methods for Post-Genomic Data. Agrocampus, Feb 2008, Rennes, France
Communication dans un congrès
hal-00461702v1
|
|
Improving supervised classification for high dimensional data by adding external information. Application to microarray dataAnnual conference of the Société française de Statistique (SFdS) and the Statistical Society of Canada (SSC), May 2008, Ottawa, Canada
Communication dans un congrès
hal-00461717v1
|
|
Impact of Dependence on the Stability of Model Selection in Supervised Classification for High-Throughput DataInternational Indian Statistical Association (IISA) Conference on Frontiers of Probability and Statistical Science, May 2008, Storrs, United States
Communication dans un congrès
hal-00461704v1
|
|
Reducing the Non-Discovery Rate by use of auxiliary variables in microarray experiments5th Workshop: Statistical methods for post-genomic data, Jan 2007, Paris, France
Communication dans un congrès
hal-00461724v1
|
|
Improving type II error rates of multiple tests by use of auxiliary variables. Application to microarray dataXIIth International conference on Applied Stochastic Models andData Analysis (ASMDA2007), May 2007, Chania, Greece
Communication dans un congrès
hal-00461712v1
|
|
Reducing the non-discovery rate by use of auxiliary variables in microarray experiments5. Workshop Statistical Methods for Post-Genomic Data (SMPGD), Jan 2007, Paris, France. 2 p
Communication dans un congrès
hal-02818987v1
|
|
Variations in mammary extraction of nutrients under the effect of a 36-h milk accumulation into the udder in dairy cows2. International Symposium on Energy and protein metabolism and nutrition ISEP, Sep 2007, Vichy, France
Communication dans un congrès
hal-02757430v1
|
|
Facteurs de variation de la détection des chaleurs chez la vache laitière conduite en vêlages groupésXIIIè Rencontres recherche ruminants, Dec 2006, Paris, France
Communication dans un congrès
hal-00461729v1
|
|
Maximising piglet survival57. Annual Meeting of the European Association for Animal Production, Sep 2006, Antalya, Turkey
Communication dans un congrès
hal-02754310v1
|
|
Maximizing piglet survival57th Annual Meeting of the European Association for Animal Production, Sep 2006, Antalya, Turkey
Communication dans un congrès
hal-00461746v1
|
|
Facteurs de variation de la détection des chaleurs chez la vache laitière conduite en vêlages groupés13. Rencontres Recherches Ruminants, Dec 2006, Paris, France
Communication dans un congrès
hal-02755093v1
|
|
Prédiction de la composition chimique des truies reproductrices à partir du poids vif et de l'épaisseur de lard dorsal : Application à la définition des besoins énergétiques29. Journées de la recherche porcine en France, Feb 1997, Paris, France
Communication dans un congrès
hal-02841306v1
|
|
Prediction sur matrices de distancesJournees de Statistique, May 1993, Vannes, France
Communication dans un congrès
hal-02779050v1
|
|
Integrative responses of pig adipose tissues to high-fat high-fiber diet: towards key regulators of energy flexibilityASAS/ADSA midwest meeting, Mar 2015, Des Moines, United States. Journal of Animal Science, 93 (Suppl. 2), 2015, Abstract book of the ASAS/ADSA midwest meeting
Poster de conférence
hal-01210925v1
|
How food protein structure can modulate the protein digestion behaviour: Evaluation by nutritional peptidomicsFood Structure and Functionality Forum Symposium from Molecules to Functionality, Mar 2014, Amsterdam, Netherlands. 2014
Poster de conférence
hal-01189918v1
|
|
|
The R package FANet: sparse factor analysis model for high dimensional gene co-expression networksThe International R Users Conference, Jun 2014, Los Angeles, United States. 2014, UserR contributed abstracts
Poster de conférence
hal-01211038v1
|
Factor modelling of multiple time series to detect temporal variations in common dynamics4. Channel Network Conference - IBS (International Biometry Society), Jul 2013, Saint Andrews, United Kingdom. 2013
Poster de conférence
hal-02805490v1
|
|
|
Quantitative Mass Spectrometry for nutritional peptidomics Evaluating the impact of food processing by multivariate statistical approachesEUPA 2013, Oct 2013, Saint-Malo, France. 2013
Poster de conférence
hal-01209499v1
|
Genetic analysis of a complex trait using transcriptomic data: contribution of gene regulatory network modelingXX. Conference of the plant & animal genome (PAG), Jan 2012, San Diego, United States. 2012
Poster de conférence
hal-02806842v1
|
|
|
Transcriptome profiling reveals interaction between two QTL for fatness in chickenXXXII International Conference on Animal Genetics (ISAG), Jul 2010, Edinburgh, United Kingdom. 2010, Proceedings of the 32nd International Conference on Animal Genetics
Poster de conférence
hal-02754276v1
|
|
A factor model to analyse heterogeneity in gene expression in a context of QTL characterizationXXXII International Conference on Animal Genetics (ISAG), Jul 2010, Edinburgh, United Kingdom. 2010, Proceedings of the 32nd International Conference on Animal Genetics
Poster de conférence
hal-02754275v1
|
Improving Type-II error rates of multiple testing procedures by use of auxiliary variables. Application to microrray dataRecent advances in Stochastic Modelling and data Analysis, World Scientific, pp.645-xx, 2007
Chapitre d'ouvrage
hal-00459319v1
|
|
Signal identification in ERP data by decorrelated Higher Criticism Thresholding2016
Pré-publication, Document de travail
hal-01310739v1
|
|
Large-scale significance testing of the full Moon effect on deliveries2009
Pré-publication, Document de travail
hal-00482743v1
|