    Number of documents

    86

    Richard Dufour


    Associate Professor in Computer science

    Speech and Language processing


    Research topics

    • Natural language processing
    • Information extraction
    • Social network analysis
    • Speech recognition

    Projects, partners and evaluation campaigns

    Projects

    • 2018-... : ANR The Voice: Study of dubbing voices. Partners : IRCAM, Dubbing Brothers.
    • 2015-2018: ANR GaFes: Study of uses via data collected on the Internet and re-editorialization of content captured or produced by Internet users. Partners: Centre Norbert Elias, Syllabs, GECE.
    • 2013-2017: ANR ContNomina: Identification of multimedia concepts. Partners : Eurecom, Syllabs, Wikio.
    • 2013-2014: ANR SuMACC: Identification of multimedia concepts through collaboration patterns. Partners: Eurecom, Syllabs, Wikio.
    • 2012-2014: ANR PERCOL: Person identification in audiovisual streams. Partners : Orange Labs, LIF, LILF.
    • 2007-2010: ANR EPAC: Automatic transcription of spontaneous speech. Partners : LIUM (Le Mans University), IRIT (University of Toulouse), LI (University of Tours), LIA (University of Avignon).

    Industrial collaborations

    • Orkis: Ph.D. thesis of Killian Janod.
    • Européenne des Données (EDD): Ph.D. thesis of Mohamed Bouaziz.

    Evaluation campaigns participation

    Scientific supervision

    Past thesis students

    • Adrien Gresse (2015-2020). Title : The art of the voice: characterizing the vocal information in an artistic choice. Co-supervised with Vincent Labatut (30%) and Jean-François Bonastre (40 %)
    • Mohamed Bouaziz (2013-2017). Title: Recurrent neural networks for sequence classification in parallel audiovisual streams. Co-supervised with Mohamed Morchid (30%) and Georges Linarès (40 %)
    • Killian Janod (2013-2017). Title: The representation of documents by neural networks for the comprehension of spoken documents. Co-supervised with Mohamed Morchid (30%) and Georges Linarès (40 %)
    • Mohamed Morchid (2011-2014). Title: Robust representations of noisy documents in homogeneous spaces. Co-supervised with Georges Linarès (50 %)

    Current thesis students

    • Noé Cécillon (2019-...). Title: Combination of content and structure by representation learning: application to the analysis of textual documents. Co-supervised with Vincent Labatut (30 %) and Georges Linarès (40 %)
    • Mathias Quillot (2018-...). Title: Representation of the acted voice. Co-supervised with Nicolas Obin (30 %) and Jean-François Bonastre (40 %)

    Master Interns

    • Noé Cécillon - Master 2 (6 months - February to August 2019). Title: Exploring characteristics of graph embeddings for the detection of abusive messages. Co-supervised with Vincent Labatut (50 %)
    • Adrien Gresse - Master 2 (6 months - February to August 2015). Title: Recommendation of movie music. Co-supervised with Georges Linarès (50 %)
    • Mathias Quillot - Master 2 (Alternating student 2015-2017). Title: Conception and realization of the observatory of festivals as part of the ANR project GaFes. Co-supervised with Georges Linarès (50 %)
    • Mathias Quillot - Licence 2 and 3 (3 months - 2014/2015). Title: Demonstrator for the ANR project ContNomina. Co-supervised with Georges Linarès (50 %)

    Associate professor at LIA (Since 2012)

    Since September 2012, I have been an associate professor at the Computer Science Laboratory of Avignon (LIA) in France. My research interests include automatic speech recognition, natural language processing and information extraction. I am particularly interested in the automatic recognition of person names in speech transcriptions and their diachronic aspect, as well as in the problem of evaluating speech recognition system performance in an application context. I am also involved in various projects funded by the French National Research Agency (ANR) and in several evaluation campaigns.

    Post-doctoral researcher at Orange Labs (2011-2012)

    From June 2011 to June 2012, I was a post-doctoral researcher at Orange Labs in Lannion (France). I mainly worked on the detection, characterization and correction of speech recognition errors. The objective of this research was to find the error regions (i.e. runs of consecutive errors) in automatic transcriptions, and then to categorize them in order to better understand the nature of the errors. These errors vary in nature: they may affect person names or other proper names, or result from homophony.
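To illustrate the notion of an error region, here is a minimal sketch that groups consecutive word-level errors into regions. It assumes each transcribed word already carries a boolean error flag, which is a hypothetical input format chosen for the example. How such flags were actually obtained in this work is not described here, and the function name `error_regions` is likewise illustrative.

```python
from typing import List, Tuple


def error_regions(is_error: List[bool]) -> List[Tuple[int, int]]:
    """Group consecutive error flags into (start, end) index pairs, end inclusive."""
    regions: List[Tuple[int, int]] = []
    start = None  # index where the current region began, if any
    for i, flag in enumerate(is_error):
        if flag and start is None:
            start = i  # a new region opens on the first erroneous word
        elif not flag and start is not None:
            regions.append((start, i - 1))  # a correct word closes the region
            start = None
    if start is not None:
        # The transcription ended while a region was still open.
        regions.append((start, len(is_error) - 1))
    return regions


# Hypothetical per-word error flags for a ten-word transcription.
flags = [False, True, True, False, False, True, False, True, True, True]
print(error_regions(flags))  # [(1, 2), (5, 5), (7, 9)]
```

Each region can then be passed as a unit to a later categorization or correction step.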

    Building on this work on error detection, we proposed a solution to correct person name errors in automatic transcriptions. This work is directly linked to the REPERE challenge project. Errors on person names can have a direct impact, for example, on document indexing. We chose to correct these errors automatically using the error region detection described above, with a correction approach at the phonetic level. Indeed, the wrongly transcribed words can be phonetically very close to the person name that should have been found. The proposed solution compares the phonetic sequence of a targeted error region with the phonetic sequences of all person names contained in a dictionary. The person name with the closest phonetic sequence is chosen as the correction. For example, the person name Sébastien Chabal ("s ei b a s t i in ch a b a l") has the closest phonetic sequence to the error region "s ei b a t i in ch a r a d e" (c'est bah tiens charade).
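The phonetic comparison can be sketched as follows. This is a minimal illustration, not the system that was actually built: it assumes that "closest" means the smallest Levenshtein (edit) distance computed over phoneme tokens, which the description above does not specify. The function names and the second dictionary entry are made up for the example.

```python
from typing import Dict, List, Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two phoneme sequences (unit costs)."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def closest_name(region: List[str], names: Dict[str, List[str]]) -> str:
    """Return the dictionary name whose phonemes are closest to the region."""
    return min(names, key=lambda name: edit_distance(region, names[name]))


name_dict = {
    "Sébastien Chabal": "s ei b a s t i in ch a b a l".split(),
    "Hypothetical Name": "m a r t in d u p on".split(),  # made-up entry
}
region = "s ei b a t i in ch a r a d e".split()
print(closest_name(region, name_dict))  # Sébastien Chabal
```

A real system would probably weight substitutions by phonetic similarity rather than use unit costs. That refinement is left out here.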

    Research intern at M*Modal (2010)

    I had the opportunity to do a 4-month research internship (June to October 2010) at M*Modal in Pittsburgh (USA). The major activity of this company is to provide a perfect transcription of medical reports. 

    I mainly worked on the automatic phonetization of the words contained in the dictionary of the company's multilingual ASR system. I proposed a strategy to estimate a confidence measure for each automatic phonetization produced by the Grapheme-To-Phoneme tool. This confidence score is intended to guide the manual correction of the automatic phonetizations: the human correctors focus first on the words with the poorest automatic phonetization (low confidence score). I also proposed a solution that uses audio documents to automatically select the n-best automatic pronunciations of a word.

    Assistant professor at LIUM (2010-2011)

    From October 2010 to June 2011, I was an assistant professor at the Computer Science Laboratory (LIUM) of the University of Le Mans (France). I mainly investigated whether the automatic spontaneous speech detection system proposed in my Ph.D. thesis could be used to characterize multimedia documents, particularly for speaker role recognition. The initial study sought to highlight the link between speech spontaneity and the role of a speaker in a show. Our initial intuition was that, for example, a journalist tends to prepare his discourse, while an interviewee is likely to speak in a less structured and less fluent way (and thus produce more spontaneous speech). For this study, we used a 100-hour radio broadcast corpus that was manually annotated with speaker roles and show types (a corpus built in the context of the EPAC project). I then applied the automatic speech type detection system proposed in my Ph.D. thesis.

    The second part of my work concerned the automatic recognition of speaker roles in radio broadcast shows using the features already extracted to detect spontaneous speech. We wanted to demonstrate that our speech type detection system could provide an alternative to existing speaker role recognition systems. The results showed that speaker role recognition was possible with this approach: 74.4% of the speakers were assigned their correct role.

    Ph.D. in Computer science at LIUM (2007-2010)

    Title: Automatic transcription of spontaneous speech

    Defended on 1 December 2010 at the University of Le Mans (France).

    Thesis committee:

    President
    Martine ADDA-DECKER (LPP/CNRS - Université de Paris 3)

    Members
    Guillaume GRAVIER (IRISA/CNRS - Université de Rennes 1)
    Denis JOUVET (LORIA/INRIA - Université de Nancy)

    Advisors
    Paul DELÉGLISE (LIUM - Université du Maine)
    Yannick ESTÈVE (LIUM - Université du Maine)

    Thesis abstract

    Automatic speech recognition (ASR) systems already reach a sufficient level of performance to be integrated into various applications (human-machine dialogue, information extraction, automatic indexing…). Nonetheless, in the context of large vocabulary continuous speech recognition (LVCSR), transcription quality may vary depending on the type of speech found in the documents. Indeed, ASR systems perform well on prepared speech, which is close to read text, while they have much more difficulty transcribing spontaneous speech, which is characterized by various specificities (disfluencies, ungrammaticality, decreased speech fluency…).

    The work of this thesis concerns the processing of spontaneous speech and was carried out within the EPAC project. The main objective is to propose ways to improve the performance of ASR systems on this type of speech. In our work, we chose to address spontaneous speech as a special object of study requiring specific processing.

    Thus, in a first step, we propose a tool to automatically detect spontaneous speech, based on the specificities of this type of speech. This system is important because it allows us, in a second step, to propose a semi-supervised approach for adapting the acoustic and language models of the ASR system to spontaneous speech. Transcriptions resulting from this adaptation offer recognition hypotheses different from those provided by the "classic" system. A significant reduction in word error rate was observed when combining the two systems (classic and adapted).

    The need for specific solutions finally oriented part of our work toward correcting a specifically linguistic problem: homophony. We sought to correct the transcripts provided by an ASR system, using a method that offers specific solutions to specific homophony problems. The proposed method, applied as a post-processing step, corrects some homophonic word errors, regardless of the ASR system used.


    Journal articles (8 documents)

    • Etienne Papegnies, Vincent Labatut, Richard Dufour, Georges Linares. Conversational Networks for Automatic Online Moderation. IEEE Transactions on Computational Social Systems, IEEE, 2019, 6 (1), pp.38-55. ⟨10.1109/tcss.2018.2887240⟩. ⟨hal-01999546v2⟩
    • Killian Janod, Mohamed Morchid, Richard Dufour, Georges Linares, Renato de Mori. Denoised Bottleneck Features From Deep Autoencoders for Telephone Conversation Analysis. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (9), pp.1809-1820. ⟨10.1109/TASLP.2017.2718843⟩. ⟨hal-02356138⟩
    • Mohamed Morchid, Juan-Manuel Torres-Moreno, Richard Dufour, Javier Ramírez-Rodríguez, Georges Linares. Automatic Text Summarization Approaches to Speed up Topic Model Learning Process. International Journal of Computational Linguistics and Applications, Alexander Gelbukh, 2016, 7, pp.87 - 109. ⟨hal-02356467⟩
    • Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linares, Driss Matrouf, et al.. Compact Multiview Representation of Documents Based on the Total Variability Space. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, ⟨10.1109/TASLP.2015.2431854⟩. ⟨hal-01319808⟩
    • Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linarès, Renato de Mori. Total Variability Space for LDA-based multi-view text categorization. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, ⟨10.1109/TASLP.2015.2431854⟩. ⟨hal-01322940⟩
    • Mohamed Morchid, Yonathan Portilla, Didier Josselin, Richard Dufour, Eitan Altman, et al.. An Author-Topic based Approach to Cluster Tweets and Mine their Location. Procedia Environmental Sciences, Elsevier, 2015, 27, pp.26-29. ⟨10.1016/j.proenv.2015.07.109⟩. ⟨hal-01251313⟩
    • Mohamed Morchid, Richard Dufour, Pierre-Michel Bousquet, Georges Linares, Juan-Manuel Torres-Moreno. Feature selection using Principal Component Analysis for massive retweet detection. Pattern Recognition Letters, Elsevier, 2014, ⟨10.1016/j.patrec.2014.05.020⟩. ⟨hal-01319767⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise. Characterizing and Detecting Spontaneous Speech: Application To Speaker Role Recognition. Speech Communication, Elsevier : North-Holland, 2014, pp.1--18. ⟨hal-01433222⟩

    Conference papers (75 documents)

    • Salima Mdhaffar, Yannick Estève, Antoine Laurent, Nicolas Hernandez, Richard Dufour, et al.. A Multimodal Educational Corpus of Oral Courses: Annotation, Analysis and Case Study. LREC, May 2020, Marseille, France. ⟨hal-02496700⟩
    • Mathias Quillot, Lauriane Guillou, Adrien Gresse, Rafaël Ferro, Raphaël Röth, et al.. La voix actée : pratiques, enjeux, applications. 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole, 2020, Nancy, France. pp.525-533. ⟨hal-02798582v3⟩
    • Noé Cecillon, Vincent Labatut, Richard Dufour, Georges Linares. WAC: A Corpus of Wikipedia Conversations for Online Abuse Detection. 12th International Conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France. ⟨hal-02497514⟩
    • Adrien Gresse, Mathias Quillot, Richard Dufour, Jean-François Bonastre. Apprentissage automatique de représentation de voix à l’aide d’une distillation de la connaissance pour le casting vocal. 6e conférence conjointe Journées d'Études sur la Parole (JEP, 33e édition), Traitement Automatique des Langues Naturelles (TALN, 27e édition), Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues (RÉCITAL, 22e édition). Volume 1 : Journées d'Études sur la Parole, 2020, Nancy, France. pp.280-288. ⟨hal-02798550v3⟩
    • Noé Cecillon, Vincent Labatut, Richard Dufour, Georges Linarès. Abusive Language Detection in Online Conversations by Combining Content- and Graph-based Features. ICWSM International Workshop on Modeling and Mining Social-Media-Driven Complex Networks, Jun 2019, Munich, Germany. pp.8. ⟨hal-02130205⟩
    • Salima Mdhaffar, Yannick Estève, Nicolas Hernandez, Antoine Laurent, Richard Dufour, et al.. Qualitative Evaluation of ASR Adaptation in a Lecture Context: Application to the PASTEL Corpus. Interspeech 2019, Sep 2019, Graz, Austria. pp.569-573, ⟨10.21437/Interspeech.2019-2661⟩. ⟨hal-02304620⟩
    • Adrien Gresse, Mathias Quillot, Richard Dufour, Vincent Labatut, Jean-François Bonastre. Similarity Metric Based on Siamese Neural Networks for Voice Casting. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2019, Brighton, United Kingdom. pp.6585-6589, ⟨10.1109/ICASSP.2019.8683178⟩. ⟨hal-02004762⟩
    • Richard Dufour, Mickael Rouvier, Alexandre Delorme, Damien Malinas. LIA@CLEF 2018: Mining events opinion argumentation from raw unlabeled Twitter data using convolutional neural network. CLEF 2018, Sep 2018, Avignon, France. ⟨hal-02356434⟩
    • Adrien Gresse, Richard Dufour, Vincent Labatut, Mickael Rouvier, Jean-François Bonastre. Mesure de similarité fondée sur des réseaux de neurones siamois pour le doublage de voix. XXXIIèmes Journées d’Études sur la Parole (JEP), Jun 2018, Aix-en-Provence, France. ⟨10.21437/JEP.2018-2⟩. ⟨hal-01819198⟩
    • Etienne Papegnies, Vincent Labatut, Richard Dufour, Georges Linarès. Detection of abusive messages in an on-line community. 14ème Conférence en Recherche d'Information et Applications (CORIA), Mar 2017, Marseille, France. pp.153-168, ⟨10.24348/coria.2017.16⟩. ⟨hal-01505017⟩
    • Etienne Papegnies, Vincent Labatut, Richard Dufour, Georges Linares. Impact Of Content Features For Automatic Online Abuse Detection. International Conference on Computational Linguistics and Intelligent Text Processing, Apr 2017, Budapest, Hungary. pp.404-419, ⟨10.1007/978-3-319-77116-8_30⟩. ⟨hal-01505502⟩
    • Etienne Papegnies, Richard Dufour, Vincent Labatut, Georges Linarès. Détection de messages abusifs au moyen de réseaux conversationnels. 8ème Conférence Modèles et Analyse des Réseaux : Approches Mathématiques et Informatiques (MARAMI), Oct 2017, La Rochelle, France. ⟨hal-01614279⟩
    • Adrien Gresse, Mickael Rouvier, Richard Dufour, Vincent Labatut, Jean-Francois Bonastre. Acoustic Pairing of Original and Dubbed Voices in the Context of Video Game Localization. Interspeech, Aug 2017, Stockholm, Sweden. pp.2839-2843, ⟨10.21437/Interspeech.2017-1311⟩. ⟨hal-01572151⟩
    • Etienne Papegnies, Vincent Labatut, Richard Dufour, Georges Linares. Graph-based Features for Automatic Online Abuse Detection. 5th International Conference on Statistical Language and Speech Processing, Oct 2017, Le Mans, France. pp.70-81, ⟨10.1007/978-3-319-68456-7_6⟩. ⟨hal-01571639⟩
    • Mathias Quillot, Cassandre Ollivier, Richard Dufour, Vincent Labatut. Exploring Temporal Analysis of Tweet Content from Cultural Events. 5th International Conference on Statistical Language and Speech Processing, Oct 2017, Le Mans, France. pp.82-93, ⟨10.1007/978-3-319-68456-7_7⟩. ⟨hal-01580578⟩
    • Titouan Parcollet, Mohamed Morchid, Pierre-Michel Bousquet, Richard Dufour, Georges Linarès, et al.. Quaternion Neural Networks for Spoken Language Understanding. 2016 IEEE Spoken Language Technology Workshop (SLT), Dec 2016, San Diego, United States. pp.362-368. ⟨hal-02107532⟩
    • Richard Dufour, Mohamed Morchid, Titouan Parcollet. Tracking dialog states using an Author-Topic based representation. 2016 IEEE Spoken Language Technology Workshop (SLT), Dec 2016, San Diego, United States. pp.544-551. ⟨hal-02107641⟩
    • Killian Janod, Mohamed Morchid, Richard Dufour, Georges Linares. A log-linear weighting approach in the Word2vec space for spoken language understanding. 2016 IEEE Spoken Language Technology Workshop (SLT), Dec 2016, San Diego, United States. pp.356-361, ⟨10.1109/SLT.2016.7846289⟩. ⟨hal-02356406⟩
    • Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linares, Renato de Mori. Parallel Long Short-Term Memory for multi-stream classification. 2016 IEEE Spoken Language Technology Workshop (SLT), Dec 2016, San Diego, United States. pp.218-223, ⟨10.1109/SLT.2016.7846268⟩. ⟨hal-02356410⟩
    • Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linares. Improving multi-stream classification by mapping sequence-embedding in a high dimensional space. 2016 IEEE Spoken Language Technology Workshop (SLT), Dec 2016, San Diego, United States. pp.224-231, ⟨10.1109/SLT.2016.7846269⟩. ⟨hal-02356420⟩
    • Mohamed Morchid, Mohamed Bouaziz, Waad Kheder, Killian Janod, Pierre-Michel Bousquet, et al.. Spoken Language Understanding in a Latent Topic-based Subspace. Interspeech 2016, Sep 2016, San Francisco, United States. ⟨10.21437/Interspeech.2016-50⟩. ⟨hal-02356390⟩
    • Killian Janod, Mohamed Morchid, Richard Dufour, Georges Linares, Renato De Mori. Deep Stacked Autoencoders for Spoken Language Understanding. Interspeech 2016, Sep 2016, San Francisco, United States. pp.720-724, ⟨10.21437/Interspeech.2016-63⟩. ⟨hal-02356395⟩
    • Frédéric Béchet, Meriem Bendris, Delphine Charlet, Géraldine Damnati, Benoit Favre, et al.. Identification de personnes dans des flux multimédia. CORIA 2015 - Conférence en Recherche d'Information et Applications - 12th French Information Retrieval Conference, Mar 2015, Paris, France. pp.239-251. ⟨hal-01194234⟩
    • Mohamed Morchid, Richard Dufour, Georges Linares, Youssef Hamadi. Latent Topic Model Based Representations for a Robust Theme Identification of Highly Imperfect Automatic Transcriptions. 16th International Conference, CICLing 2015, Apr 2015, Le Caire, Egypt. ⟨10.1007/978-3-319-18117-2_44⟩. ⟨hal-01293908⟩
    • Mohamed Morchid, Richard Dufour, Driss Matrouf. A Comparison of Normalization Techniques Applied to Latent Space Representations for Speech Analytics. Interspeech 2015, Sep 2015, Dresden, Germany. ⟨hal-02356373⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. Topic-space based setup of a neural network for theme identification of highly imperfect transcriptions. 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU), Dec 2015, Scottsdale, United States. ⟨10.1109/ASRU.2015.7404815⟩. ⟨hal-01339956⟩
    • Yannick Estève, Mohamed Bouallegue, Carole Lailler, Mohamed Morchid, Richard Dufour, et al.. Integration of word and semantic features for theme identification in telephone conversations. 6th International Workshop on Spoken Dialog Systems (IWSDS 2015), 2015, Busan, South Korea. ⟨hal-01433213⟩
    • Frédéric Béchet, Meriem Bendris, Delphine Charlet, Géraldine Damnati, Benoit Favre, et al.. Multimodal understanding for person recognition in video broadcasts. Interspeech, 2014, Singapore, Singapore. ⟨hal-01194244⟩
    • Mohamed Bouallegue, Mohamed Morchid, Richard Dufour, Driss Matrouf, Georges Linarès, et al.. Factor Analysis based Semantic Variability Compensation for Automatic Conversation Representation. Interspeech, May 2014, Singapore, Singapore. ⟨hal-01313121⟩
    • Mohamed Bouallegue, Mohamed Morchid, Richard Dufour, Driss Matrouf, Georges Linarès, et al.. Subspace Gaussian Mixture Models for Dialogues Classification. Interspeech, May 2014, Singapore, Singapore. ⟨hal-01313132⟩
    • Mohamed Morchid, Richard Dufour, Georges Linares, Renato de Mori. Classification de transcriptions automatiques imparfaites : Doit-on adapter le calcul du taux d'erreur-mot ?. JEP, Jun 2014, Le Mans, France. ⟨hal-01319782⟩
    • Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linarès, Driss Matrouf, et al.. An I-vector Based Approach to Compact Multi-Granularity Topic Spaces Representation of Textual Documents. The 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Oct 2014, Doha, Qatar. ⟨10.3115/v1/D14-1051⟩. ⟨hal-01318651⟩
    • Mohamed Morchid, Mohamed Bouallegue, Richard Dufour, Georges Linarès, Driss Matrouf, et al.. I-vector based Representation of Highly Imperfect Automatic Transcriptions. INTERSPEECH, May 2014, Singapore, Singapore. ⟨hal-01318657⟩
    • Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linarès, Renato de Mori. Theme Identification in Human-Human Conversations with Features from Specific Speaker Type Hidden Spaces. INTERSPEECH, May 2014, Singapore, Singapore. ⟨hal-01318666⟩
    • Mohamed Morchid, Richard Dufour, Pierre-Michel Bousquet, Mohamed Bouallegue, Georges Linarès, et al.. Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2014, Florence, Italy. ⟨10.1109/ICASSP.2014.6853571⟩. ⟨hal-01318674⟩
    • Mohamed Morchid, Stéphane Huet, Richard Dufour. A Topic-Based Approach for Post-processing Correction of Automatic Translations. 11th International Workshop on Spoken Language Translation (IWSLT), 2014, South Lake Tahoe, NV, United States. ⟨hal-02021817⟩
    • Benjamin Bigot, Gregory Senay, Georges Linares, Corinne Fredouille, Richard Dufour. Modèles contextuels continus pour la reconnaissance des noms de personnes dans des transcriptions automatiques. JEP 2014, Jun 2014, Le Mans, France. ⟨hal-02356488⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. A Combined Thematic and Acoustic Approach for a Music Recommendation Service in TV Commercials. ISMIR, Oct 2014, Taipei, Taiwan. ⟨hal-01319769⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. A LDA-Based Topic Classification Approach from Highly Imperfect Automatic Transcriptions. LREC, May 2014, Reykjavik, Iceland. ⟨hal-01319771⟩
    • Mohamed Morchid, Richard Dufour, Usman Niaz, Francis Bouvier, Clément de Groc, et al.. Sumacc Project's Corpus: A Topic-based Query Extention Approach to Retrieve Multimedia Documents. 17th International Conference, TSD, Sep 2014, Brno, Czech Republic. ⟨hal-01319787⟩
    • Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linarès. Author-topic based representation of call-center conversations. IEEE Spoken Language Technology Workshop (SLT), Dec 2014, South Lake Tahoe, United States. ⟨10.1109/SLT.2014.7078577⟩. ⟨hal-01318662⟩
    • Babak Loni, Jonathon Hare, Mihai Georgescu, Michael Riegler, Xiaofei Zhu, et al.. Getting by with a Little Help from the Crowd: Practical Approaches to Social Image Labeling. CrowdMM '14 International ACM Workshop on Crowdsourcing for Multimedia, Nov 2014, Orlando, United States. ⟨10.1145/2660114.2660123⟩. ⟨hal-01318338⟩
    • Mohamed Morchid, Georges Linarès, Richard Dufour. Characterizing and Predicting Bursty Events: The Buzz Case Study on Twitter. LREC, May 2014, Reykjavik, Iceland. ⟨hal-01319806⟩
    • Xavier Bost, Ilaria Brunetti, Luis Adrian Cabrera Diego, Jean-Valère Cossu, Andréa Linhares, et al.. Systèmes du LIA à DEFT'13. DEFT2013, Jun 2013, Les Sables d'Olonne, France. ⟨hal-01313065⟩
    • Benoit Favre, Géraldine Damnati, Frédéric Béchet, Meriem Bendris, Delphine Charlet, et al.. PERCOLI: a person identification system for the 2013 REPERE challenge. SLAM 2013 : First Workshop on Speech, Language and Audio in Multimedia, Aug 2013, Marseille, France. pp.55-60. ⟨hal-00834204⟩
    • Benjamin Bigot, Gregory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Combining Acoustic Name Spotting and Continuous Context Models to improve Spoken Person Name Recognition in Speech. Interspeech 2013, Aug 2013, Lyon, France. ⟨hal-02102829⟩
    • Grégory Senay, Benjamin Bigot, Richard Dufour, Georges Linarès, Corinne Fredouille. Person name spotting by combining acoustic matching and LDA topic models. Interspeech 2013, Aug 2013, Lyon, France. ⟨hal-01340026⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. Combinaison de thèmes latents pour la contextualisation de Tweets. EGC'2013 13e Conférence Francophone sur l'Extraction et la Gestion des connaissances, Jan 2013, Toulouse, France. ⟨hal-01339909⟩
    • Benjamin Bigot, Gregory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Person name recognition in ASR outputs using continuous context models. 2013 IEEE International Conference on Acoustics, Speech and Signal, May 2013, Vancouver, Canada. ⟨hal-01339113⟩
    • Benjamin Bigot, Gregory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Person name recognition in ASR outputs using continuous context models. 2013 IEEE International Conference on Acoustics, Speech and Signal, May 2013, Vancouver, Canada. ⟨hal-01339105⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. Thematic Representation of Short Text Messages with Latent Topics: Application in the Twitter context. PACLING 2013, Sep 2013, Tokyo, Japan. ⟨hal-01319779⟩
    • Mohamed Morchid, Richard Dufour, Georges Linarès. Event detection from image hosting services by slightly-supervised multi-span context models. 2013 11th International Workshop on Content-Based Multimedia Indexing (CBMI), Jun 2013, Veszprém, Hungary. ⟨10.1109/CBMI.2013.6576563⟩. ⟨hal-01318645⟩
    • Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linares, Driss Matrouf. LIA @ MediaEval 2013 Crowdsourcing Task: Metadata or not Metadata? That is a Fashion Question. MediaEval, Oct 2013, Barcelone, Spain. ⟨hal-01318659⟩
    • Mohamed Morchid, Richard Dufour, Mohamed Bouallegue, Georges Linares, Driss Matrouf. LIA @ MediaEval 2013 MusiClef Task: A Combined Thematic and Acoustic Approach. MediaEval, Oct 2013, Barcelone, Spain. ⟨hal-01318668⟩
    • Mohamed Bouallegue, Grégory Senay, Mohamed Morchid, Driss Matrouf, Georges Linares, et al.. LIA @ MediaEval 2013 Spoken Web Search Task: An I-Vector based Approach. MediaEval, Oct 2013, Barcelone, Spain. ⟨hal-01313144⟩
    • Benjamin Bigot, Grégory Senay, Georges Linarès, Corinne Fredouille, Richard Dufour. Person name recognition in ASR outputs using continuous context models. IEEE International Conference on Acoustics, Speech and Signal Processing, May 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6639318⟩. ⟨hal-01314411⟩
    • Jean Valère Cossu, Benjamin Bigot, Ludovic Bonnefoy, Mohamed Morchid, Xavier Bost, et al.. LIA@RepLab 2013. Replab: An evaluation campaign for Online Reputation Management Systems, Fourth International Conference of the CLEF initiative, Sep 2013, Valencia, Spain. ⟨hal-01967853⟩
    • Richard Dufour, Antoine Laurent, Yannick Estève. Combinaison d’approches pour la reconnaissance du rôle des locuteurs. JEP 2012, 2012, Grenoble, France. ⟨hal-01450727⟩
    • Richard Dufour, Geraldine Damnati, Delphine Charlet. Automatic error region detection and characterization in LVCSR transcriptions of TV news shows. ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2012, Kyoto, Japan. pp.4445-4448, ⟨10.1109/ICASSP.2012.6288906⟩. ⟨hal-02356284⟩
    • Richard Dufour, Géraldine Damnati, Delphine Charlet. Détection et caractérisation des régions d'erreurs dans des transcriptions de contenus multimédia : application à la recherche des noms de personnes. JEP 2012, Jun 2012, Grenoble, France. pp.811 - 818. ⟨hal-02356478⟩
    • Richard Dufour, Geraldine Damnati, Delphine Charlet, Frédéric Béchet. Automatic transcription error recovery for Person Name Recognition. Interspeech 2012, Sep 2012, Portland, United States. ⟨hal-02356295⟩
    • Mickael Rouvier, Richard Dufour, Georges Linarès, Yannick Estève. LIA @ MediaEval 2011 : Compact Representation of Heterogeneous Descriptors for Video Genre Classification. MediaEval, Sep 2011, Pise, Italy. ⟨hal-01320178⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise. Investigation of Spontaneous Speech Characterization Applied to Speaker Role Recognition. Interspeech 2011, 2011, Firenze, Italy. ⟨hal-01433512⟩
    • Richard Dufour, Benoit Favre. Semi-supervised Part-of-speech Tagging in Speech Applications. Interspeech, 2010, Tokyo, Japan. ⟨hal-01194268⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise. Automatic indexing of speech segments with spontaneity levels on large audio database. ACM Workshop on Searching Spontaneous Conversational Speech, 2010, Firenze, Italy. ⟨hal-01452962⟩
    • Mickael Rouvier, Richard Dufour, Georges Linarès, Yannick Estève. A Language-identification inspired method for spontaneous speech detection. INTERSPEECH, Sep 2010, Makuhari, Japan. ⟨hal-01320176⟩
    • Richard Dufour, Benoit Favre. Semi-supervised Part-of-speech Tagging in Speech Applications. Interspeech 2010, 2010, Makuhari, Japan. ⟨hal-01433898⟩
    • Yannick Estève, Paul Deléglise, Sylvain Meignier, Simon Petitrenaud, Holger Schwenk, et al.. Some recent research work at LIUM based on the use of CMU Sphinx. CMU SPUD Workshop, 2010, Dallas (Texas), United States. ⟨hal-01434933⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise, Frédéric Béchet. Utilisation conjointe de modèles locaux et globaux pour la caractérisation et la détection de segments de parole spontanée. JEP 2010, 2010, Mons, Belgium. ⟨hal-01433896⟩
    • Richard Dufour, Fethi Bougares, Yannick Estève, Paul Deléglise. Unsupervised model adaptation on targeted speech segments for LVCSR system combination. Interspeech 2010, 2010, Makuhari, Japan. ⟨hal-01433900⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise, Frédéric Béchet. Local and global models for spontaneous speech segment detection and characterization. IEEE ASRU 2009, 2009, Merano, Italy. ⟨hal-01433914⟩
    • Richard Dufour, Vincent Jousse, Yannick Estève, Frédéric Béchet, Georges Linarès. Spontaneous Speech Characterization and Detection in Large Audio Database. 13-th International Conference on Speech and Computer (SPECOM 2009), 2009, St Petersburg, Russia. ⟨hal-01433943⟩
    • Richard Dufour, Yannick Estève, Paul Deléglise. Corrections spécifiques du français sur les systèmes de reconnaissance automatique de la parole. RJCP 2009, 2009, Avignon, France. ⟨hal-01433948⟩
    • Richard Dufour. From prepared speech to spontaneous speech recognition system. CSTST, Oct 2008, Cergy-Pontoise, France. pp.595, ⟨10.1145/1456223.1456345⟩. ⟨hal-02356750⟩
    • Richard Dufour, Yannick Estève. Correcting ASR outputs: specific solutions to specific errors in French. IEEE Workshop on Spoken Language Technology (SLT 2008), 2008, Goa, India. ⟨hal-01434038⟩

    Poster communications (1 document)

    • Mohamed Morchid, Yonathan Portilla, Didier Josselin, Richard Dufour, Eitan Altman, et al.. An Author-Topic based Approach to Cluster Tweets and Mine their Location. Spatial Statistics 2015, Jun 2015, Avignon, France. ⟨hal-01250549⟩

    Preprints, Working Papers, ... (1 document)

    • Adrien Gresse, Mathias Quillot, Richard Dufour, Jean-François Bonastre. Learning Voice Representation Using Knowledge Distillation For Automatic Voice Casting. 2020. ⟨hal-02572383⟩

    Theses (1 document)

    • Richard Dufour. Transcription automatique de la parole spontanée. Computer Science [cs]. Université du Maine, 2010. In French. ⟨tel-00595465⟩