Gérard Bailly
201
Documents
Présentation
Publications
- 80
- 32
- 17
- 16
- 15
- 15
- 13
- 11
- 10
- 9
- 9
- 8
- 8
- 7
- 7
- 7
- 7
- 7
- 7
- 7
- 6
- 6
- 6
- 6
- 6
- 5
- 5
- 5
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 9
- 3
- 3
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 8
- 7
- 12
- 3
- 4
- 11
- 9
- 10
- 11
- 8
- 5
- 10
- 11
- 18
- 7
- 17
- 11
- 22
- 12
- 1
- 1
- 1
- 1
- 21
- 14
- 12
- 11
- 5
- 4
- 3
- 2
- 1
|
Probing the Inductive Biases of a Gaze Model for Multi-party InteractionHRS 2024 - 2024 ACM/IEEE International Conference on Human-Robot Interaction (HRI '24), Mar 2024, Boulder, CO, United States. pp.507-511, ⟨10.1145/3610978.3640702⟩
Communication dans un congrès
hal-04510252v1
|
|
Advocating for text input in multi-speaker text-to-speech systemsSSW 2023 - 12th ISCA Speech Synthesis Workshop (SSW2023), Gérard Bailly; Olivier Perrotin; Thomas Hueber; Damien Lolive; Nicolas Obin, Aug 2023, Grenoble, France. pp.1-7, ⟨10.21437/SSW.2023-1⟩
Communication dans un congrès
hal-04257685v1
|
|
On the Benefit of Independent Control of Head and Eye Movements of a Social Robot for Multiparty Human-Robot InteractionHCII 2023 - 25th International Conference on. Human-Computer Interaction HCII 2023, Jul 2023, Copenhague, Denmark. pp.450-466, ⟨10.1007/978-3-031-35596-7_29⟩
Communication dans un congrès
hal-04185780v1
|
|
Local Style Tokens: Fine-Grained Prosodic Representations For TTS Expressive ControlSSW 2023 - 12th ISCA Speech Synthesis Workshop (SSW2023), Gérard Bailly; Olivier Perrotin; Thomas Hueber; Damien Lolive; Nicolas Obin, Aug 2023, Grenoble, France. pp.120-126, ⟨10.21437/SSW.2023-19⟩
Communication dans un congrès
hal-04257713v1
|
|
The GIPSA-Lab Text-To-Speech System for the Blizzard Challenge 202318th Blizzard Challenge Workshop, Aug 2023, Grenoble, France. pp.34-39, ⟨10.21437/Blizzard.2023-3⟩
Communication dans un congrès
hal-04269935v1
|
|
Data-Driven Generation of Eyes and Head Movements of a Social Robot in Multiparty ConversationICSR 2023 - 15th International Conference on Social Robotics (ICSR 2023), Dec 2023, Doha, Qatar. pp.191-203, ⟨10.1007/978-981-99-8715-3_17⟩
Communication dans un congrès
hal-04335472v1
|
|
The Blizzard Challenge 202318th Blizzard Challenge Workshop, Aug 2023, Grenoble, France. pp.1-27, ⟨10.21437/Blizzard.2023-1⟩
Communication dans un congrès
hal-04269927v1
|
|
Multiparty attention management for an embodied conversational agentJNRH 2022 - Journées Nationales de la Robotique Humanoïde, Jul 2022, Angers, France
Communication dans un congrès
hal-03780683v1
|
|
Automatic Verbal Depiction of a Brick Assembly for a Robot Instructing HumansSIGDIAL 2022 - 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL 2022), Sep 2022, Edinburgh, United Kingdom
Communication dans un congrès
hal-03754055v1
|
|
Modélisation de la Parole avec Tacotron2 : Analyse acoustique et phonétique des plongements de caractèreJEP 2022 - 34e Journées d’Études sur la Parole, Jun 2022, Noirmoutier, France
Communication dans un congrès
hal-03727735v1
|
|
Speaking Rate Control of end-to-end TTS Models by Direct Manipulation of the Encoder's Output EmbeddingsInterspeech 2022 - 23rd Annual Conference of the International Speech Communication Association, Sep 2022, Incheon, South Korea. pp.11-15, ⟨10.21437/interspeech.2022-759⟩
Communication dans un congrès
hal-03793220v2
|
|
Comparing NLP solutions for the disambiguation of French heterophonic homographs for end-to-end TTS systemsSPECOM 2022 - 24th International Conference on Speech and Computer (SPECOM), Nov 2022, Kitt Gurugram, India. pp.265-278, ⟨10.1007/978-3-031-20980-2_23⟩
Communication dans un congrès
hal-03858736v1
|
|
Impact of Segmentation and Annotation in French end-to-end SynthesisSSW 11th ISCA Speech Synthesis Workshop, Aug 2021, Budapest, Hungary. pp.13-18, ⟨10.21437/SSW.2021-3⟩
Communication dans un congrès
hal-03362000v1
|
FLUENCE : projet de conception et d’expérimentation in-situ, longitudinale et à grande échelle d’applications tablettes pour prévenir les difficultés d’apprentissage de la lectureSILE 2021, May 2021, Sherbrooke, Canada
Communication dans un congrès
hal-03248965v1
|
|
Évaluation de dispositifs numériques innovants pour l’apprentissage de la lecture et de l’anglais : une expérimentation longitudinale en condition écologiqueSFERE 2021 - 2ème édition du Colloque de SFERE-Provence, Mar 2021, Marseille, France
Communication dans un congrès
hal-03187570v1
|
|
|
L'impact des robots sur notre cognition : l'effet de présence robotiqueWACAI 2021 - Workshop sur les “Affects, Compagnons Artificiels et Interactions” (ACAI), Centre National de la Recherche Scientifique [CNRS], Oct 2021, Saint Pierre d'Oléron, France
Communication dans un congrès
hal-03377544v1
|
|
EVASION, ELARGIR et LUCIOLE : 3 jeux tablettes du projet FLUENCE pour prévenir les difficultés d’apprentissage de la lecture et de l’anglaisPRUNE II 2021 - Colloque Perspectives de Recherches sur les Usages du Numérique dans l'Éducation, Apr 2021, Paris (virtuel), France
Communication dans un congrès
hal-03187547v1
|
|
Suivi longitudinal de la fluence en lecture par évaluation automatique de la paroleEIAH 2021 - 10e Conférence sur les Environnements Informatiques pour l’Apprentissage Humain, Jun 2021, Fribourg (Virtual), Suisse. pp.70-81
Communication dans un congrès
hal-03292753v1
|
|
Expérimentation à grande échelle d'applications pour tablettes pour favoriser l'apprentissage de la lecture et de l'anglais oralEIAH 2021 - 10e Conférence sur les Environnements Informatiques pour l’Apprentissage Humain, Marie Lefevre, Christine Michel, Jun 2021, Fribourg, Suisse. pp.118-129
Communication dans un congrès
hal-03292798v1
|
|
Impact of social presence of humanoid robots: does competence matter?ICSR 2021 - International Conference on Remote Sensing, Nov 2021, Singapour, Singapore
Communication dans un congrès
hal-03411321v1
|
|
Evaluating the Extrapolation Capabilities of Neural Vocoders to Extreme Pitch ValuesInterspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.11-15, ⟨10.21437/Interspeech.2021-1547⟩
Communication dans un congrès
hal-03338483v1
|
|
Speech in action: designing challenges that require incremental processing of self and others' speech and performative gesturesNLG4HRI 2020 - 2nd Workshop on Natural Language Generation for Human-Robot Interaction, Dec 2020, Dublin (virtual), Ireland
Communication dans un congrès
hal-03084920v1
|
|
Predicting Multidimensional Subjective Ratings of Children' Readings from the Speech Signals for the Automatic Assessment of FluencyLREC 2020 - 12th Conference on Language Resources and Evaluation (LREC 2020), May 2020, Marseille, France. pp.317-322
Communication dans un congrès
hal-03039160v1
|
|
Style Transfer and Extraction for the Handwritten Letters Using Deep LearningICAART 2019 - 11th International Conference on Agents and Artificial Intelligence, Feb 2019, Prague, Czech Republic
Communication dans un congrès
hal-02049006v1
|
|
Reading Prosody Development: Automatic Assessment for a Longitudinal StudySLaTE 2019 - 8th ISCA Workshop on Speech and Language Technology in Education, Sep 2019, Graz, Austria. ⟨10.21437/SLaTE.2019-20⟩
Communication dans un congrès
hal-02181469v1
|
|
Un Karaoké pour Entraîner Prosodie et Compréhension en LectureEIAH 2019 - Environnements Informatiques pour l'Apprentissage Humain, Jun 2019, Paris, France
Communication dans un congrès
hal-02141164v1
|
|
PySFC - A System for Prosody Analysis based on the Superposition of Functional Contours Prosody ModelSpeech Prosody 2018 - 9th International Conference on Speech Prosody, Jun 2018, Poznan, Poland. pp.774-778, ⟨10.21437/SpeechProsody.2018-157⟩
Communication dans un congrès
hal-01821214v1
|
|
Immersive Teleoperation of the Eye Gaze of Social Robots Assessing Gaze-Contingent Control of Vergence, Yaw and Pitch of Robotic EyesISR 2018 - 50th International Symposium on Robotics, VDE, Jun 2018, Munich, Germany. pp.232-239
Communication dans un congrès
hal-01779633v1
|
|
The significance of scope in modelling tones in ChineseTAL 2018 - Sixth International Symposium on Tonal Aspects of Languages (TAL2018), Jun 2018, Berlin, Germany. pp.183-187, ⟨10.21437/TAL.2018-37⟩
Communication dans un congrès
hal-01834964v1
|
|
Handwriting Styles: Benchmarks and Evaluation MetricsIEEE International Workshop on Deep and Transfer Learning (DTL 2018), Oct 2018, Valencia, Spain
Communication dans un congrès
hal-01900765v1
|
|
Comparing cascaded LSTM architectures for generating head motion from speech in task-oriented dialogsHCI 2018 - 20th International Conference on Human-Computer Interaction, Jul 2018, Las Vegas, United States. pp.164-175
Communication dans un congrès
hal-01848063v1
|
|
Demonstrating and Learning Multimodal Socio-communicative Behaviors for HRI: Building Interactive Models from Immersive Teleoperation DataFAIM/ISCA Workshop on Artificial Intelligence for Multimodal Human Robot Interaction, Jul 2018, Stockholm, Sweden. pp.39-43, ⟨10.21437/AI-MHRI.2018-10⟩
Communication dans un congrès
hal-01835008v1
|
|
Autonomous Sensorimotor Learning for Sound Source Localization by a Humanoid RobotIROS 2018 - Workshop on Crossmodal Learning for Intelligent Robotics in conjunction with IEEE/RSJ IROS, Oct 2018, Madrid, Spain
Communication dans un congrès
hal-01921882v1
|
|
A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic ContoursInterspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India. ⟨10.21437/interspeech.2018-1286⟩
Communication dans un congrès
hal-01921906v1
|
|
Evaluation of Reading Performance of Primary School Children: Objective Measurements vs. Subjective RatingsWOCCI 2017 - 6th Workshop on Child Computer Interaction, Nov 2017, Glasgow, United Kingdom. ⟨10.21437/WOCCI.2017-4⟩
Communication dans un congrès
hal-01638355v1
|
|
An Evaluation Framework to Assess and Correct the Multimodal Behavior of a Humanoid Robot in Human-Robot InteractionGESPIN 2017 - GEstures and SPeech in INteraction, Aug 2017, Posnan, Poland
Communication dans un congrès
hal-01578713v1
|
|
Improving fluency of young readers: introducing a Karaoke to learn how to breath during a Reading-while-Listening taskSLaTE 2017 - 7th ISCA Workshop on Speech and Language Technology in Education, Aug 2017, Stockholm, Sweden. pp.127-131, ⟨10.21437/SLaTE.2017-22⟩
Communication dans un congrès
hal-01575223v1
|
|
Acquiring Human-Robot Interaction skills with Transfer Learning TechniquesACM/IEEE International Conference on Human-Robot Interaction, Mar 2016, Vienne, Austria. pp.359 - 360, ⟨10.1145/3029798.3034823⟩
Communication dans un congrès
hal-01490211v1
|
|
Demonstrating to a humanoid robot how to conduct neuropsychological testsJNRH 2016 - Journées Nationales de la Robotique Humanoïde, LAAS - Toulouse, Jun 2016, Toulouse, France. pp.10-12
Communication dans un congrès
hal-01342349v1
|
|
Conducting neuropsychological tests with a humanoid robot: design and evaluationCogInfoCom 2016 - IEEE International Conference on Cognitive Infocommunications, Oct 2016, Wroclaw, Poland. pp.337-342
Communication dans un congrès
hal-01385666v1
|
|
Characterization of Audiovisual Dramatic AttitudesInterspeech 2016 - 17th Annual Conference of the International Speech Communication Association, Sep 2016, San Francisco, United States. pp.585-589, ⟨10.21437/Interspeech.2016-75⟩
Communication dans un congrès
hal-01337077v1
|
|
Quantitative analysis of backchannels uttered by an interviewer during neuropsychological testsInterspeech 2016 - 17th Annual Conference of the International Speech Communication Association, Sep 2016, San Francisco, United States. ⟨10.21437/Interspeech.2016-22⟩
Communication dans un congrès
hal-01372819v1
|
|
Adaptive Latency for Part-of-Speech Tagging in Incremental Text-to-Speech SynthesisInterspeech 2016 - 17th Annual Conference of the International Speech Communication Association, Sep 2016, San Francisco, CA, United States. pp.2846 - 2850, ⟨10.21437/Interspeech.2016-165⟩
Communication dans un congrès
hal-01374782v1
|
|
HMM Training Strategy for Incremental Speech SynthesisInterspeech 2015 - 16th Annual Conference of the International Speech Communication Association, ISCA, Sep 2015, Dresden, Germany. pp.1201-1205
Communication dans un congrès
hal-01228889v1
|
The effect of audio-visual synchronization in reading while listening to texts: An eye-tracking studyESCOP 2015 - 19th Meetings of the European Society for Cognitive Psychology, Sep 2015, Paphos, Cyprus
Communication dans un congrès
hal-01838760v1
|
|
|
Audiovisual Generation of Social Attitudes from Neutral StimuliFAAVSP 2015 - 1st Joint Conference on Facial Analysis, Animation and Auditory-Visual Speech Processing, Sep 2015, Vienne, Austria. pp.34-39
Communication dans un congrès
hal-01178056v1
|
|
Using Karaoke to enhance reading while listening: impact on word memorization and eye movementsSLaTE 2015 - ISCA Workshop on Speech and Language Technology in Education, Sep 2015, Leipzig, Germany. pp.59-64
Communication dans un congrès
hal-01192870v1
|
|
Impact of Iris Size and Eyelids Coupling on the Estimation of the Gaze Direction of a Robotic Talking Head by Human ViewersHumanoids 2015 - IEEE-RAS 15th International Conference on Humanoid Robots, Nov 2015, Séoul, South Korea. pp.148-153
Communication dans un congrès
hal-01228887v1
|
|
Learning joint multimodal behaviors for face-to-face interaction: performance & properties of statistical modelsHuman-Robot Interaction. Workshop on Behavior Coordination between Animals, Humans, and Robots, Mar 2015, Portland, United States
Communication dans un congrès
hal-01110290v1
|
|
Beaming the Gaze of a Humanoid RobotHuman-Robot Interaction. Workshop on Behavior Coordination between Animals, Humans, and Robots, Mar 2015, Portland, United States. ⟨10.1145/2701973.2701992⟩
Communication dans un congrès
hal-01110288v1
|
|
Qualitative assessment of an immersive teleoperation environment for collaborative professional activities in a "beaming" experimentEuroVR 2015 - European conference for Virtual Reality and Augmented Reality, Oct 2015, Milan, Italy. 8 p
Communication dans un congrès
hal-01228890v1
|
|
Assessing objective characterizations of phonetic convergenceInterspeech 2014 - 15th Annual Conference of the International Speech Communication Association, Sep 2014, Singapour, Singapore. pp.P-19-9
Communication dans un congrès
hal-01067610v1
|
|
An articulated talking face for the iCubHumanoids 2014 - IEEE-RAS International Conference on Humanoid Robots, Nov 2014, Madrid, Spain
Communication dans un congrès
hal-01110293v1
|
Virtual conversational agents and social robots: converging challengesWACAI 2014 - Workshop Affect, Compagnon Artificiel, Interaction, Jun 2014, Rouen, France
Communication dans un congrès
hal-01488239v1
|
|
|
Is breathing sensitive to the communication partner?Speech Prosody 2014 - 7th International Conference on Speech Prosody, May 2014, Dublin, Ireland. pp.613-618
Communication dans un congrès
hal-01004426v1
|
Modeling sensory-motor behaviors for social robotsWACAI 2014 - Workshop Affect, Compagnon Artificiel, Interaction, Jun 2014, Rouen, France
Communication dans un congrès
hal-01527421v1
|
|
|
Modeling Perception-Action Loops: Comparing Sequential Models with Frame-Based ClassifiersHAI 2014 - 2nd International Conference on Human-Agent Interaction, Oct 2014, Tsukuba, Japan. pp.309-314
Communication dans un congrès
hal-01061454v1
|
|
Beyond Basic Emotions: Expressive Virtual Actors with Social AttitudesMIG 2014 - 7th International ACM SIGGRAPH Conference on Motion in Games 2014 (MIG 2014), Nov 2014, Los Angeles, United States. pp.39-47, ⟨10.1145/2668084.2668084⟩
Communication dans un congrès
hal-01064989v1
|
|
Adaptation of respiratory patterns in collaborative readingInterspeech 2013 - 14th Annual Conference of the International Speech Communication Association, Aug 2013, Lyon, France. pp.1653-1657
Communication dans un congrès
hal-00851890v1
|
|
Audio-Visual Speaker Conversion using Prosody FeaturesAVSP 2013 - 12th International Conference on Auditory-Visual Speech Processing, Aug 2013, Annecy, France. pp.11-16
Communication dans un congrès
hal-00842928v1
|
|
Speaker adaptation of an acoustic-to-articulatory inversion model using cascaded Gaussian mixture regressionsInterspeech 2013 - 14th Annual Conference of the International Speech Communication Association, Aug 2013, Lyon, France. pp.2753-2757
Communication dans un congrès
hal-00851894v1
|
Introduction to the proceedings of the SLaTE 2013 workshop on Speech and Language Technology in EducationSLaTE 2013 - Speech and Language Technology in Education, Aug 2013, Grenoble, France. pp.8-10
Communication dans un congrès
hal-00960360v1
|
|
Cross-speaker acoustic-to-articulatory inversion using phone-based trajectory HMMInterspeech 2012 - 13th Annual Conference of the International Speech Communication Association, Sep 2012, Portland, United States. pp.Tue.SS3.08
Communication dans un congrès
hal-00974347v1
|
|
|
Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech InterfaceInterspeech 2012 - 13th Annual Conference of the International Speech Communication Association, Sep 2012, Portland, United States. pp.Tue.P3c.01
Communication dans un congrès
hal-00741682v1
|
|
Pauses and respiratory markers of the structure of book readingInterspeech 2012 - 13th Annual Conference of the International Speech Communication Association, Sep 2012, Portland, United States. pp.Thu.O9d.05
Communication dans un congrès
hal-00741667v1
|
|
Original objective and subjective characterization of phonetic convergenceISICS 2012: International Symposium on Imitation and Convergence in Speech, Sep 2012, Aix-en-Provence, France. pp.O1:2
Communication dans un congrès
hal-00741686v1
|
|
Characterizing phonetic convergence with speaker recognition techniquesLISTA 2012 - The Listening Talker Workshop (LISTA 2012), May 2012, Édimbourg, United Kingdom. pp.28-31
Communication dans un congrès
hal-00695558v1
|
|
Vizart3D : retour articulatoire visuel pour l'aide à la prononciationJEP-TALN-RECITAL 2012 - conférence conjointe 29e Journées d'Études sur la Parole, 19e Traitement Automatique des Langues Naturelles, 14e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Jun 2012, Grenoble, France. pp.17-18
Communication dans un congrès
hal-00725513v1
|
|
Toward a multi-speaker visual articulatory feedback systemInterspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.589-592
Communication dans un congrès
hal-00618781v1
|
Improvement of HMM-based acoustic-to-articulatory speech inversionISSP 2011 - 9th International Seminar on Speech Production, Jun 2011, Montreal, Canada. pp.n/c
Communication dans un congrès
hal-00724652v1
|
|
|
Synchronous reading: learning French orthography by audiovisual trainingInterspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.1153-1156
Communication dans un congrès
hal-00618780v1
|
Toward a speaker-independent visual articulatory feedback systemISSP 2011 - 9th International Seminar on Speech Production, Jun 2011, Montreal, Canada. pp.n/c
Communication dans un congrès
hal-00724655v1
|
|
Statistical mapping between articulatory and acoustic data. Application to Silent Speech Interface and Visual Articulatory FeedbackP3S - 1st International Workshop on Performative Speech and Singing Synthesis [P3S], Mar 2011, Vancouver, Canada
Communication dans un congrès
hal-00640395v1
|
|
Articulatory-to-acoustic mapping: application to silent speech interface and visual articulatory feedbackPEVOC 2015 - 11th Pan-European Voice Conference, Aug 2011, Marseille, France
Communication dans un congrès
hal-00640396v1
|
|
Human-Machine Interaction. Mutual attention and accommodation during face-to-face interactionRESCOM 2012 - Researching Communication at UWS Brain, Behaviour and Computation, Dec 2011, Sydney, Australia
Communication dans un congrès
hal-00652587v1
|
|
Differences in articulatory strategies between silent, whispered and normal speech ? A pilot study using ElectroMagnetic ArticulographyISSP 2011 - 9th International Seminar on Speech Production, Jun 2011, Montreal, Canada. pp.n/c
Communication dans un congrès
hal-00724657v1
|
|
Visual articulatory feedback for phonetic correction in second language learningInterspeech 2010 - 11th Annual Conference of the International Speech Communication Association, Sep 2010, Makuhari, Japan. pp.n.c
Communication dans un congrès
hal-00508272v1
|
|
Can tongue be recovered from face? The answer of data-driven statistical modelsInterspeech 2010 - 11th Annual Conference of the International Speech Communication Association, Sep 2010, Makuhari, Japan. pp.2002-2005
Communication dans un congrès
hal-00508276v1
|
|
|
Speech, gaze and head motion in a face-to-face collaborative taskESSV 2010 - 21st Conference on Electronic Speech Signal Processing, Sep 2010, Berlin, Germany
Communication dans un congrès
hal-00523906v1
|
|
Facilitative Effects of Communicative Gaze and Speech in Human-Robot CooperationAFFINE 2010 - 3rd International Workshop on Affective Interaction in Natural Environments, Oct 2010, Florence, Italy. pp.71-74
Communication dans un congrès
hal-00531002v1
|
Méthodes basées sur les HMMs et les GMMs pour l'inversion acoustico-articulatoire en paroleJEP 2010 - 28e Journées d'Etudes sur la Parole, May 2010, Mons, Belgique. pp.249-252
Communication dans un congrès
hal-00508281v1
|
|
Exploiting multimodal data fusion in robust speech recognitionICME 2010 - IEEE International Conference on Multimedia and Expo, Jul 2010, Singapour, Singapore. in press
Communication dans un congrès
hal-00508288v1
|
|
Acoustic-to-articulatory inversion in speech based on statistical modelsAVSP 2010 - 9th International Conference on Auditory-Visual Speech Processing, Sep 2010, Hakone, Kanagawa, Japan. pp.S8-3
Communication dans un congrès
hal-00508279v1
|
|
|
Speech dominoes and phonetic convergenceInterspeech 2010 - 11th Annual Conference of the International Speech Communication Association, Sep 2010, Makuhari, Japan. pp.1153-1156
Communication dans un congrès
hal-00523890v1
|
|
On the Importance of Eye Gaze in a Face-to-Face Collaborative TaskAFFINE 2010 - 3rd International Workshop on Affective Interaction in Natural Environments, Oct 2010, Florence, Italy. pp.81-85
Communication dans un congrès
hal-00531001v1
|
|
HMMs and GMMs based methods in acoustic-to-articulatory speech inversionRJCP 2009 - 8ème Rencontres des Jeunes Chercheurs en Parole, Nov 2009, Avignon, France. pp.Article 182
Communication dans un congrès
hal-00443662v1
|
|
Acoustic-to-articulatory inversion using speech recognition and trajectory formation based on phoneme hidden Markov modelsInterspeech 2009 - 10th Annual Conference of the International Speech Communication Association, Sep 2009, Brighton, United Kingdom. pp.2255-2258
Communication dans un congrès
hal-00419227v1
|
|
Multimodal HMM-based NAM-to-speech conversionInterspeech 2009 - 10th Annual Conference of the International Speech Communication Association, Sep 2009, Brighton, United Kingdom. pp.656-659
Communication dans un congrès
hal-00419232v1
|
|
Improvement to a NAM captured whisper-to-speech systemInterspeech 2008 - 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.1465-1468
Communication dans un congrès
hal-00333288v1
|
|
Predicting F0 and voicing from NAM-captured whispered speechSpeech Prosody 2008 - 4th International Conference on Speech Prosody, May 2008, Campinas, Brazil. pp.107-110
Communication dans un congrès
hal-00333290v1
|
|
Speaking with smile or disgust: data and modelsAVSP 2008 - 7th International Conference on Auditory-Visual Speech Processing, Sep 2008, Moreton Island, Australia. pp.111-116
Communication dans un congrès
hal-00333673v1
|
|
Amélioration de la conversion de voix chuchotée enregistrée par capteur NAM vers la voix audibleJEP 2008 - 27e Journées d'Etudes sur la Parole, Jun 2008, Avignon, France. pp.110-113
Communication dans un congrès
hal-00339058v1
|
|
The trainable trajectory formation model TD-HMM parameterized for the LIPS 2008 challengeInterspeech 2008 - 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.561
Communication dans un congrès
hal-00339043v1
|
|
From 3-D speaker cloning to text-to-audiovisual speechAVSP 2008 - 7th International Conference on Auditory-Visual Speech Processing, Sep 2008, Moreton Island, Australia. pp.43-46
Communication dans un congrès
hal-00361888v1
|
|
Can you "read tongue movements"?Interspeech 2008 - 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.2635-2637
Communication dans un congrès
hal-00333688v1
|
|
LIPS2008: Visual speech synthesis challengeInterspeech 2008 - 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.2310-2313
Communication dans un congrès
hal-00333655v1
|
|
From 3-D speaker cloning to text-to-audiovisual speechInterspeech 2008 - 9th Annual Conference of the International Speech Communication Association, Sep 2008, Brisbane, Australia. pp.2325
Communication dans un congrès
hal-00361886v1
|
Vision of tongue in augmented speech: contribution to speech comprehension and visual tracking strategiesSpeech and Face-to-Face communication - A workshop / Summer School dedicated to the Memory of Christian Benoît, Oct 2008, Grenoble, France
Communication dans un congrès
hal-00337412v1
|
|
|
Generating Spanish intonation with a trainable prosodic modelSpeech Prosody 2008 - 4th International Conference on Speech Prosody, May 2008, Campinas, Brazil. pp.63-66
Communication dans un congrès
hal-00339046v1
|
|
Reconstruction faciale 3D à partir d'images 3DRFIA 2008 - 16ème congrès francophone AFRIF-AFIA Reconnaissance des formes et Intelligence Artificielle, Jan 2008, Amiens, France. pp.article 94
Communication dans un congrès
hal-00419243v1
|
|
Retargeting cued speech hand gestures for different talking heads and speakersAVSP 2008 - 7th International Conference on Auditory-Visual Speech Processing, Sep 2008, Moreton Island, Australia. pp.8
Communication dans un congrès
hal-00342426v1
|
|
Learning optimal audiovisual phasing for a HMM-based control model for facial animationSSW2007 - 6th ISCA Workshop on Speech Synthesis (SSW6), Aug 2007, Bonn, Germany. pp.1-4
Communication dans un congrès
hal-00169576v1
|
|
Can you "read tongue movements"? Evaluation of the contribution of tongue display to speech understandingASSISTH 2007 - 1ère Conférence internationale sur l'accessibilité et les systèmes de suppléance aux personnes en situation de handicaps, Dec 2007, Toulouse, France. pp.187-193
Communication dans un congrès
hal-00175680v1
|
|
Towards eyegaze-aware analysis and synthesis of audiovisual speechAVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.50-56
Communication dans un congrès
hal-00169556v1
|
|
Analyzing and modeling gaze during face-to-face interactionIVA 2007 - 7th International Conference on Intelligent Virtual Agents, Sep 2007, Paris, France. pp.100-101
Communication dans un congrès
hal-00169579v1
|
|
Intelligibility of natural and 3D-cloned German speechAVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.56-61
Communication dans un congrès
hal-00169563v1
|
Peut-on lire sur la langue ? Évaluation de l'apport de la vision de la langue à la compréhension de la paroleJPC 2007 - 2èmes Journées de Phonétique Clinique, Dec 2007, Grenoble, France
Communication dans un congrès
hal-00175679v1
|
|
|
Mutual gaze during face-to-face interactionAVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.180-185
Communication dans un congrès
hal-00169566v1
|
|
Scrutinizing natural scenes: controlling the gaze of an embodied conversational agentIVA 2007 - 7th International Conference on Intelligent Virtual Agents, Sep 2007, Paris, France. pp.50-61
Communication dans un congrès
hal-00170337v1
|
|
Evaluation de systèmes de génération de mouvements faciauxJournées d'Etudes sur la Parole, Jun 2006, Rennes, France. pp.305-308
Communication dans un congrès
hal-00366538v1
|
|
Does a Virtual Talking Face Generate Proper Multimodal Cues to Draw User's Attention to Points of Interest?International conference on Language Resources and Evaluation (LREC), May 2006, Genoa, Italy. pp.2544-2549
Communication dans un congrès
hal-00366537v1
|
|
Evaluation d'un système de synthèse 3D de Langue française Parlée ComplétéeJournées d'Etudes sur la Parole, Jun 2006, Rennes, France. pp.495-498
Communication dans un congrès
hal-00366539v1
|
|
Plateforme expérimentale de capturerestitution croisée pour l'étude de la communication face-à-faceWorkshop sur les Agents Conversationnels Animés (WACA), Oct 2006, Toulouse, France. pp.C18
Communication dans un congrès
hal-00480347v1
|
|
Scrutation de scènes naturelles par un agent conversationnel animéWorkshop sur les Agents Conversationnels Animés (WACA), Oct 2006, Toulouse, France. pp.C16
Communication dans un congrès
hal-00480354v1
|
|
Degrees of freedom of facial movements in face-to-face conversational speechInternational Workshop on Multimodal Corpora, 2006, Genoa, Italy. pp.33-36
Communication dans un congrès
hal-00195551v1
|
|
Evaluation of a virtual speech cuerWorkshop on Experimental Linguistics, Aug 2006, Athens, Greece. pp.141-144
Communication dans un congrès
hal-00366488v1
|
|
A joint intelligibility evaluation of French text-to-speech synthesis systems: the EvaSy SUS/ACR campaign5th edition of the International Conference on Language Ressources and Evaluation (LREC 2006), May 2006, Genoa, Italy. pp.2034-2037
Communication dans un congrès
hal-00103571v1
|
|
A joint prosody evaluation of French text-to-speech synthesis systems5th edition of the International Conference on Language Ressources and Evaluation (LREC 2006), May 2006, Genoa, Italy. pp.307-310
Communication dans un congrès
hal-00103557v1
|
|
Contrôle du regard et des mouvements des paupières d'une tête parlante virtuelleWorkshop sur les Agents Conversationnels Animés (WACA), Oct 2006, Toulouse, France. pp.C15
Communication dans un congrès
hal-00480356v1
|
|
TDA: A new trainable trajectory formation system for facial animationInterspeech, Sep 2006, Pittsburgh, United States. pp.2474-2477
Communication dans un congrès
hal-00366489v1
|
|
Generating German intonation with a trainable prosodic modelInterspeech, Sep 2006, Pittsburgh, United States. pp.2366-2369
Communication dans un congrès
hal-00366490v1
|
|
Rackham: An Interactive Robot-GuideIEEE International Workshop on Robots and Human Interactive Communications (ROMAN), Sep 2006, Hatfield, United Kingdom. pp.502-509
Communication dans un congrès
hal-00480381v1
|
|
Evaluating a virtual speech cuerInterspeech, Sep 2006, Pittsburgh, United States. pp.2430-2433
Communication dans un congrès
hal-00366491v1
|
|
A new trainable trajectory formation system for facial animationWorkshop on Experimental Linguistics, Aug 2006, Athens, Greece. pp.25-32
Communication dans un congrès
hal-00366487v1
|
Audiovisual Speech Enhancement Experiments for Mouth Segmentation Evaluation2006
Communication dans un congrès
hal-00114114v1
|
|
|
Embodied conversational agents : computing and rendering realistic gaze patternsPacific Rim Conference on Multimedia Processing, Oct 2006, Hangzhou, China. pp.9-18
Communication dans un congrès
hal-00143624v1
|
|
ARTUS : calcul et tatouage audiovisuel des mouvements d'un personnage animé virtuel pour l'accessibilité d'émissions télévisuelles aux téléspectateurs sourds comprenant la Langue Française Parlée ComplétéeHandicap, Jun 2006, Paris, France. pp.265-270
Communication dans un congrès
hal-00366492v1
|
|
Basic components of a face-to-face interaction with a conversational agent: multual attention and deixisSmart Objects and Ambient Intelligence, Oct 2005, Grenoble, France. pp.247-252
Communication dans un congrès
hal-00366540v1
|
Non-Linear Active Model for Mouth Inner and Outer Contours DetectionEUSIPCO, 2005, Antalya, Turkey
Communication dans un congrès
hal-00378352v1
|
|
Modèle statistique et description locale d'apparence non linéaire pour la détection des contours des lèvresGRETSI, 2005, Louvain, Belgique
Communication dans un congrès
hal-00378354v1
|
|
|
Face-to-face interaction with a conversationnal agent: eye-gaze and deixisInternational Conference on Autonomous Agents and Multiagent Systems, 2005, Utrecht, Netherlands. pp.17-22
Communication dans un congrès
hal-00419299v1
|
|
Capturing data and realistic 3D models for cued speech analysis and audiovisual synthesisAuditory-Visual Speech Processing Workshop, 2005, Vancouver, Canada. pp.125-130
Communication dans un congrès
hal-00419311v1
|
|
3D statistical facial reconstruction2005, pp.365-370
Communication dans un congrès
hal-00108498v1
|
|
Evaluating the pronunciation of proper names by four French grapheme-to-phoneme convertersInterspeech'2005 - Eurospeech. 9th European Conference on Speech Communication and Technology, Sep 2005, Lisbonne, Portugal. pp.1521-1524
Communication dans un congrès
hal-00103607v1
|
Statistical Active Model for Mouth Components SegmentationICASSP, 2005, Philadelphia, United States
Communication dans un congrès
hal-00378353v1
|
|
|
Multimodal face-to-face interaction with a talking face: mutual attention and deixisHuman-Computer Interaction, Jul 2005, France. 10 p
Communication dans un congrès
hal-00516324v1
|
|
MOTHER: A new generation of talking heads providing a flexible articulatory control for video-realistic speech animationInt. Conference of Spoken Language Processing, ICSLP'2000, Oct 2000, Pekin, China
Communication dans un congrès
inria-00389362v1
|
|
A Closer Look at Latent Representations of End-to-end TTS ModelsJournée commune AFIA-TLH / AFCP – “Extraction de connaissances interprétables pour l’étude de la communication parlée”, Dec 2023, Avignon, France.
Poster de conférence
hal-04269953v1
|
|
ELARGIR : s’entraîner à lire avec fluence et expressivitéPoster de conférence hal-04328841v1 |
|
Un Karaoké pour Entrainer la Prosodie en LectureEIAH 2019, Jun 2019, Paris, France
Poster de conférence
hal-02141234v1
|
|
SGCS: Stereo Gaze Contingent Steering for Immersive TelepresencePoster de conférence hal-01677481v1 |
|
SGCS : Stereo Gaze Contingent Steering for Immersive TelepresenceECEM 2017 - 19th European Conference on Eye Movements, Aug 2017, Wuppertal, Germany. , European Conference on Eye Movements (ECEM)
Poster de conférence
hal-01638383v1
|
|
Cognition, Affects et Interaction2017, Cognition, Affects et Interaction
Ouvrages
hal-01483705v1
|
|
Cognition, Affects et Interactionpp.88, 2016
Ouvrages
cel-01258860v2
|
Audiovisual Speech ProcessingG. Bailly, P. Perrier and E. Vatikiotis-Bateson (Eds.). Cambridge University Press, pp.506, 2012, 978-1107006829
Ouvrages
hal-00691938v1
|
|
Speech and face-to-face communicationElsevier, pp.135, 2010, ISSN:0167-6393
Ouvrages
hal-00941200v1
|
THERADIA: Digital Therapies Augmented by Artificial IntelligenceAdvances in Neuroergonomics and Cognitive Engineering. AHFE 2021., 259, Springer International Publishing, pp.478-485, 2021, Lecture Notes in Networks and Systems, 978-3-030-80284-4. ⟨10.1007/978-3-030-80285-1_55⟩
Chapitre d'ouvrage
hal-03411225v1
|
|
|
Gaze and face-to-face interactionGeert Brône & Bert Oben. Eye-tracking in Interaction. Studies on the role of eye gaze in dialogue, Benjamins, pp.139 - 168, 2018, ⟨10.1075/ais.10.07bai⟩
Chapitre d'ouvrage
hal-01939223v1
|
|
Social behavior modeling based on Incremental Discrete Hidden Markov ModelsHuman Behavior Understanding. 4th International Workshop, HBU 2013, Barcelona, Spain, October 22, 2013. Proceedings, Springer International Publishing, pp.172-183, 2013, Lecture Notes in Computer Science, n°8212, 978-3-319-02714-2. ⟨10.1007/978-3-319-02714-2_15⟩
Chapitre d'ouvrage
hal-00851903v1
|
Sensorimotor characteristics of speech productionG. Bailly, P. Perrier and E. Vatikiotis-Bateson (Eds.). Audiovisual Speech Processing, Cambridge University Press, pp.368-396, 2012, 978-1107006829
Chapitre d'ouvrage
hal-00694313v1
|
|
|
Des machines parlantes aux agents conversationnels incarnésCatherine Garbay, Daniel Kayser (Eds.). Informatique et Sciences Cognitives : influences ou confluences?, Ophrys, pp.215-234, 2011
Chapitre d'ouvrage
hal-00634982v1
|
|
Study of the phenomenon of phonetic convergence thanks to speech dominoesA. Esposito, A. Vinciarelli, K. Vicsi, C. Pelachaud and A. Nijholt. Analysis of Verbal and Nonverbal Communication and Enactment: The Processing Issue, Springer Verlag, pp.280-293, 2011, LNCS AI
Chapitre d'ouvrage
hal-00603164v1
|
Speech, Gaze and Head Motion in a Face-to-Face Collaborative TaskAnna Esposito, Antonietta M. Esposito, Raffaele Martone, Vincent C. Müller, Gaetano Scarpetta. Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces: Theoretical and Practical Issues, Springer-Verlag, pp.265-274, 2010, Lecture Notes in Computer Science (LNCS) n°6456, 978-3-642-18183-2
Chapitre d'ouvrage
hal-00531003v1
|
|
The Analysis of French Cued Speech Production-Perception. Towards a complete Text-to-Cued Speech SynthesizerC. LaSasso, K. L. Crain and J. Leybaert. Cued Speech and Cued Language Development for Deaf and Hard of Hearing Children, Plural Publishing Inc., San Diego, CA,, pp.449-466, 2010, 978-1-59756-334-5
Chapitre d'ouvrage
hal-00473028v1
|
|
|
Speech technologies for augmented communicationJ. Mullennix and S. Stern. Computer synthesized speech technologies: tools for aiding impairment, IGI Global, Hershey, PA, pp.116-128, 2010
Chapitre d'ouvrage
hal-00473026v1
|
|
Parole et expression des émotions sur le visage d'humanoïdes virtuelsP. Fuchs, G. Moreau and S. Donikian. Traité de la réalité virtuelle: Volume 5 : les humains virtuels, Presses de l'Ecole des Mines de Paris, pp.187-208, 2009, Mathématique et informatique, 9782911256042
Chapitre d'ouvrage
hal-00376537v1
|
|
La campagne EvaSy d'évaluation de la synthèse de la parole à partir du texteStéphane Chaudiron; Khalid Choukri. L' évaluation des technologies de traitement de la langue : les campagnes Technolangue, Hermès science publ.; Lavoisier, pp.183-208, 2008, (IC2. Cognition et traitement de l'information), 978-2-7462-1992-2
Chapitre d'ouvrage
hal-00361911v1
|
An audiovisual talking head for augmented speech generation: models and animations based on a real speaker's articulatory dataF.J. Perales & R.B. Fisher. Proceedings of the Vth Conference on Articulated Motion and Deformable Objects (AMDO 2008), 5098, Springer Verlag: Berlin, Heidelberg, Germany, pp.132-143, 2008, Lecture Notes in Computer Science, 5098, ⟨10.1007/978-3-540-70517-8_14⟩
Chapitre d'ouvrage
hal-00296599v1
|
|
|
Virtual talking heads and ambiant face-to-face communicationA. Esposito; E. Keller; M. Marinaro and M. Bratanic. The fundamentals of verbal and non-verbal communication and the biometrical issue, IOS Press BV, Amsterdam, pp.302-316, 2007, 978-1-58603-733-8
Chapitre d'ouvrage
hal-00361913v1
|
|
Morphing Generic Organs To Speaker-Specific AnatomiesJ. Harrington & M. Tabain. Speech Production: Models, Phonetic Processes, and Techniques, Psychology Press, New York, pp.341-362, 2006, Chapter 20, ISBN 1841694371
Chapitre d'ouvrage
hal-00108522v1
|
Procédé de repositionnement des frontières de phonèmes pour la synthèse visuelle des mouvements faciaux liés à la paroleFrance, N° de brevet: FR0757063. Département Parole et Cognition. 2008
Brevet
hal-00361889v1
|
|
Échelle MultiDimensionnelle de Fluence EMDF2021
Autre publication scientifique
hal-03208139v1
|
|
Statistical skull models from 3D X-ray images2006
Pré-publication, Document de travail
hal-00108507v1
|
Cognition, Affects et InteractionMaster. St Martin d'Hères, France. 2014
Cours
cel-01110281v1
|