Laurent Girin

139 Documents

Publications

	A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Séguier Neural Networks, 2024, 172, pp.106120. ⟨10.1016/j.neunet.2024.106120⟩ Article dans une revue hal-04132316v1
	Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation Xiaoyu Lin , Laurent Girin , Xavier Alameda-Pineda Transactions on Machine Learning Research Journal, 2024, pp.1-19 Article dans une revue hal-03584014v1
	Learning and controlling the source-filter representation of speech with a variational autoencoder Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Séguier Speech Communication, 2023, 148, pp.53-65. ⟨10.1016/j.specom.2023.02.005⟩ Article dans une revue hal-03650569v3
	A survey of sound source localization with deep learning methods Pierre-Amaury Grumiaux , Srđan Kitić , Laurent Girin , Alexandre Guérin Journal of the Acoustical Society of America, 2022, 152 (1), pp.107-151. ⟨10.1121/10.0011809⟩ Article dans une revue hal-03952034v1
	Unsupervised Speech Enhancement using Dynamical Variational Autoencoders Xiaoyu Bie , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, 30, pp.2993 - 3007. ⟨10.1109/TASLP.2022.3207349⟩ Article dans une revue hal-03295630v1
	Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers Yutong Ban , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (5), pp.1761-1776. ⟨10.1109/TPAMI.2019.2953020⟩ Article dans une revue hal-01950866v2
	Dynamical Variational Autoencoders: A Comprehensive Review Laurent Girin , Simon Leglaive , Xiaoyu Bie , Julien Diard , Thomas Hueber Foundations and Trends in Machine Learning, 2021, 15 (1-2), pp.1-175. ⟨10.1561/2200000089⟩ Article dans une revue hal-02926215v2
	Make That Sound More Metallic: Towards a Perceptually Relevant Control of the Timbre of Synthesizer Sounds Using a Variational Autoencoder Fanny Roche , Thomas Hueber , Maëva Garnier , Samuel Limier , Laurent Girin Transactions of the International Society for Music Information Retrieval (TISMIR), 2021, 4, pp.52 - 66. ⟨10.5334/tismir.76⟩ Article dans une revue hal-03247371v1
	Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning Thomas Hueber , Eric Tatulli , Laurent Girin , Jean-Luc Schwartz Neural Computation, 2020, 32 (3), pp.596-625. ⟨10.1162/neco_a_01264⟩ Article dans une revue hal-03016083v1
	Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders Mostafa Sadeghi , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.1788-1800. ⟨10.1109/TASLP.2020.3000593⟩ Article dans une revue hal-02364900v3
	Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (9), pp.1365-1377. ⟨10.1109/TASLP.2019.2919183⟩ Article dans une revue hal-01969041v1
	Audio-noise Power Spectral Density Estimation Using Long Short-term Memory Xiaofei Li , Simon Leglaive , Laurent Girin , Radu Horaud IEEE Signal Processing Letters, 2019, 26 (6), pp.918-922. ⟨10.1109/LSP.2019.2911879⟩ Article dans une revue hal-02100059v1
	Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (3), pp.645-659. ⟨10.1109/TASLP.2019.2892412⟩ Article dans une revue hal-01799809v1
	Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments Xiaofei Li , Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud IEEE Journal of Selected Topics in Signal Processing, 2019, 13 (1), pp.88-103. ⟨10.1109/JSTSP.2019.2903472⟩ Article dans une revue hal-01851985v2
	Expectation-Maximization for Speech Source Separation using Convolutive Transfer Function Xiaofei Li , Laurent Girin , Radu Horaud CAAI Transactions on Intelligent Technologies, 2019, 4 (1), pp.47 - 53. ⟨10.1049/trit.2018.1061⟩ Article dans une revue hal-01982250v1
	Assessing the Performances of different Neural Network Architectures for the Detection of Screams and Shouts in Public Transportation Pierre Laffitte , Yun Wang , David Sodoyer , Laurent Girin Expert Systems with Applications, 2019, 117, pp.29-41. ⟨10.1016/j.eswa.2018.08.052⟩ Article dans une revue hal-01892436v1
	Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function Xiaofei Li , Sharon Gannot , Laurent Girin , Radu Horaud IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (10), pp.1755-1768. ⟨10.1109/TASLP.2018.2839362⟩ Article dans une revue hal-01645749v3
	Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (10), pp.1997 - 2012. ⟨10.1109/TASLP.2017.2740001⟩ Article dans une revue hal-01413417v1
	Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract Diandra Fabre , Thomas Hueber , Laurent Girin , Xavier Alameda-Pineda , Pierre Badin Speech Communication, 2017, 93, pp.63 - 75. ⟨10.1016/j.specom.2017.08.002⟩ Article dans une revue hal-01578315v1
	Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping Laurent Girin , Thomas Hueber , Xavier Alameda-Pineda IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩ Article dans une revue hal-01485540v1
	Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces Florent Bocquelet , Thomas Hueber , Laurent Girin , Christophe Savariaux , Blaise Yvert PLoS Computational Biology, 2016, 12 (11), pp.e1005119. ⟨10.1371/journal.pcbi.1005119⟩ Article dans une revue hal-01459706v1
	Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (11), pp.2171 - 2186. ⟨10.1109/TASLP.2016.2598319⟩ Article dans une revue hal-01349691v1
	Key considerations in designing a speech brain-computer interface Florent Bocquelet , Thomas Hueber , Laurent Girin , Stephan Chabardès , Blaise Yvert Journal of Physiology - Paris, 2016, 110 (4, Part A), pp.392-401. ⟨10.1016/j.jphysparis.2017.07.002⟩ Article dans une revue hal-01978301v1
	Low Bit-Rate Speech Codec Based on a Long-Term Harmonic Plus Noise Model Faten Ben Ali , Sonia Djaziri-Larbi , Laurent Girin Journal of the Audio Engineering Society, 2016, 64 (11), pp.844-857. ⟨10.17743/jaes.2016.0028⟩ Article dans une revue hal-02520614v1
	A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (8), pp.1408-1423. ⟨10.1109/TASLP.2016.2554286⟩ Article dans une revue hal-01301762v1
	Speaker-Adaptive Acoustic-Articulatory Inversion using Cascaded Gaussian Mixture Regression Thomas Hueber , Laurent Girin , Xavier Alameda-Pineda , Gérard Bailly IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (12), pp.2246-2259. ⟨10.1109/TASLP.2015.2464702⟩ Article dans une revue hal-01231197v1
	Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression Antoine Deleforge , Radu Horaud , Yoav Y. Schechner , Laurent Girin IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (4), pp.718-731. ⟨10.1109/TASLP.2015.2405475⟩ Article dans une revue hal-01112834v3
	A high-rate data hiding technique for uncompressed audio signals Jonathan Pinel , Laurent Girin , Cléo Baras Journal of the Audio Engineering Society, 2014, 62 (6), pp.400-413. ⟨10.17743/jaes.2014.0024⟩ Article dans une revue hal-01143294v1
	Fast and accurate direct MDCT to DFT conversion with arbitrary window functions Shuhua Zhang , Laurent Girin IEEE Transactions on Audio, Speech and Language Processing, 2013, 21 (3), pp.567-578. ⟨10.1109/TASL.2012.2227737⟩ Article dans une revue hal-00807031v1
	A mediating role of the auditory dorsal pathway in selective adaptation to speech: a state-dependent transcranial magnetic stimulation study Krystyna Grabski , Pascale Tremblay , Vincent Gracco , Laurent Girin , Marc Sato Brain Research, 2013, 1515 (5), pp.55-65. ⟨10.1016/j.brainres.2013.03.024⟩ Article dans une revue hal-00915287v1
	Informed source separation through spectrogram coding and data embedding Antoine Liutkus , Jonathan Pinel , Roland Badeau , Laurent Girin , Gael Richard Signal Processing, 2012, 92 (8), pp.1937-1949. ⟨10.1016/j.sigpro.2011.09.016⟩ Article dans une revue hal-00643957v1
	Interactive Music with Active Audio CDs Sylvain Marchand , Boris Mansencal , Laurent Girin Lecture Notes in Computer Science, 2011, 6684, pp.31-50. ⟨10.1007/978-3-642-23126-1_3⟩ Article dans une revue hal-00625211v1
	Informed source separation of linear instantaneous under-determined audio mixtures by source index embedding Mathieu Parvaix , Laurent Girin IEEE Transactions on Audio, Speech and Language Processing, 2011, 19 (6), pp.1721-1733. ⟨10.1109/TASL.2010.2097250⟩ Article dans une revue hal-00695763v1
	A Watermarking-Based Method for Informed Source Separation of Audio Signals with a Single Sensor Mathieu Parvaix , Laurent Girin , Jean-Marc Brossier IEEE Transactions on Audio, Speech and Language Processing, 2010, 18 (6), pp.1464-1475 Article dans une revue hal-00486809v1
	Adaptive long-term coding of LSF parameters trajectories for large delay / very- to ultra-low bit-rate speech coding Laurent Girin EURASIP Journal on Audio, Speech, and Music Processing, 2010, 2010 (Article ID 597039), pp.n/c. ⟨10.1155/2010/597039⟩ Article dans une revue hal-00534492v1
	A study of lip movements during spontaneous dialog and its application to voice activity detection David Sodoyer , Bertrand Rivet , Laurent Girin , Christophe Savariaux , Jean-Luc Schwartz Journal of the Acoustical Society of America, 2009, 125 (2), pp.1184-1196. ⟨10.1121/1.3050257⟩ Article dans une revue hal-00941145v1
	Perceptual long-term variable-rate sinusoidal modeling of speech Laurent Girin , Mohammad Firouzmand , Sylvain Marchand IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (3), pp.851-861. ⟨10.1109/TASL.2006.885928⟩ Article dans une revue hal-00194164v1
	Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures Bertrand Rivet , Laurent Girin , Christian Jutten IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (1), pp.96-108. ⟨10.1109/TASL.2006.872619⟩ Article dans une revue hal-00174100v1
	Visual voice activity detection as a help for speech source separation from convolutive mixtures Bertrand Rivet , Laurent Girin , Christian Jutten Speech Communication, 2007, 49 (7-8), pp.667-677. ⟨10.1016/j.specom.2007.04.008⟩ Article dans une revue hal-00499184v1
	Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients Bertrand Rivet , Laurent Girin , Christian Jutten IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (3), pp.796-802. ⟨10.1109/TASL.2006.885922⟩ Article dans une revue hal-00174096v1
	ARTUS: synthesis and audiovisual watermarking of the movements of a virtual agent interpreting subtitling using cued speech for deaf televiewers Gérard Bailly , Virginie Attina , Cléo Baras , Patrick Bas , Séverine Baudry Modelling, measurement and control C, 2006, 67SH (2, supplement : handicap), pp.177-187 Article dans une revue hal-00157826v1
	Developing an audio-visual speech source separation algorithm David Sodoyer , Laurent Girin , Christian Jutten , Jean-Luc Schwartz Speech Communication, 2004, 44, pp.113-125 Article dans une revue hal-00186591v1

	Unsupervised speech enhancement with deep dynamical generative speech and noise models Xiaoyu Lin , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda Interspeech 2023 - 24th Annual Conference of the International Speech Communication Association, ISCA, Aug 2023, Dublin, Ireland. pp.1-5 Communication dans un congrès hal-04132312v1
	Exploring the multidimensional representation of unidimensional speech of acoustic parameters extracted by deep unsupervised models Maxime Jacquelin , Maëva Garnier , Laurent Girin , Rémy Vincent , Olivier Perrotin Journée commune AFIA-TLH / AFCP – “Extraction de connaissances interprétables pour l’étude de la communication parlée”, AFIA-TLH; AFCP, Dec 2023, Avignon, France Communication dans un congrès hal-04416200v1
	Speech Modeling with a Hierarchical Transformer Dynamical VAE Xiaoyu Lin , Xiaoyu Bie , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10096751⟩ Communication dans un congrès hal-04132313v1
	Exploring the multidimensional representation of individual speech acoustic parameters extracted by deep unsupervised models Maxime Jacquelin , Maëva Garnier , Laurent Girin , Rémy Vincent , Olivier Perrotin SSW 2023 - 12th ISCA Speech Synthesis Workshop (SSW2023), Aug 2023, Grenoble, France. pp.240-241 Communication dans un congrès hal-04274170v1
	BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model Brooke Stephenson , Laurent Besacier , Laurent Girin , Thomas Hueber Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association, Sep 2022, Incheon, South Korea. pp.3383-3387, ⟨10.21437/Interspeech.2022-10116⟩ Communication dans un congrès hal-03791472v1
	Repeat after Me: Self-Supervised Learning of Acoustic-to-Articulatory Mapping by Vocal Imitation Marc-Antoine Georges , Julien Diard , Laurent Girin , Jean-Luc Schwartz , Thomas Hueber ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. pp.8252-8256, ⟨10.1109/ICASSP43922.2022.9747804⟩ Communication dans un congrès hal-03688189v1
	Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole Laurent Girin , Xiaoyu Bie , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda JEP 2022 - 34e Journées d’Études sur la Parole, Université de Nantes, Jun 2022, Noirmoutier, France. pp.655-663, ⟨10.21437/JEP.2022-69⟩ Communication dans un congrès hal-03978396v1
	Learning and controlling the source-filter representation of speech with a variational autoencoder Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Séguier CFA 2022 - 16ème Congrès Français d'Acoustique, Société Française d'Acoustique (SFA), Apr 2022, Marseille, France Communication dans un congrès hal-03603791v1
	Improved feature extraction for CRNN-based multiple sound source localization Pierre-Amaury Grumiaux , Srđan Kitić , Laurent Girin , Alexandre Guérin EUSIPCO 2021 - 29th European Signal Processing Conference (EUSIPCO), Aug 2021, Dublin, Ireland. pp.231-235, ⟨10.23919/EUSIPCO54536.2021.9616124⟩ Communication dans un congrès hal-03537334v1
	A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.46-50, ⟨10.21437/Interspeech.2021-256⟩ Communication dans un congrès hal-03295657v1
	Learning robust speech representation with an articulatory-regularized variational autoencoder Marc-Antoine Georges , Laurent Girin , Jean-Luc Schwartz , Thomas Hueber Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3345-3349, ⟨10.21437/Interspeech.2021-1604⟩ Communication dans un congrès hal-03373252v1
	Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain Pierre-Amaury Grumiaux , Srđan Kitić , Prerak Srivastava , Laurent Girin , Alexandre Guérin WASPAA 2021 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2021, New Paltz / Virtual, United States. pp.336-340, ⟨10.1109/WASPAA52581.2021.9632737⟩ Communication dans un congrès hal-03537340v1
	High-resolution speaker counting in reverberant rooms using CRNN with Ambisonics features Pierre-Amaury Grumiaux , Srđan Kitić , Laurent Girin , Alexandre Guérin EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. pp.71-75, ⟨10.23919/Eusipco47968.2020.9287637⟩ Communication dans un congrès hal-03537323v1
	Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input Brooke Stephenson , Thomas Hueber , Laurent Girin , Laurent Besacier Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩ Communication dans un congrès hal-03372802v1
	Towards an articulatory-driven neural vocoder for speech synthesis Marc-Antoine Georges , Pierre Badin , Julien Diard , Laurent Girin , Jean-Luc Schwartz ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence (virtual), United States Communication dans un congrès hal-03184762v1
	Multichannel source counting with CRNN : analysis of the performance Pierre-Amaury Grumiaux , Srđan Kitić , Laurent Girin , Alexandre Guérin Forum Acusticum 2020, Dec 2020, Lyon (virtual), France. pp.829-835, ⟨10.48465/fa.2020.0766⟩ Communication dans un congrès hal-03235360v1
	A Recurrent Variational Autoencoder for Speech Enhancement Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud ICASSP 2020 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelona (virtual), Spain. pp.371-375, ⟨10.1109/ICASSP40776.2020.9053164⟩ Communication dans un congrès hal-02329000v2
	What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS Brooke Stephenson , Laurent Besacier , Laurent Girin , Thomas Hueber Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association, Oct 2020, Shanghai (Virtual Conf), China. pp.215-219, ⟨10.21437/Interspeech.2020-2103⟩ Communication dans un congrès hal-02962234v1
	Autoencoders for music sound modeling : a comparison of linear, shallow, deep, recurrent and variational models Fanny Roche , Thomas Hueber , Samuel Limier , Laurent Girin SMC 2019 - 16th Sound & Music Computing Conference, May 2019, Malaga, Spain Communication dans un congrès hal-02349406v1
	Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization Simon Leglaive , Laurent Girin , Radu Horaud ICASSP 2019 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.101-105, ⟨10.1109/ICASSP.2019.8683704⟩ Communication dans un congrès hal-02005102v2
	Bayesian time-domain multiple sound source localization for a stochastic machine Raphael Frisch , Marvin Faix , Jacques Droulez , Laurent Girin , Emmanuel Mazer EUSIPCO 2019 - 27th European Signal Processing Conference, Sep 2019, A Coruna, Spain. pp.1-5, ⟨10.23919/EUSIPCO.2019.8902666⟩ Communication dans un congrès hal-02377220v1
	Speech enhancement with variational autoencoders and alpha-stable distributions Simon Leglaive , Umut Şimşekli , Antoine Liutkus , Laurent Girin , Radu Horaud ICASSP 2019 - 44th IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545, ⟨10.1109/ICASSP.2019.8682546⟩ Communication dans un congrès hal-02005106v1
	Notes on the use of variational autoencoders for speech and audio spectrogram modeling Laurent Girin , Fanny Roche , Thomas Hueber , Simon Leglaive DAFx 2019 - 22nd International Conference on Digital Audio Effects, Sep 2019, Birmingham, United Kingdom. pp.1-8 Communication dans un congrès hal-02349385v1
	Audio-Visual Variational Fusion for Multi-Person Tracking with Robots Xavier Alameda-Pineda , Soraya Arias , Yutong Ban , Guillaume Delorme , Laurent Girin ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨10.1145/3343031.3350590⟩ Communication dans un congrès hal-02354514v1
	A variance modeling framework based on variational autoencoders for speech enhancement Simon Leglaive , Laurent Girin , Radu Horaud MLSP 2018 - IEEE 28th International Workshop on Machine Learning for Signal Processing, Sep 2018, Aalborg, Denmark. pp.1-6, ⟨10.1109/MLSP.2018.8516711⟩ Communication dans un congrès hal-01832826v1
	Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking Yutong Ban , Xiaofei Li , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.6553-6557, ⟨10.1109/ICASSP.2018.8462100⟩ Communication dans un congrès hal-01718114v1
	Multisource MINT Using the Convolutive Transfer Function Xiaofei Li , Sharon Gannot , Laurent Girin , Radu Horaud ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.756-760, ⟨10.1109/ICASSP.2018.8462607⟩ Communication dans un congrès hal-01718106v1
	A Cascaded Multiple-Speaker Localization and Tracking System Xiaofei Li , Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud IWAENC - LOCATA Challenge Workshop - a satellite event of IWAENC 2018, Sep 2018, Tokyo, Japan. pp.1-5 Communication dans un congrès hal-01957137v1
	Online Localization of Multiple Moving Speakers in Reverberant Environments Xiaofei Li , Bastien Mourgue , Laurent Girin , Sharon Gannot , Radu Horaud SAM 2018 - 10th IEEE Workshop on Sensor Array and Multichannel Signal Processing, Jul 2018, Sheffield, United Kingdom. pp.405-409, ⟨10.1109/SAM.2018.8448423⟩ Communication dans un congrès hal-01795462v1
	Autonomous Sensorimotor Learning for Sound Source Localization by a Humanoid Robot Quan Nguyen , Laurent Girin , Gérard Bailly , Frédéric Elisei , Duc-Canh Nguyen IROS 2018 - Workshop on Crossmodal Learning for Intelligent Robotics in conjunction with IEEE/RSJ IROS, Oct 2018, Madrid, Spain Communication dans un congrès hal-01921882v1
	On the Use of Latent Mixing Filters in Audio Source Separation Laurent Girin , Roland Badeau LVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.225-235, ⟨10.1007/978-3-319-53547-0_22⟩ Communication dans un congrès hal-01400965v1
	Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization Xiaofei Li , Laurent Girin , Radu Horaud ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.541-545, ⟨10.1109/ICASSP.2017.7952214⟩ Communication dans un congrès hal-01430754v1
	An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function Xiaofei Li , Laurent Girin , Radu Horaud WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.56-60, ⟨10.1109/WASPAA.2017.8169994⟩ Communication dans un congrès hal-01568818v1
	Exploiting the Complementarity of Audio and Visual Data in Multi-Speaker Tracking Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud ICCVW 2017 - IEEE International Conference on Computer Vision Workshops, Oct 2017, Venice, Italy. pp.446-454, ⟨10.1109/ICCVW.2017.60⟩ Communication dans un congrès hal-01577965v1
	Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework Laurent Girin , Thomas Hueber , Xavier Alameda-Pineda LVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.459-468, ⟨10.1007/978-3-319-53547-0_43⟩ Communication dans un congrès hal-01646098v1
	Explaining the Parameterized Wiener Filter with Alpha-Stable Processes Mathieu Fontaine , Antoine Liutkus , Laurent Girin , Roland Badeau WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, New York, United States Communication dans un congrès hal-01548508v1
	A Bayesian stochastic machine for sound source localization Raphael Frisch , Raphaël Laurent , Marvin Faix , Laurent Girin , Laurent Fesquet ICRC 2017 - IEEE International Conference on Rebooting Computing, Nov 2017, Washington, DC, United States. pp.1-8 Communication dans un congrès hal-01644346v1
	An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.16-20, ⟨10.1109/ICASSP.2017.7951789⟩ Communication dans un congrès hal-01430761v1
	Exploiting the Intermittency of Speech for Joint Separation and Diarization Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud , Sharon Gannot WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.41-45, ⟨10.1109/WASPAA.2017.8169991⟩ Communication dans un congrès hal-01568813v1
	Non-Stationary Noise Power Spectral Density Estimation Based on Regional Statistics Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Mar 2016, Shanghai, China. pp.181-185, ⟨10.1109/ICASSP.2016.7471661⟩ Communication dans un congrès hal-01250892v1
	An Inverse-Gamma Source Variance Prior with Factorized Parameterization for Audio Source Separation Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Mar 2016, Shanghai, China. pp.136-140, ⟨10.1109/ICASSP.2016.7471652⟩ Communication dans un congrès hal-01253169v1
	Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding Xiaofei Li , Radu Horaud , Laurent Girin , Sharon Gannot IWAENC 2016 - International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2016, Xi'an, China. pp.1-5, ⟨10.1109/IWAENC.2016.7602911⟩ Communication dans un congrès hal-01349776v1
	Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function Xiaofei Li , Laurent Girin , Fabien Badeig , Radu Horaud IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, Oct 2016, Daejeon, South Korea. pp.2819-2826, ⟨10.1109/IROS.2016.7759437⟩ Communication dans un congrès hal-01349771v1
	Deep neural networks for automatic detection of screams and shouted speech in subway trains Pierre Laffitte , David Sodoyer , Charles Tatkeu , Laurent Girin ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2016, Shanghai, China. pp.6460-6464, ⟨10.1109/ICASSP.2016.7472921⟩ Communication dans un congrès hal-01385272v1
	Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Australia. pp.320 - 324, ⟨10.1109/ICASSP.2015.7177983⟩ Communication dans un congrès hal-01119186v1
	A Variational EM Algorithm for the Separation of Moving Sound Sources Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud WASPAA 2015 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE Signal Processing Society, Oct 2015, New Paltz, NY, United States. pp.1-5, ⟨10.1109/WASPAA.2015.7336936⟩ Communication dans un congrès hal-01169764v2
	Local Relative Transfer Function for Sound Source Localization Xiaofei Li , Radu Horaud , Laurent Girin , Sharon Gannot EUSIPCO 2015 - 23rd European Signal Processing Conference, Aug 2015, Nice, France. pp.399-403, ⟨10.1109/EUSIPCO.2015.7362413⟩ Communication dans un congrès hal-01163675v1
	Real-time Control of a DNN-based Articulatory Synthesizer for Silent Speech Conversion: a pilot study Florent Bocquelet , Thomas Hueber , Laurent Girin , Christophe Savariaux , Blaise Yvert Interspeech 2015 - 16th Annual Conference of the International Speech Communication Association, Sep 2015, Dresden, Germany Communication dans un congrès hal-01726265v1
	Sound Representation and Classification Benchmark for Domestic Robots Maxime Janvier , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud ICRA 2014 - IEEE International Conference on Robotics and Automation, May 2014, Hong Kong, China. pp.6285-6292, ⟨10.1109/ICRA.2014.6907786⟩ Communication dans un congrès hal-00952092v1
	Perceptual coding-based informed source separation Serap Kirbiz , Alexey Ozerov , Antoine Liutkus , Laurent Girin EUSIPCO 2014 - 22nd European Signal Processing Conference, Sep 2014, Lisbon, Portugal Communication dans un congrès hal-01016314v1
	Mapping Sounds on Images Using Binaural Spectrograms Antoine Deleforge , Vincent Drouard , Laurent Girin , Radu Horaud EUSIPCO 2014 - 22nd European Signal Processing Conference, Sep 2014, Lisbon, Portugal. pp.2470 - 2474 Communication dans un congrès hal-01019287v1
	Robust Articulatory Speech Synthesis using Deep Neural Networks for BCI Applications Florent Bocquelet , Thomas Hueber , Laurent Girin , Pierre Badin , Blaise Yvert Interspeech 2014 - 15th Annual Conference of the International Speech Communication Association, Sep 2014, Singapore, Singapore Communication dans un congrès hal-01228891v1
	Supervised Classification of Baboon Vocalizations Maxime Janvier , Radu Horaud , Laurent Girin , Frédéric Berthommier , Louis-Jean Boë NIPS4B - Workshop: Neural Information Processing Scaled for Bioacoustics : NIPS4B, Dec 2013, Lake Tahoe, Nevada, United States. 10 p Conference paper hal-00910104v1
	Informed Source Separation from compressed mixtures using spatial wiener filter and quantization noise estimation Shuhua Zhang , Laurent Girin , Antoine Liutkus ICASSP 2013 - 38th IEEE International Conference on Acoustics, Speech and Signal Processing, May 2013, Vancouver, Canada. pp.61-65, ⟨10.1109/ICASSP.2013.6637609⟩ Conference paper hal-00940328v1
	Phase-based informed source separation for active listening of music Nicolas Sturmel , Laurent Daudet , Laurent Girin DAFx 2012 - 15th International Conference on Digital Audio Effects, Sep 2012, York, United Kingdom. pp.n/c Conference paper hal-00807001v1
	A Simple Hybrid Acoustic / Morphologically Constrained Technique for the Synthesis of Stop Consonants in Various Vocalic Contexts Frédéric Berthommier , Laurent Girin , Louis-Jean Boë Interspeech 2012 - 13th Annual Conference of the International Speech Communication Association, Sep 2012, Portland, United States. pp.Thu.P10a.05 Conference paper hal-00807519v1
	Sound-Event Recognition with a Companion Humanoid Maxime Janvier , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud Humanoids 2012 - IEEE International Conference on Humanoid Robotics, Nov 2012, Osaka, Japan. pp.104-111, ⟨10.1109/HUMANOIDS.2012.6651506⟩ Conference paper hal-00768767v1
	Professionally-produced music separation guided by covers Timothée Gerber , Martin Dutasta , Laurent Girin , Cédric Févotte ISMIR 2012 - International Society for Music Information Retrieval Conference, Oct 2012, Porto, Portugal. pp.n/c Conference paper hal-00807027v1
	Linear Mixing Models for Active Listening of Music Productions in Realistic Studio Conditions Nicolas Sturmel , Antoine Liutkus , Jonathan Pinel , Laurent Girin , Sylvain Marchand AES 2012 - 132nd AES Convention, Apr 2012, Budapest, Hungary. Paper 8594 Conference paper hal-00790783v1
	Informed Audio Source Separation: A Comparative Study Antoine Liutkus , Stanislaw Gorlow , Nicolas Sturmel , Shuhua Zhang , Laurent Girin EUSIPCO 2012 - 20th European Signal Processing Conference, Aug 2012, Bucharest, Romania. pp.n/c Conference paper hal-00809525v1
	DReaM: A Novel System for Joint Source Separation and Multi-Track Coding Sylvain Marchand , Roland Badeau , Cléo Baras , Laurent Daudet , Dominique Fourer AES 2012 - 133rd AES Convention, Oct 2012, San Francisco, United States. CD 133papers Conference paper hal-00809503v1
	An informed source separation system for speech signals Shuhua Zhang , Laurent Girin Interspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.573-576 Conference paper hal-00695758v1
	Informed audio source separation from compressed linear stereo mixtures Laurent Girin , Jonathan Pinel AES 2011 - 42nd International Conference: Semantic Audio, Jul 2011, Ilmenau, Germany. pp.159-168 Conference paper hal-00695724v1
	A long-term harmonic plus noise model for speech signals Faten Ben Ali , Laurent Girin , Sonia Djaziri-Larbi Interspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.53-56 Conference paper hal-00695752v1
	"Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation Jonathan Pinel , Laurent Girin AES 2011 - 42nd International Conference: Semantic Audio, Jul 2011, Ilmenau, Germany. pp.179-188 Conference paper hal-00695730v1
	A high-rate data hiding technique for audio signals based on IntMDCT quantization Jonathan Pinel , Laurent Girin DAFx 2011 - 14th International Conference on Digital Audio Effects, Sep 2011, Paris, France. pp.353-356 Conference paper hal-00695759v1
	Interactive Music with Active Audio CDs Sylvain Marchand , Boris Mansencal , Laurent Girin CMMR 2010 - 7th International Symposium on Computer Music Modeling and Retrieval, Jun 2010, Málaga, Spain. pp.73-74 Conference paper hal-00502792v1
	Hybrid coding/indexing strategy for informed source separation of linear instantaneous under-determined audio mixtures Mathieu Parvaix , Laurent Girin , Laurent Daudet , Jonathan Pinel , Cléo Baras ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010 Conference paper hal-00535684v1
	Long-term modelling of parameters trajectories for the harmonic plus noise model of speech signals Faten Ben Ali , Laurent Girin , Sonia Djaziri-Larbi ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010 Conference paper hal-00534497v1
	Une technique de tatouage " haute-capacité " pour signaux musicaux au format CD-audio Jonathan Pinel , Laurent Girin , Cléo Baras CFA 2010 - 10ème Congrès Français d'Acoustique, Apr 2010, Lyon, France Conference paper hal-00542884v1
	Séparation de source informée pour des mélanges stéréo instantanés utilisant un tatouage de l'index des sources localement prédominantes Mathieu Parvaix , Laurent Girin CFA 2010 - 10ème Congrès Français d'Acoustique, Apr 2010, Lyon, France. pp.Cd-Rom Conference paper hal-00486818v1
	Linking Motion Sensors and Digital Signal Processing for Real-Time Musical Transformations Mathieu Mazuel , Dominique David , Laurent Girin HAID 2010 - 5th International Workshop Haptic Audio Interaction Design, Sep 2010, Copenhagen, Denmark. pp.HAID2010 Conference paper hal-00535694v1
	A high-capacity watermarking technique for audio signals based on MDCT-domain quantization Jonathan Pinel , Laurent Girin , Cléo Baras , Mathieu Parvaix ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010 Conference paper hal-00534502v1
	Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding Mathieu Parvaix , Laurent Girin ICASSP 2010 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2010, Dallas, United States. pp.245-248 Conference paper hal-00486804v1
	A watermarking-based method for single-channel audio source separation Mathieu Parvaix , Laurent Girin , Jean-Marc Brossier ICASSP 2009 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, Taiwan. pp.1 Conference paper hal-00361713v1
	Estimation of the Voicing Cut-Off Frequency Contour of Natural Speech Based on Harmonic and Aperiodic Energies Kris Hermus , Laurent Girin , Hugo van Hamme , Sufian Irhimeh ICASSP 2008 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, Nevada, United States Conference paper hal-00329764v1
	Long-Term Flexible 2D Cepstral Modeling of Speech Spectral Amplitudes Laurent Girin , Mohammad Firouzmand ICASSP 2008 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, Nevada, United States Conference paper hal-00329752v1
	Development and comparison of two approaches for visual speech analysis with application to voice activity detection Bertrand Rivet , Andrew Aubrey , Laurent Girin , Yulia Hicks , Christian Jutten AVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.228-232 Conference paper hal-00195015v1
	Using a Visual Voice Activity Detector to Regularize the Permutations in Blind Separation of Convolutive Speech Mixtures Bertrand Rivet , Laurent Girin , Christine Serviere , Dinh-Tuan Pham , Christian Jutten DSP 2007 - 15th IEEE International Conference on Digital Signal Processing, Jul 2007, Cardiff, United Kingdom. pp.223-226, ⟨10.1109/ICDSP.2007.4288559⟩ Conference paper hal-00173341v1
	Long-term quantization of speech LSF parameters Laurent Girin ICASSP 2007 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2007, Honolulu, Hawaii, United States. pp.845 Conference paper hal-00194157v1
	Audiovisual speech source separation: a regularization method based on visual voice activity detection Bertrand Rivet , Laurent Girin , Christine Serviere , Dinh-Tuan Pham , Christian Jutten AVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.223-227 Conference paper hal-00195014v1
	Two novel visual voice activity detectors based on appearance models and retinal filtering Andrew Aubrey , Bertrand Rivet , Yulia Hicks , Laurent Girin , Jonathon Chambers EUSIPCO 2007 - 15th European Signal Processing Conference, Sep 2007, Poznan, Poland Conference paper hal-00188132v1
	ARTUS : calcul et tatouage audiovisuel des mouvements d'un personnage animé virtuel pour l'accessibilité d'émissions télévisuelles aux téléspectateurs sourds comprenant la Langue Française Parlée Complétée Gérard Bailly , Cléo Baras , Patrick Bas , Séverine Baudry , Denis Beautemps Handicap, Jun 2006, Paris, France. pp.265-270 Conference paper hal-00366492v1
	Theoretical and experimental bases of a new method for accurate separation of harmonic and noise components of speech signals Laurent Girin European Signal Processing conference (EUSIPCO), Sep 2006, Florence, Italy. pp.1 Conference paper hal-00372280v1
	An analysis of visual speech information applied to voice activity detection David Sodoyer , Bertrand Rivet , Laurent Girin , Jean-Luc Schwartz , Christian Jutten IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2006, Toulouse, France. pp.1 Conference paper hal-00361750v1
	Solving the indeterminations of Blind source separation of convolutive speech mixtures David Sodoyer , Christian Jutten , Laurent Girin , Jean-Luc Schwartz , Bertrand Rivet IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2006, 2006, Toulouse, France Conference paper hal-00098153v1
	Comparing Several Models for Perceptual Long-Term Modeling of Amplitudes and Phase Trajectories of Sinusoidal Speech M. Firouzmand , Laurent Girin , Sylvain Marchand Proceedings of the INTERSPEECH -- EUROSPEECH Conference, Sep 2005, Portugal. pp.357-360 Conference paper hal-00308299v1
	A Generalized Polynomial And Sinusoidal Model For Partial Tracking And Time Stretching Martin Raspaud , Sylvain Marchand , Laurent Girin Proceedings of the Digital Audio Effects (DAFx05) Conference, Sep 2005, Spain. pp.24-29 Conference paper hal-00307987v1
	Long Term Modeling of Phase Trajectories within the Speech Sinusoidal Model Framework Laurent Girin , Mohammad Firouzmand , Sylvain Marchand INTERSPEECH - 8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472 Conference paper hal-00308298v1
	Characterizing and classifying Cued Speech vowels from labial parameters Denis Beautemps , Thomas Burger , Laurent Girin 8th International Conference on Spoken Language Processing (ICSLP'04 or InterSpeech'04), 2004, Jeju, South Korea Conference paper hal-00328134v1
	Watermarking of Speech Signals Using the Sinusoidal Model and Frequency Modulation of the Partials Laurent Girin , Sylvain Marchand IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2004, Canada. pp.I - 633-6 Conference paper hal-00308297v1
	Comparing the Order of a Polynomial Phase Model for the Synthesis of Quasi-Harmonic Audio Signals Laurent Girin , Sylvain Marchand , Joseph Di Martino , Axel Röbel , Geoffroy Peeters IEEE Workshop on Applications of Signal Processing to Audio and Acoustics - WASPAA'03, Oct 2003, New York, United States. pp.193-196 Conference paper hal-00308296v1

	Audio source separation into the wild Laurent Girin , Sharon Gannot , Xiaofei Li Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, ⟨10.1016/B978-0-12-814601-9.00022-5⟩ Book chapter hal-01943375v1

	Procédé de traitement numérique sur un ensemble de pistes audio avant mixage Nicolas Sturmel , Laurent Daudet , Laurent Girin France, Patent no.: FR 2984579. 2013, pp.20 Patent hal-01021287v1
	Procédé et dispositif de formation d'un signal mixé, procédé et dispositif de séparation de signaux, et signal correspondant Mathieu Parvaix , Laurent Girin , Jean-Marc Brossier , Sylvain Marchand France, Patent no.: FR 2944403. 2010, pp.20 Patent hal-01021265v1
	Method and device for forming a digital audio mixed signal, method and device for separating signals, and corresponding signal Laurent Girin , Antoine Liutkus , Gael Richard , Roland Badeau France, Patent no.: US20140037110A1. 2010 Patent hal-02651076v1

	Procédé et dispositif de formation d'un signal mixé numérique audio, procédé et dispositif de séparation de signaux, et signal correspondant Laurent Girin , Antoine Liutkus , Gael Richard , Roland Badeau 2010 Other scientific publication hal-00945254v1
