Laurent Girin

139
Documents

Publications

A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning

Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Séguier
Neural Networks, 2024, 172, pp.106120. ⟨10.1016/j.neunet.2024.106120⟩
Journal article hal-04132316v1

Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation

Xiaoyu Lin , Laurent Girin , Xavier Alameda-Pineda
Transactions on Machine Learning Research Journal, 2024, pp.1-19
Journal article hal-03584014v1

Learning and controlling the source-filter representation of speech with a variational autoencoder

Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Séguier
Speech Communication, 2023, 148, pp.53-65. ⟨10.1016/j.specom.2023.02.005⟩
Journal article hal-03650569v3

A survey of sound source localization with deep learning methods

Pierre-Amaury Grumiaux , Srđan Kitić , Laurent Girin , Alexandre Guérin
Journal of the Acoustical Society of America, 2022, 152 (1), pp.107-151. ⟨10.1121/10.0011809⟩
Journal article hal-03952034v1

Unsupervised Speech Enhancement using Dynamical Variational Autoencoders

Xiaoyu Bie , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, 30, pp.2993 - 3007. ⟨10.1109/TASLP.2022.3207349⟩
Journal article hal-03295630v1

Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers

Yutong Ban , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021, 43 (5), pp.1761-1776. ⟨10.1109/TPAMI.2019.2953020⟩
Journal article hal-01950866v2

Dynamical Variational Autoencoders: A Comprehensive Review

Laurent Girin , Simon Leglaive , Xiaoyu Bie , Julien Diard , Thomas Hueber
Foundations and Trends in Machine Learning, 2021, 15 (1-2), pp.1-175. ⟨10.1561/2200000089⟩
Journal article hal-02926215v2

Make That Sound More Metallic: Towards a Perceptually Relevant Control of the Timbre of Synthesizer Sounds Using a Variational Autoencoder

Fanny Roche , Thomas Hueber , Maëva Garnier , Samuel Limier , Laurent Girin
Transactions of the International Society for Music Information Retrieval (TISMIR), 2021, 4, pp.52 - 66. ⟨10.5334/tismir.76⟩
Journal article hal-03247371v1

Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning

Thomas Hueber , Eric Tatulli , Laurent Girin , Jean-Luc Schwartz
Neural Computation, 2020, 32 (3), pp.596-625. ⟨10.1162/neco_a_01264⟩
Journal article hal-03016083v1

Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders

Mostafa Sadeghi , Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.1788-1800. ⟨10.1109/TASLP.2020.3000593⟩
Journal article hal-02364900v3

Multichannel Online Dereverberation based on Spectral Magnitude Inverse Filtering

Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (9), pp.1365-1377. ⟨10.1109/TASLP.2019.2919183⟩
Journal article hal-01969041v1

Audio-noise Power Spectral Density Estimation Using Long Short-term Memory

Xiaofei Li , Simon Leglaive , Laurent Girin , Radu Horaud
IEEE Signal Processing Letters, 2019, 26 (6), pp.918-922. ⟨10.1109/LSP.2019.2911879⟩
Journal article hal-02100059v1

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function

Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019, 27 (3), pp.645-659. ⟨10.1109/TASLP.2019.2892412⟩
Journal article hal-01799809v1

Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments

Xiaofei Li , Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud
IEEE Journal of Selected Topics in Signal Processing, 2019, 13 (1), pp.88-103. ⟨10.1109/JSTSP.2019.2903472⟩
Journal article hal-01851985v2

Expectation-Maximization for Speech Source Separation using Convolutive Transfer Function

Xiaofei Li , Laurent Girin , Radu Horaud
CAAI Transactions on Intelligent Technologies, 2019, 4 (1), pp.47 - 53. ⟨10.1049/trit.2018.1061⟩
Journal article hal-01982250v1

Assessing the Performances of different Neural Network Architectures for the Detection of Screams and Shouts in Public Transportation

Pierre Laffitte , Yun Wang , David Sodoyer , Laurent Girin
Expert Systems with Applications, 2019, 117, pp.29-41. ⟨10.1016/j.eswa.2018.08.052⟩
Journal article hal-01892436v1

Multichannel Identification and Nonnegative Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function

Xiaofei Li , Sharon Gannot , Laurent Girin , Radu Horaud
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (10), pp.1755-1768. ⟨10.1109/TASLP.2018.2839362⟩
Journal article hal-01645749v3

Multiple-Speaker Localization Based on Direct-Path Features and Likelihood Maximization with Spatial Sparsity Regularization

Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (10), pp.1997 - 2012. ⟨10.1109/TASLP.2017.2740001⟩
Journal article hal-01413417v1

Automatic animation of an articulatory tongue model from ultrasound images of the vocal tract

Diandra Fabre , Thomas Hueber , Laurent Girin , Xavier Alameda-Pineda , Pierre Badin
Speech Communication, 2017, 93, pp.63 - 75. ⟨10.1016/j.specom.2017.08.002⟩
Journal article hal-01578315v1

Extending the Cascaded Gaussian Mixture Regression Framework for Cross-Speaker Acoustic-Articulatory Mapping

Laurent Girin , Thomas Hueber , Xavier Alameda-Pineda
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (3), pp.662-673. ⟨10.1109/TASLP.2017.2651398⟩
Journal article hal-01485540v1

Real-Time Control of an Articulatory-Based Speech Synthesizer for Brain Computer Interfaces

Florent Bocquelet , Thomas Hueber , Laurent Girin , Christophe Savariaux , Blaise Yvert
PLoS Computational Biology, 2016, 12 (11), pp.e1005119. ⟨10.1371/journal.pcbi.1005119⟩
Journal article hal-01459706v1

Estimation of the Direct-Path Relative Transfer Function for Supervised Sound-Source Localization

Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (11), pp.2171 - 2186. ⟨10.1109/TASLP.2016.2598319⟩
Journal article hal-01349691v1

Key considerations in designing a speech brain-computer interface

Florent Bocquelet , Thomas Hueber , Laurent Girin , Stephan Chabardès , Blaise Yvert
Journal of Physiology - Paris, 2016, 110 (4, Part A), pp.392-401. ⟨10.1016/j.jphysparis.2017.07.002⟩
Journal article hal-01978301v1

Low Bit-Rate Speech Codec Based on a Long-Term Harmonic Plus Noise Model

Faten Ben Ali , Sonia Djaziri-Larbi , Laurent Girin
Journal of the Audio Engineering Society, 2016, 64 (11), pp.844-857. ⟨10.17743/jaes.2016.0028⟩
Journal article hal-02520614v1

A Variational EM Algorithm for the Separation of Time-Varying Convolutive Audio Mixtures

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2016, 24 (8), pp.1408-1423. ⟨10.1109/TASLP.2016.2554286⟩
Journal article hal-01301762v1

Speaker-Adaptive Acoustic-Articulatory Inversion using Cascaded Gaussian Mixture Regression

Thomas Hueber , Laurent Girin , Xavier Alameda-Pineda , Gérard Bailly
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2015, 23 (12), pp.2246-2259. ⟨10.1109/TASLP.2015.2464702⟩
Journal article hal-01231197v1

Co-Localization of Audio Sources in Images Using Binaural Features and Locally-Linear Regression

Antoine Deleforge , Radu Horaud , Yoav Y. Schechner , Laurent Girin
IEEE Transactions on Audio, Speech and Language Processing, 2015, 23 (4), pp.718-731. ⟨10.1109/TASLP.2015.2405475⟩
Journal article hal-01112834v3

A high-rate data hiding technique for uncompressed audio signals

Jonathan Pinel , Laurent Girin , Cléo Baras
Journal of the Audio Engineering Society, 2014, 62 (6), pp.400-413. ⟨10.17743/jaes.2014.0024⟩
Journal article hal-01143294v1

Fast and accurate direct MDCT to DFT conversion with arbitrary window functions

Shuhua Zhang , Laurent Girin
IEEE Transactions on Audio, Speech and Language Processing, 2013, 21 (3), pp.567-578. ⟨10.1109/TASL.2012.2227737⟩
Journal article hal-00807031v1

A mediating role of the auditory dorsal pathway in selective adaptation to speech: a state-dependent transcranial magnetic stimulation study

Krystyna Grabski , Pascale Tremblay , Vincent Gracco , Laurent Girin , Marc Sato
Brain Research, 2013, 1515 (5), pp.55-65. ⟨10.1016/j.brainres.2013.03.024⟩
Journal article hal-00915287v1

Informed source separation through spectrogram coding and data embedding

Antoine Liutkus , Jonathan Pinel , Roland Badeau , Laurent Girin , Gael Richard
Signal Processing, 2012, 92 (8), pp.1937-1949. ⟨10.1016/j.sigpro.2011.09.016⟩
Journal article hal-00643957v1

Interactive Music with Active Audio CDs

Sylvain Marchand , Boris Mansencal , Laurent Girin
Lecture Notes in Computer Science, 2011, 6684, pp.31-50. ⟨10.1007/978-3-642-23126-1_3⟩
Journal article hal-00625211v1

Informed source separation of linear instantaneous under-determined audio mixtures by source index embedding

Mathieu Parvaix , Laurent Girin
IEEE Transactions on Audio, Speech and Language Processing, 2011, 19 (6), pp.1721-1733. ⟨10.1109/TASL.2010.2097250⟩
Journal article hal-00695763v1

A Watermarking-Based Method for Informed Source Separation of Audio Signals with a Single Sensor

Mathieu Parvaix , Laurent Girin , Jean-Marc Brossier
IEEE Transactions on Audio, Speech and Language Processing, 2010, 18 (6), pp.1464-1475
Journal article hal-00486809v1

Adaptive long-term coding of LSF parameters trajectories for large delay / very- to ultra-low bit-rate speech coding

Laurent Girin
EURASIP Journal on Audio, Speech, and Music Processing, 2010, 2010 (Article ID 597039), pp.n/c. ⟨10.1155/2010/597039⟩
Journal article hal-00534492v1

A study of lip movements during spontaneous dialog and its application to voice activity detection

David Sodoyer , Bertrand Rivet , Laurent Girin , Christophe Savariaux , Jean-Luc Schwartz
Journal of the Acoustical Society of America, 2009, 125 (2), pp.1184-1196. ⟨10.1121/1.3050257⟩
Journal article hal-00941145v1

Perceptual long-term variable-rate sinusoidal modeling of speech

Laurent Girin , Mohammad Firouzmand , Sylvain Marchand
IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (3), pp.851-861. ⟨10.1109/TASL.2006.885928⟩
Journal article hal-00194164v1

Mixing Audiovisual Speech Processing and Blind Source Separation for the Extraction of Speech Signals From Convolutive Mixtures

Bertrand Rivet , Laurent Girin , Christian Jutten
IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (1), pp.96-108. ⟨10.1109/TASL.2006.872619⟩
Journal article hal-00174100v1

Visual voice activity detection as a help for speech source separation from convolutive mixtures

Bertrand Rivet , Laurent Girin , Christian Jutten
Speech Communication, 2007, 49 (7-8), pp.667-677. ⟨10.1016/j.specom.2007.04.008⟩
Journal article hal-00499184v1

Log-Rayleigh Distribution: A Simple and Efficient Statistical Representation of Log-Spectral Coefficients

Bertrand Rivet , Laurent Girin , Christian Jutten
IEEE Transactions on Audio, Speech and Language Processing, 2007, 15 (3), pp.796-802. ⟨10.1109/TASL.2006.885922⟩
Journal article hal-00174096v1

ARTUS: synthesis and audiovisual watermarking of the movements of a virtual agent interpreting subtitling using cued speech for deaf televiewers

Gérard Bailly , Virginie Attina , Cléo Baras , Patrick Bas , Séverine Baudry
Modelling, measurement and control C, 2006, 67SH (2, supplement : handicap), pp.177-187
Journal article hal-00157826v1

Developing an audio-visual speech source separation algorithm

David Sodoyer , Laurent Girin , Christian Jutten , Jean-Luc Schwartz
Speech Communication, 2004, 44, pp.113-125
Journal article hal-00186591v1

Unsupervised speech enhancement with deep dynamical generative speech and noise models

Xiaoyu Lin , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda
Interspeech 2023 - 24th Annual Conference of the International Speech Communication Association, ISCA, Aug 2023, Dublin, Ireland. pp.1-5
Conference paper hal-04132312v1

Exploring the multidimensional representation of unidimensional speech of acoustic parameters extracted by deep unsupervised models

Maxime Jacquelin , Maëva Garnier , Laurent Girin , Rémy Vincent , Olivier Perrotin
Journée commune AFIA-TLH / AFCP – “Extraction de connaissances interprétables pour l’étude de la communication parlée”, AFIA-TLH; AFCP, Dec 2023, Avignon (FR), France
Conference paper hal-04416200v1

Speech Modeling with a Hierarchical Transformer Dynamical VAE

Xiaoyu Lin , Xiaoyu Bie , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda
ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing, Jun 2023, Rhodes, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10096751⟩
Conference paper hal-04132313v1

Exploring the multidimensional representation of individual speech acoustic parameters extracted by deep unsupervised models

Maxime Jacquelin , Maëva Garnier , Laurent Girin , Rémy Vincent , Olivier Perrotin
SSW 2023 - 12th ISCA Speech Synthesis Workshop (SSW2023), Aug 2023, Grenoble, France. pp.240-241
Conference paper hal-04274170v1

BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model

Brooke Stephenson , Laurent Besacier , Laurent Girin , Thomas Hueber
Interspeech 2022 - 23rd Annual Conference of the International Speech Communication Association, Sep 2022, Incheon, South Korea. pp.3383-3387, ⟨10.21437/Interspeech.2022-10116⟩
Conference paper hal-03791472v1

Repeat after Me: Self-Supervised Learning of Acoustic-to-Articulatory Mapping by Vocal Imitation

Marc-Antoine Georges , Julien Diard , Laurent Girin , Jean-Luc Schwartz , Thomas Hueber
ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. pp.8252-8256, ⟨10.1109/ICASSP43922.2022.9747804⟩
Conference paper hal-03688189v1

Les auto-encodeurs variationnels dynamiques et leur application à la modélisation de spectrogrammes de parole

Laurent Girin , Xiaoyu Bie , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda
JEP 2022 - 34e Journées d’Études sur la Parole, Université de Nantes, Jun 2022, Noirmoutier, France. pp.655-663, ⟨10.21437/JEP.2022-69⟩
Conference paper hal-03978396v1

Learning and controlling the source-filter representation of speech with a variational autoencoder

Samir Sadok , Simon Leglaive , Laurent Girin , Xavier Alameda-Pineda , Renaud Seguier
CFA 2022 - 16ème Congrès Français d'Acoustique, Société Française d'Acoustique (SFA), Apr 2022, Marseille, France
Conference paper hal-03603791v1

Improved feature extraction for CRNN-based multiple sound source localization

Pierre-Amaury Grumiaux , Srdan Kitić , Laurent Girin , Alexandre Guérin
EUSIPCO 2021 - 29th European Signal Processing Conference (EUSIPCO), Aug 2021, Dublin, Ireland. pp.231-235, ⟨10.23919/EUSIPCO54536.2021.9616124⟩
Conference paper hal-03537334v1

A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling

Xiaoyu Bie , Laurent Girin , Simon Leglaive , Thomas Hueber , Xavier Alameda-Pineda
Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.46-50, ⟨10.21437/Interspeech.2021-256⟩
Conference paper hal-03295657v1

Learning robust speech representation with an articulatory-regularized variational autoencoder

Marc-Antoine Georges , Laurent Girin , Jean-Luc Schwartz , Thomas Hueber
Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3345-3349, ⟨10.21437/Interspeech.2021-1604⟩
Conference paper hal-03373252v1

Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain

Pierre-Amaury Grumiaux , Srdan Kitić , Prerak Srivastava , Laurent Girin , Alexandre Guérin
WASPAA 2021 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2021, New Paltz / Virtual, United States. pp.336-340, ⟨10.1109/WASPAA52581.2021.9632737⟩
Conference paper hal-03537340v1

High-resolution speaker counting in reverberant rooms using CRNN with Ambisonics features

Pierre-Amaury Grumiaux , Srdan Kitic , Laurent Girin , Alexandre Guerin
EUSIPCO 2020 - 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. pp.71-75, ⟨10.23919/Eusipco47968.2020.9287637⟩
Conference paper hal-03537323v1

Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input

Brooke Stephenson , Thomas Hueber , Laurent Girin , Laurent Besacier
Interspeech 2021 - 22nd Annual Conference of the International Speech Communication Association, Aug 2021, Brno, Czech Republic. pp.3865-3869, ⟨10.21437/Interspeech.2021-275⟩
Conference paper hal-03372802v1

Towards an articulatory-driven neural vocoder for speech synthesis

Marc-Antoine Georges , Pierre Badin , Julien Diard , Laurent Girin , Jean-Luc Schwartz
ISSP 2020 - 12th International Seminar on Speech Production, Dec 2020, Providence (virtual), United States
Conference paper hal-03184762v1

Multichannel source counting with CRNN : analysis of the performance

Pierre-Amaury Grumiaux , Srdan Kitic , Laurent Girin , Alexandre Guérin
Forum Acusticum 2020, Dec 2020, Lyon (virtual), France. pp.829-835, ⟨10.48465/fa.2020.0766⟩
Conference paper hal-03235360v1

A Recurrent Variational Autoencoder for Speech Enhancement

Simon Leglaive , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
ICASSP 2020 - IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE, May 2020, Barcelona (virtual), Spain. pp.371-375, ⟨10.1109/ICASSP40776.2020.9053164⟩
Conference paper hal-02329000v2

What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS

Brooke Stephenson , Laurent Besacier , Laurent Girin , Thomas Hueber
Interspeech 2020 - 21st Annual Conference of the International Speech Communication Association, Oct 2020, Shanghai (Virtual Conf), China. pp.215-219, ⟨10.21437/Interspeech.2020-2103⟩
Conference paper hal-02962234v1

Autoencoders for music sound modeling : a comparison of linear, shallow, deep, recurrent and variational models

Fanny Roche , Thomas Hueber , Samuel Limier , Laurent Girin
SMC 2019 - 16th Sound & Music Computing Conference, May 2019, Malaga, Spain
Conference paper hal-02349406v1

Semi-supervised multichannel speech enhancement with variational autoencoders and non-negative matrix factorization

Simon Leglaive , Laurent Girin , Radu Horaud
ICASSP 2019 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.101-105, ⟨10.1109/ICASSP.2019.8683704⟩
Conference paper hal-02005102v2

Bayesian time-domain multiple sound source localization for a stochastic machine

Raphael Frisch , Marvin Faix , Jacques Droulez , Laurent Girin , Emmanuel Mazer
EUSIPCO 2019 - 27th European Signal Processing Conference, Sep 2019, A Coruna, Spain. pp.1-5, ⟨10.23919/EUSIPCO.2019.8902666⟩
Conference paper hal-02377220v1

Speech enhancement with variational autoencoders and alpha-stable distributions

Simon Leglaive , Umut Şimşekli , Antoine Liutkus , Laurent Girin , Radu Horaud
ICASSP 2019 - 44th IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545, ⟨10.1109/ICASSP.2019.8682546⟩
Conference paper hal-02005106v1

Notes on the use of variational autoencoders for speech and audio spectrogram modeling

Laurent Girin , Fanny Roche , Thomas Hueber , Simon Leglaive
DAFx 2019 - 22nd International Conference on Digital Audio Effects, Sep 2019, Birmingham, United Kingdom. pp.1-8
Conference paper hal-02349385v1

Audio-Visual Variational Fusion for Multi-Person Tracking with Robots

Xavier Alameda-Pineda , Soraya Arias , Yutong Ban , Guillaume Delorme , Laurent Girin
ACMMM 2019 - 27th ACM International Conference on Multimedia, Oct 2019, Nice, France. pp.1059-1061, ⟨10.1145/3343031.3350590⟩
Conference paper hal-02354514v1

A variance modeling framework based on variational autoencoders for speech enhancement

Simon Leglaive , Laurent Girin , Radu Horaud
MLSP 2018 - IEEE 28th International Workshop on Machine Learning for Signal Processing, Sep 2018, Aalborg, Denmark. pp.1-6, ⟨10.1109/MLSP.2018.8516711⟩
Conference paper hal-01832826v1

Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking

Yutong Ban , Xiaofei Li , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.6553-6557, ⟨10.1109/ICASSP.2018.8462100⟩
Conference paper hal-01718114v1

Multisource MINT Using the Convolutive Transfer Function

Xiaofei Li , Sharon Gannot , Laurent Girin , Radu Horaud
ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Alberta, Canada. pp.756-760, ⟨10.1109/ICASSP.2018.8462607⟩
Conference paper hal-01718106v1

A Cascaded Multiple-Speaker Localization and Tracking System

Xiaofei Li , Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud
IWAENC - LOCATA Challenge Workshop - a satellite event of IWAENC 2018, Sep 2018, Tokyo, Japan. pp.1-5
Conference paper hal-01957137v1

Online Localization of Multiple Moving Speakers in Reverberant Environments

Xiaofei Li , Bastien Mourgue , Laurent Girin , Sharon Gannot , Radu Horaud
SAM 2018 - 10th IEEE Workshop on Sensor Array and Multichannel Signal Processing, Jul 2018, Sheffield, United Kingdom. pp.405-409, ⟨10.1109/SAM.2018.8448423⟩
Conference paper hal-01795462v1

Autonomous Sensorimotor Learning for Sound Source Localization by a Humanoid Robot

Quan Nguyen , Laurent Girin , Gérard Bailly , Frédéric Elisei , Duc-Canh Nguyen
IROS 2018 - Workshop on Crossmodal Learning for Intelligent Robotics in conjunction with IEEE/RSJ IROS, Oct 2018, Madrid, Spain
Conference paper hal-01921882v1

On the Use of Latent Mixing Filters in Audio Source Separation

Laurent Girin , Roland Badeau
LVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.225-235, ⟨10.1007/978-3-319-53547-0_22⟩
Conference paper hal-01400965v1

Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

Xiaofei Li , Laurent Girin , Radu Horaud
ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.541-545, ⟨10.1109/ICASSP.2017.7952214⟩
Conference paper hal-01430754v1

An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function

Xiaofei Li , Laurent Girin , Radu Horaud
WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.56-60, ⟨10.1109/WASPAA.2017.8169994⟩
Conference paper hal-01568818v1

Exploiting the Complementarity of Audio and Visual Data in Multi-Speaker Tracking

Yutong Ban , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud
ICCVW 2017 - IEEE International Conference on Computer Vision Workshops, Oct 2017, Venice, Italy. pp.446-454, ⟨10.1109/ICCVW.2017.60⟩
Conference paper hal-01577965v1

Adaptation of a Gaussian Mixture Regressor to a New Input Distribution: Extending the C-GMR Framework

Laurent Girin , Thomas Hueber , Xavier Alameda-Pineda
LVA/ICA 2017 - 13th International Conference on Latent Variable Analysis and Signal Separation, Feb 2017, Grenoble, France. pp.459-468, ⟨10.1007/978-3-319-53547-0_43⟩
Conference paper hal-01646098v1

Explaining the Parameterized Wiener Filter with Alpha-Stable Processes

Mathieu Fontaine , Antoine Liutkus , Laurent Girin , Roland Badeau
WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, New York, United States
Conference paper hal-01548508v1

A Bayesian stochastic machine for sound source localization

Raphael Frisch , Raphaël Laurent , Marvin Faix , Laurent Girin , Laurent Fesquet
ICRC 2017 - IEEE International Conference on Rebooting Computing, Nov 2017, Washington, DC, United States. pp.1-8
Conference paper hal-01644346v1

An EM Algorithm for Joint Source Separation and Diarisation of Multichannel Convolutive Speech Mixtures

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud
ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.16-20, ⟨10.1109/ICASSP.2017.7951789⟩
Conference paper hal-01430761v1

Exploiting the Intermittency of Speech for Joint Separation and Diarization

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Radu Horaud , Sharon Gannot
WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.41-45, ⟨10.1109/WASPAA.2017.8169991⟩
Conference paper hal-01568813v1

Non-Stationary Noise Power Spectral Density Estimation Based on Regional Statistics

Xiaofei Li , Laurent Girin , Sharon Gannot , Radu Horaud
ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Mar 2016, Shanghai, China. pp.181-185, ⟨10.1109/ICASSP.2016.7471661⟩
Conference paper hal-01250892v1

An Inverse-Gamma Source Variance Prior with Factorized Parameterization for Audio Source Separation

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud
ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, IEEE Signal Processing Society, Mar 2016, Shanghai, China. pp.136-140, ⟨10.1109/ICASSP.2016.7471652⟩
Conference paper hal-01253169v1

Voice Activity Detection Based on Statistical Likelihood Ratio With Adaptive Thresholding

Xiaofei Li , Radu Horaud , Laurent Girin , Sharon Gannot
IWAENC 2016 - International Workshop on Acoustic Signal Enhancement (IWAENC), Sep 2016, Xi'an, China. pp.1-5, ⟨10.1109/IWAENC.2016.7602911⟩
Conference paper hal-01349776v1

Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function

Xiaofei Li , Laurent Girin , Fabien Badeig , Radu Horaud
IEEE/RSJ International Conference on Intelligent Robots and Systems, IEEE, Oct 2016, Daejeon, South Korea. pp.2819-2826, ⟨10.1109/IROS.2016.7759437⟩
Conference paper hal-01349771v1

Deep neural networks for automatic detection of screams and shouted speech in subway trains

Pierre Laffitte , David Sodoyer , Charles Tatkeu , Laurent Girin
ICASSP 2016 - 41st IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2016, Shanghai, China. pp.6460-6464, ⟨10.1109/ICASSP.2016.7472921⟩
Conference paper hal-01385272v1

Estimation of Relative Transfer Function in the Presence of Stationary Noise Based on Segmental Power Spectral Density Matrix Subtraction

Xiaofei Li , Laurent Girin , Radu Horaud , Sharon Gannot
ICASSP 2015 - 40th IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2015, Brisbane, Australia. pp.320 - 324, ⟨10.1109/ICASSP.2015.7177983⟩
Conference paper hal-01119186v1

A Variational EM Algorithm for the Separation of Moving Sound Sources

Dionyssos Kounades-Bastian , Laurent Girin , Xavier Alameda-Pineda , Sharon Gannot , Radu Horaud
WASPAA 2015 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE Signal Processing Society, Oct 2015, New Paltz, NY, United States. pp.1-5, ⟨10.1109/WASPAA.2015.7336936⟩
Conference paper hal-01169764v2

Local Relative Transfer Function for Sound Source Localization

Xiaofei Li , Radu Horaud , Laurent Girin , Sharon Gannot
EUSIPCO 2015 - 23rd European Signal Processing Conference, Aug 2015, Nice, France. pp.399-403, ⟨10.1109/EUSIPCO.2015.7362413⟩
Conference paper hal-01163675v1

Real-time Control of a DNN-based Articulatory Synthesizer for Silent Speech Conversion: a pilot study

Florent Bocquelet , Thomas Hueber , Laurent Girin , Christophe Savariaux , Blaise Yvert
Interspeech 2015 - 16th Annual Conference of the International Speech Communication Association, Sep 2015, Dresden, Germany
Conference paper hal-01726265v1

Sound Representation and Classification Benchmark for Domestic Robots

Maxime Janvier , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
ICRA 2014 - IEEE International Conference on Robotics and Automation, May 2014, Hong Kong, China. pp.6285-6292, ⟨10.1109/ICRA.2014.6907786⟩
Conference paper hal-00952092v1

Perceptual coding-based informed source separation

Serap Kirbiz , Alexey Ozerov , Antoine Liutkus , Laurent Girin
EUSIPCO 2014 - 22nd European Signal Processing Conference, Sep 2014, Lisbon, Portugal
Conference paper hal-01016314v1

Mapping Sounds on Images Using Binaural Spectrograms

Antoine Deleforge , Vincent Drouard , Laurent Girin , Radu Horaud
EUSIPCO 2014 - 22nd European Signal Processing Conference, Sep 2014, Lisbon, Portugal. pp.2470-2474
Conference paper hal-01019287v1

Robust Articulatory Speech Synthesis using Deep Neural Networks for BCI Applications

Florent Bocquelet , Thomas Hueber , Laurent Girin , Pierre Badin , Blaise Yvert
Interspeech 2014 - 15th Annual Conference of the International Speech Communication Association, Sep 2014, Singapore, Singapore
Conference paper hal-01228891v1

Supervised Classification of Baboon Vocalizations

Maxime Janvier , Radu Horaud , Laurent Girin , Frédéric Berthommier , Louis-Jean Boë
NIPS4B - Workshop: Neural Information Processing Scaled for Bioacoustics : NIPS4B, Dec 2013, Lake Tahoe, Nevada, United States. 10 p
Conference paper hal-00910104v1

Informed Source Separation from compressed mixtures using spatial wiener filter and quantization noise estimation

Shuhua Zhang , Laurent Girin , Antoine Liutkus
ICASSP 2013 - 38th IEEE International Conference on Acoustics, Speech and Signal Processing, May 2013, Vancouver, Canada. pp.61-65, ⟨10.1109/ICASSP.2013.6637609⟩
Conference paper hal-00940328v1

Phase-based informed source separation for active listening of music

Nicolas Sturmel , Laurent Daudet , Laurent Girin
DAFx 2012 - 15th International Conference on Digital Audio Effects, Sep 2012, York, United Kingdom. pp.n/c
Conference paper hal-00807001v1

A Simple Hybrid Acoustic / Morphologically Constrained Technique for the Synthesis of Stop Consonants in Various Vocalic Contexts

Frédéric Berthommier , Laurent Girin , Louis-Jean Boë
Interspeech 2012 - 13th Annual Conference of the International Speech Communication Association, Sep 2012, Portland, United States. pp.Thu.P10a.05
Conference paper hal-00807519v1

Sound-Event Recognition with a Companion Humanoid

Maxime Janvier , Xavier Alameda-Pineda , Laurent Girin , Radu Horaud
Humanoids 2012 - IEEE International Conference on Humanoid Robotics, Nov 2012, Osaka, Japan. pp.104-111, ⟨10.1109/HUMANOIDS.2012.6651506⟩
Conference paper hal-00768767v1

Professionally-produced music separation guided by covers

Timothée Gerber , Martin Dutasta , Laurent Girin , Cédric Févotte
ISMIR 2012 - International Society for Music Information Retrieval Conference, Oct 2012, Porto, Portugal. pp.n/c
Conference paper hal-00807027v1

Linear Mixing Models for Active Listening of Music Productions in Realistic Studio Conditions

Nicolas Sturmel , Antoine Liutkus , Jonathan Pinel , Laurent Girin , Sylvain Marchand
AES 2012 - 132nd AES Convention, Apr 2012, Budapest, Hungary. Paper 8594
Conference paper hal-00790783v1

Informed Audio Source Separation: A Comparative Study

Antoine Liutkus , Stanislaw Gorlow , Nicolas Sturmel , Shuhua Zhang , Laurent Girin
EUSIPCO 2012 - 20th European Signal Processing Conference, Aug 2012, Bucharest, Romania. pp.n/c
Conference paper hal-00809525v1

DReaM: A Novel System for Joint Source Separation and Multi-Track Coding

Sylvain Marchand , Roland Badeau , Cléo Baras , Laurent Daudet , Dominique Fourer
AES 2012 - 133rd AES Convention, Oct 2012, San Francisco, United States. CD 133papers
Conference paper hal-00809503v1

An informed source separation system for speech signals

Shuhua Zhang , Laurent Girin
Interspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.573-576
Conference paper hal-00695758v1

Informed audio source separation from compressed linear stereo mixtures

Laurent Girin , Jonathan Pinel
AES 2011 - 42nd International Conference: Semantic Audio, Jul 2011, Ilmenau, Germany. pp.159-168
Conference paper hal-00695724v1

A long-term harmonic plus noise model for speech signals

Faten Ben Ali , Laurent Girin , Sonia Djaziri-Larbi
Interspeech 2011 - 12th Annual Conference of the International Speech Communication Association, Aug 2011, Florence, Italy. pp.53-56
Conference paper hal-00695752v1

"Sparsification" of audio signals using the MDCT/IntMDCT and a psychoacoustic model - Application to informed audio source separation

Jonathan Pinel , Laurent Girin
AES 2011 - 42nd International Conference: Semantic Audio, Jul 2011, Ilmenau, Germany. pp.179-188
Conference paper hal-00695730v1

A high-rate data hiding technique for audio signals based on IntMDCT quantization

Jonathan Pinel , Laurent Girin
DAFx 2011 - 14th International Conference on Digital Audio Effects, Sep 2011, Paris, France. pp.353-356
Conference paper hal-00695759v1

Interactive Music with Active Audio CDs

Sylvain Marchand , Boris Mansencal , Laurent Girin
CMMR 2010 - 7th International Symposium on Computer Music Modeling and Retrieval, Jun 2010, Málaga, Spain. pp.73-74
Conference paper hal-00502792v1

Hybrid coding/indexing strategy for informed source separation of linear instantaneous under-determined audio mixtures

Mathieu Parvaix , Laurent Girin , Laurent Daudet , Jonathan Pinel , Cléo Baras
ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010
Conference paper hal-00535684v1

Long-term modelling of parameters trajectories for the harmonic plus noise model of speech signals

Faten Ben Ali , Laurent Girin , Sonia Djaziri-Larbi
ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010
Conference paper hal-00534497v1

Une technique de tatouage " haute-capacité " pour signaux musicaux au format CD-audio

Jonathan Pinel , Laurent Girin , Cléo Baras
CFA 2010 - 10ème Congrès Français d'Acoustique, Apr 2010, Lyon, France
Conference paper hal-00542884v1

Séparation de source informée pour des mélanges stéréo instantanés utilisant un tatouage de l'index des sources localement prédominantes

Mathieu Parvaix , Laurent Girin
CFA 2010 - 10ème Congrès Français d'Acoustique, Apr 2010, Lyon, France. pp.Cd-Rom
Conference paper hal-00486818v1

Linking Motion Sensors and Digital Signal Processing for Real-Time Musical Transformations

Mathieu Mazuel , Dominique David , Laurent Girin
HAID 2010 - 5th International Workshop Haptic Audio Interaction Design, Sep 2010, Copenhagen, Denmark. pp.HAID2010
Conference paper hal-00535694v1

A high-capacity watermarking technique for audio signals based on MDCT-domain quantization

Jonathan Pinel , Laurent Girin , Cléo Baras , Mathieu Parvaix
ICA 2010 - 20th International Congress on Acoustics, Aug 2010, Sydney, Australia. pp.ICA2010
Conference paper hal-00534502v1

Informed source separation of underdetermined instantaneous stereo mixtures using source index embedding

Mathieu Parvaix , Laurent Girin
ICASSP 2010 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2010, Dallas, United States. pp.245-248
Conference paper hal-00486804v1

A watermarking-based method for single-channel audio source separation

Mathieu Parvaix , Laurent Girin , Jean-Marc Brossier
ICASSP 2009 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, Taiwan. pp.1
Conference paper hal-00361713v1

Estimation of the Voicing Cut-Off Frequency Contour of Natural Speech Based on Harmonic and Aperiodic Energies

Kris Hermus , Laurent Girin , Hugo van Hamme , Sufian Irhimeh
ICASSP 2008 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, Nevada, United States
Conference paper hal-00329764v1

Long-Term Flexible 2D Cepstral Modeling of Speech Spectral Amplitudes

Laurent Girin , Mohammad Firouzmand
ICASSP 2008 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2008, Las Vegas, Nevada, United States
Conference paper hal-00329752v1

Development and comparison of two approaches for visual speech analysis with application to voice activity detection

Bertrand Rivet , Andrew Aubrey , Laurent Girin , Yulia Hicks , Christian Jutten
AVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.228-232
Conference paper hal-00195015v1

Using a Visual Voice Activity Detector to Regularize the Permutations in Blind Separation of Convolutive Speech Mixtures

Bertrand Rivet , Laurent Girin , Christine Serviere , Dinh-Tuan Pham , Christian Jutten
DSP 2007 - 15th IEEE International Conference on Digital Signal Processing, Jul 2007, Cardiff, United Kingdom. pp.223-226, ⟨10.1109/ICDSP.2007.4288559⟩
Conference paper hal-00173341v1

Long-term quantization of speech LSF parameters

Laurent Girin
ICASSP 2007 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2007, Honolulu, Hawaii, United States. pp.845
Conference paper hal-00194157v1

Audiovisual speech source separation: a regularization method based on visual voice activity detection

Bertrand Rivet , Laurent Girin , Christine Serviere , Dinh-Tuan Pham , Christian Jutten
AVSP 2007 - 6th International Conference on Auditory-Visual Speech Processing, Aug 2007, Hilvarenbeek, Netherlands. pp.223-227
Conference paper hal-00195014v1

Two novel visual voice activity detectors based on appearance models and retinal filtering

Andrew Aubrey , Bertrand Rivet , Yulia Hicks , Laurent Girin , Jonathon Chambers
EUSIPCO 2007 - 15th European Signal Processing Conference, Sep 2007, Poznan, Poland
Conference paper hal-00188132v1

ARTUS : calcul et tatouage audiovisuel des mouvements d'un personnage animé virtuel pour l'accessibilité d'émissions télévisuelles aux téléspectateurs sourds comprenant la Langue Française Parlée Complétée

Gérard Bailly , Cléo Baras , Patrick Bas , Séverine Baudry , Denis Beautemps
Handicap, Jun 2006, Paris, France. pp.265-270
Conference paper hal-00366492v1

Theoretical and experimental bases of a new method for accurate separation of harmonic and noise components of speech signals

Laurent Girin
European Signal Processing Conference (EUSIPCO), Sep 2006, Florence, Italy. pp.1
Conference paper hal-00372280v1

An analysis of visual speech information applied to voice activity detection

David Sodoyer , Bertrand Rivet , Laurent Girin , Jean-Luc Schwartz , Christian Jutten
IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2006, Toulouse, France. pp.1
Conference paper hal-00361750v1

Solving the indeterminations of Blind source separation of convolutive speech mixtures

David Sodoyer , Christian Jutten , Laurent Girin , Jean-Luc Schwartz , Bertrand Rivet
IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2006, 2006, Toulouse, France
Conference paper hal-00098153v1

Comparing Several Models for Perceptual Long-Term Modeling of Amplitudes and Phase Trajectories of Sinusoidal Speech

M. Firouzmand , Laurent Girin , Sylvain Marchand
Proceedings of the INTERSPEECH - EUROSPEECH Conference, Sep 2005, Portugal. pp.357-360
Conference paper hal-00308299v1

A Generalized Polynomial And Sinusoidal Model For Partial Tracking And Time Stretching

Martin Raspaud , Sylvain Marchand , Laurent Girin
Proceedings of the Digital Audio Effects (DAFx05) Conference, Sep 2005, Spain. pp.24-29
Conference paper hal-00307987v1

Long Term Modeling of Phase Trajectories within the Speech Sinusoidal Model Framework

Laurent Girin , Mohammad Firouzmand , Sylvain Marchand
INTERSPEECH - 8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472
Conference paper hal-00308298v1

Characterizing and classifying Cued Speech vowels from labial parameters

Denis Beautemps , Thomas Burger , Laurent Girin
8th International Conference on Spoken Language Processing (ICSLP'04 or InterSpeech'04), 2004, Jeju, South Korea
Conference paper hal-00328134v1

Watermarking of Speech Signals Using the Sinusoidal Model and Frequency Modulation of the Partials

Laurent Girin , Sylvain Marchand
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2004, Canada. pp.I - 633-6
Conference paper hal-00308297v1

Comparing the Order of a Polynomial Phase Model for the Synthesis of Quasi-Harmonic Audio Signals

Laurent Girin , Sylvain Marchand , Joseph Di Martino , Axel Röbel , Geoffroy Peeters
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics - WASPAA'03, Oct 2003, New York, United States. pp.193-196
Conference paper hal-00308296v1

Audio source separation into the wild

Laurent Girin , Sharon Gannot , Xiaofei Li
Multimodal Behavior Analysis in the Wild, Academic Press (Elsevier), pp.53-78, 2018, Computer Vision and Pattern Recognition, ⟨10.1016/B978-0-12-814601-9.00022-5⟩
Book chapter hal-01943375v1