Recherche - Archive ouverte HAL Accéder directement au contenu

Filtrer vos résultats

13 résultats
Image document

Apprentissage profond pour le rehaussement de la parole dans les antennes acoustiques ad-hoc

Nicolas Furnon
Informatique [cs]. Université de Lorraine, 2021. Français. ⟨NNT : 2021LORR0277⟩
Thèse tel-03598275v1
Image document

Distributed speech separation in spatially unconstrained microphone arrays

Nicolas Furnon , Romain Serizel , Irina Illina , Slim Essid
ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414758⟩
Communication dans un congrès hal-02985794v3
Image document

Towards an efficient computation of masks for multichannel speech enhancement

Louis Delebecque , Romain Serizel , Nicolas Furnon
2022
Pré-publication, Document de travail hal-03604983v1

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification

Romain Serizel , Slim Essid , Gael Richard
ICASSP, Mar 2016, Shangai, China. pp.5470 - 5474
Communication dans un congrès hal-02288453v1
Image document

SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays

Nicolas Furnon , Romain Serizel , Slim Essid , Irina Illina
2021
Pré-publication, Document de travail hal-04173974v1
Image document

DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays

Nicolas Furnon , Romain Serizel , Irina Illina , Slim Essid
ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Communication dans un congrès hal-02389159v3
Image document

Deep-neural network approaches for speech recognition with heterogeneous groups of speakers including children

Romain Serizel , Diego Giuliani
Natural Language Engineering, 2016, 1, pp.0 - 0
Article dans une revue hal-01390905v1

A brief introduction to deep neural networks and their application to automatic speech recognition

Romain Serizel
Séminaire de l'équipe Perception, Feb 2015, Grenoble, France
Communication dans un congrès hal-02287014v1

Multiview Approaches to Event Detection and Scene Analysis

Slim Essid , Sanjeel Parekh , Ngoc Q. K. Duong , Romain Serizel , Alexey Ozerov , et al.
Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, 2017
Chapitre d'ouvrage hal-02287697v1
Image document

The Speed Submission to DIHARD II: Contributions & Lessons Learned

Md Sahidullah , Jose Patino , Samuele Cornell , Ruiqing Yin , Sunit Sivasankaran , et al.
2019
Pré-publication, Document de travail hal-02352840v2
Image document

DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays

Nicolas Furnon , Romain Serizel , Slim Essid , Irina Illina
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021, 29, pp.2310 - 2323. ⟨10.1109/TASLP.2021.3092838⟩
Article dans une revue hal-02985867v3

Acoustic scene classification with matrix factorization for unsupervised feature learning

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
ICASSP, Mar 2016, Shangai, China
Communication dans un congrès hal-02287267v1
Image document

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes

Nicolas Furnon , Romain Serizel , Slim Essid , Irina Illina
EUSIPCO 2021 - 29th European Signal Processing Conference, IEEE, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616358⟩
Communication dans un congrès hal-03259801v1