Skip to Main content
Number of documents

46

Romain Serizel - Associate Professor (Université de Lorraine, Loria)


I was born in Lille, France. I received the M.Eng. degree in Automatic System Engineering from ENSEM (Nancy, France) in 2005 and the M.Sc. degree in Signal Processing from Université Rennes 1 (Rennes, France) in 2006. I received the Ph.D. degree in Engineering Sciences from the Katholieke Universiteit Leuven (KUL), Belgium in June 2011. From June 2011 to December 2012 I was a research assistant with Prof. Marc Moonen research group at the Electrical Engineering Department (ESAT-SCD) of the KUL. From January 2013 to September 2014 i was a postdoctoral researcher with the Human Language Technology (HLT) group at the Fondazione Bruno Kessler (FBK) in Trento (Italy). From October 2014 to August 2016 I was a postdoctoral researcher with the department of signal and image processing (TSI) at telecom ParisTech in Paris (France)  where I am working on speaker diarization, speaker recognition and respresentation learning for audio in the framework of the LASIE project. I am now an assistant professor at Université de Lorraine doing my research in Laboratoire lorrain de recherche en informatique et ses applications (Loria) with the Multispeech research group and teaching at IUT Charlemagne.


Journal articles11 documents

  • Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert. Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.3008974⟩. ⟨hal-02372579v4⟩
  • Antoine Deleforge, Diego Di Carlo, Martin Strauss, Romain Serizel, Lucio Marcenaro. Audio-Based Search and Rescue with a Drone: Highlights from the IEEE Signal Processing Cup 2019 Student Competition. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, 2019, 36 (5), pp.138-144. ⟨10.1109/MSP.2019.2924687⟩. ⟨hal-02161897⟩
  • Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin. CRNN-based multiple DoA estimation using acoustic intensity features for Ambisonics recordings. IEEE Journal of Selected Topics in Signal Processing, IEEE, 2019, Special Issue on Acoustic Source Localization and Tracking in Dynamic Real-life Scenes, 13 (1), pp.22-33. ⟨10.1109/jstsp.2019.2900164⟩. ⟨hal-01839883v2⟩
  • Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan. Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments. Computer Speech and Language, Elsevier, 2018, 49, pp.37-51. ⟨10.1016/j.csl.2017.11.003⟩. ⟨hal-01634449⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (6), pp.1216 - 1229. ⟨10.1109/TASLP.2017.2690570⟩. ⟨hal-01362864v2⟩
  • Romain Serizel, Marc Moonen, Bas van Dijk, Jan Wouters. Low-rank Approximation Based Multichannel Wiener Filter Algorithms for Noise Reduction with Application in Cochlear Implants. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2014, 22, pp.785 - 799. ⟨10.1109/TASLP.2014.2304240⟩. ⟨hal-01390918⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Jensen. Binaural Integrated Active Noise Control and Noise Reduction in Hearing Aids . IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2013, 21 (5), pp.1113-1118. ⟨hal-01393937⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. A Speech Distortion Weighting Based Approach to Integrated Active Noise Control and Noise Reduction in Hearing Aids. Signal Processing, Elsevier, 2013, 93 (9), pp.2440-2452. ⟨hal-01393931⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Jensen. A Zone of Quiet Based Approach to Integrated Active Noise Control and Noise Reduction for Speech Enhancement in Hearing Aids. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2012, 20 (6), pp.1685 - 1697. ⟨hal-01393939⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Output SNR analysis of integrated active noise control and noise reduction in hearing aids under a single speech source scenario. Signal Processing, Elsevier, 2011, 91 (8), pp.1719-1729. ⟨hal-01393940⟩
  • Daniel Menard, Romain Serizel, Romuald Rocher, Olivier Sentieys. Accuracy Constraint Determination in Fixed-Point System Design. EURASIP Journal on Embedded Systems, SpringerOpen, 2008, 2008 (1), ⟨10.1155/2008/242584⟩. ⟨inria-00459254⟩

Conference papers28 documents

  • Michel Olvera, Emmanuel Vincent, Romain Serizel, Gilles Gasso. Foreground-Background Ambient Sound Scene Separation. 28th European Signal Processing Conference (EUSIPCO), Jan 2021, Amsterdam, Netherlands. ⟨hal-02567542v2⟩
  • Romain Serizel. A brief introduction to multichannel noise reduction with deep neural networks. SpiN 2020 - 12th Speech in Noise Workshop, Jan 2020, Toulouse, France. ⟨hal-02506387⟩
  • Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid. DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02389159v3⟩
  • Romain Serizel, Nicolas Turpault, Ankit Shah, Justin Salamon. Sound event detection in synthetic domestic environments. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02355573v2⟩
  • Nicolas Turpault, Romain Serizel, Emmanuel Vincent. Limitations of weak labels for embedding and tagging. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02467401v3⟩
  • Nicolas Turpault, Romain Serizel, Emmanuel Vincent. Semi-supervised triplet loss based learning of ambient audio embeddings. ICASSP 2019, May 2019, Brighton, United Kingdom. ⟨hal-02025824⟩
  • Nicolas Turpault, Romain Serizel, Ankit Parag Shah, Justin Salamon. Sound event detection in domestic environments with weakly labeled data and soundscape synthesis. Workshop on Detection and Classification of Acoustic Scenes and Events, Oct 2019, New York City, United States. ⟨hal-02160855v2⟩
  • Romain Serizel, Nicolas Turpault. Sound Event Detection from Partially Annotated Data: Trends and Challenges. IcETRAN conference, Jun 2019, Srebrno Jezero, Serbia. ⟨hal-02114652v2⟩
  • Lauréline Perotin, Alexandre Défossez, Emmanuel Vincent, Romain Serizel, Alexandre Guérin. Regression versus classification for neural network based audio source localization. WASPAA 2019 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, IEEE, Oct 2019, New Paltz, United States. ⟨hal-02125985v2⟩
  • Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert. Multiple-input neural network-based residual echo suppression. ICASSP 2018 - IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2018, Calgary, Canada. pp.1-5. ⟨hal-01723630v2⟩
  • Romain Serizel, Nicolas Turpault, Hamid Eghbal-Zadeh, Ankit Parag Shah. Large-Scale Weakly Labeled Semi-Supervised Sound Event Detection in Domestic Environments. Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2018, Woking, United Kingdom. ⟨hal-01850270⟩
  • Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin. CRNN-based joint azimuth and elevation localization with the Ambisonics intensity vector. IWAENC 2018 - 16th International Workshop on Acoustic Signal Enhancement, Sep 2018, Tokyo, Japan. ⟨hal-01840453⟩
  • Lauréline Perotin, Romain Serizel, Emmanuel Vincent, Alexandre Guérin. Multichannel speech separation with recurrent neural networks from high-order ambisonics recordings. 43rd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2018), Apr 2018, Calgary, Canada. ⟨hal-01699759v2⟩
  • Mathieu Fontaine, Fabian-Robert Stöter, Antoine Liutkus, Umut Simsekli, Romain Serizel, et al.. Multichannel Audio Modeling with Elliptically Stable Tensor Decomposition. LVA/ICA: Latent Variable Analysis and Signal Separation, Jul 2018, Surrey, United Kingdom. pp.13-23, ⟨10.1007/978-3-319-93764-9_2⟩. ⟨lirmm-01766795⟩
  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. ⟨hal-01484744⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Nonnegative Feature Learning Methods for Acoustic Scene Classification. DCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany. ⟨hal-01636627⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Leveraging deep neural networks with nonnegative representations for improved environmental sound classification. IEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan. ⟨hal-01576857⟩
  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Machine listening techniques as a complement to video image analysis in forensics. IEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩. ⟨hal-01393959⟩
  • Romain Serizel, Slim Essid, Gael Richard. Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerne, Italy. ⟨hal-01393964⟩
  • Romain Serizel, Slim Essid, Gael Richard. Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China. ⟨hal-01393968⟩
  • Romain Serizel, Diego Giuliani. Deep neural network adaptation for children's and adults' speech recognition. Italian Computational Linguistics Conference (CLiC-it), Dec 2014, Pise, Italy. ⟨hal-01393975⟩
  • Romain Serizel, Diego Giuliani. Vocal tract length normalisation approaches to DNN-based children's and adults' speech recognition. 2014 IEEE Spoken Language Technology Workshop (SLT 2014), Dec 2014, South Lake Tahoe, CA, United States. pp.135-140, ⟨10.1109/SLT.2014.7078563⟩. ⟨hal-01393972⟩
  • Romain Serizel, Marc Moonen, Bas Dijk, Jan Wouters. Rank-1 Approximation Based Multichannel Wiener Filtering Algorithms For Noise Reduction In Cochlear Implants. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-01393980⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Holdt Jensen. Output SNR analysis of integrated active noise control and noise reduction in hearing aids under a single speech source scenario. European Signal Processing Conference (EUSIPCO), Aug 2010, Aalborg, Denmark. ⟨hal-01393977⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Jensen. A Zone of Quiet Based Approach to Integrated Active Noise Control and Noise Reduction in Hearing Aids. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2009, New Paltz, United States. ⟨hal-01393983⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Jensen. A Weighted Approach for Integrated Active Noise Control and Noise Reduction in Hearing Aids. European Signal Processing Conference (EUSIPCO), Aug 2009, Glasgow, United Kingdom. ⟨hal-01393985⟩
  • Romain Serizel, Marc Moonen, Jan Wouters, Søren Jensen. Combined Active Noise Control and noise reduction in Hearing Aids. International Workshop on Acoustic Echo and Noise Control (IWAENC), Sep 2008, Seattle, United States. ⟨hal-01393988⟩
  • Daniel Menard, Romain Serizel, Romuald Rocher, Olivier Sentieys. Noise model for Accuracy Constraint Determination in Fixed-Point Systems. Workshop on Design and Architectures for Signal and Image Processing DASIP 2007, Nov 2007, Grenoble, France. ⟨inria-00459290⟩

Book sections2 documents

  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Acoustic Features for Environmental Sound Analysis. Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩. ⟨hal-01575619⟩
  • Slim Essid, Sanjeel Parekh, Ngoc Duong, Romain Serizel, Alexey Ozerov, et al.. Multiview approaches to event detection and scene analysis. Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩. ⟨hal-01620341⟩

Patents1 document

  • Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert. Procédé de suppression d'écho résiduel dans un signal acoustique. France, N° de brevet: 1760200. 2017. ⟨hal-01638050⟩

Preprints, Working Papers, ...2 documents

  • Nicolas Turpault, Romain Serizel. Training Sound Event Detection On A Heterogeneous Dataset. 2020. ⟨hal-02891665⟩
  • Nicolas Turpault, Scott Wisdom, Hakan Erdogan, John Hershey, Romain Serizel, et al.. Improving Sound Event Detection In Domestic Environments Using Sound Separation. 2020. ⟨hal-02891700v2⟩

Reports2 documents

  • Md Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, et al.. UIAI System for Short-Duration Speaker Verification Challenge 2020. [Research Report] Short-duration Speaker Verification Challenge 2020. 2020. ⟨hal-02907037⟩
  • Guillaume Carbajal, Romain Serizel, Emmanuel Vincent, Eric Humbert. Joint NN-Supported Multichannel Reduction of Acoustic Echo, Reverberation and Noise: Supporting Document. [Research Report] RR-9303, INRIA Nancy; Invoxia SAS. 2019. ⟨hal-02372431v4⟩