Accéder directement au contenu

Slim Essid

156
Documents

Présentation

Publications

Image document

Pretext Tasks selection for multitask self-supervised speech representation learning

Salah Zaiem , Titouan Parcollet , Slim Essid , Abdelwahab Heba
IEEE Journal of Selected Topics in Signal Processing, 2022, 16 (6), pp.1439-1453. ⟨10.1109/JSTSP.2022.3195430⟩
Article dans une revue hal-03601330v1
Image document

DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays

Nicolas Furnon , Romain Serizel , Slim Essid , Irina Illina
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2021, 29, pp.2310 - 2323. ⟨10.1109/TASLP.2021.3092838⟩
Article dans une revue hal-02985867v3
Image document

Weakly Supervised Representation Learning for Audio-Visual Scene Analysis

Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Q. K. Duong , Patrick Pérez
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2019
Article dans une revue hal-02399993v1

Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction

Atef Ben Youssef , Chloé Clavel , Slim Essid
IEEE Transactions on Affective Computing, 2019
Article dans une revue hal-02288043v1

On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction

Atef Ben Youssef , Giovanna Varni , Slim Essid , Chloé Clavel
International Journal of Social Robotics, 2019
Article dans une revue hal-02288044v1

Audiovisual Analysis of Music Performances: Overview of an Emerging Field

Zhiyao Duan , Slim Essid , Cynthia Liem , Gael Richard , Gaurav Sharma
IEEE Signal Processing Magazine, 2019, 36 (1), pp.63-73
Article dans une revue hal-02287983v1

Audio-Visual Analysis of Music Performances

Zhiyao Duan , Slim Essid , Cynthia Liem , Gael Richard , Gaurav Sharma
IEEE Signal Processing Magazine, inPress
Article dans une revue hal-01893410v1
Image document

Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2017, 25 (6), pp.1216 - 1229. ⟨10.1109/TASLP.2017.2690570⟩
Article dans une revue hal-01362864v2

TPT-Dance&Actions : un corpus multimodal d'activités humaines

Aymeric Masurelle , Ahmed Rida Sekkat , Slim Essid , Gael Richard
Traitement du Signal, 2015, ⟨10.3166/TS.32.443-475⟩
Article dans une revue hal-02704820v1

Soft Nonnegative Matrix Co-Factorization

Nicolas Seichepine , Slim Essid , Cédric Févotte , Olivier Cappé
IEEE Transactions on Signal Processing, 2014, 62 (22), pp.5940-5949. ⟨10.1109/TSP.2014.2360141⟩
Article dans une revue hal-01116863v1

A Multimodal Approach to Speaker Diarization on TV Talk-Shows

Félicien Vallet , Slim Essid , Jean Carrive
IEEE Transactions on Multimedia, 2013, 15 (3), pp.509-520. ⟨10.1109/TMM.2012.2233724⟩
Article dans une revue hal-02943545v1

Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring

Slim Essid , Cédric Févotte
IEEE Transactions on Multimedia, 2013, 15 (2), pp.415-425. ⟨10.1109/TMM.2012.2228474⟩
Article dans une revue hal-02943541v1
Image document

Learning Optimal Features for Polyphonic Audio-to-Score Alignment

Cyril Joder , Slim Essid , Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, 2013
Article dans une revue hal-02704714v1

A multi-modal dance corpus for research into interaction between humans in virtual environments

Slim Essid , Marc Gowing , Georgios Kordelas , Anil Aksay , P. Kelly
Journal on Multimodal User Interfaces, 2012, pp.1-14. ⟨10.1007/s12193-012-0109-5⟩
Article dans une revue hal-02286487v1
Image document

A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching

Cyril Joder , Slim Essid , Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, 2011
Article dans une revue hal-02653026v1

Temporal Integration for Audio Classification With Application to Musical Instrument Classification

Cyril Joder , Slim Essid , Gael Richard
IEEE Transactions on Audio, Speech and Language Processing, 2009, 17, ⟨10.1109/TASL.2008.2007613⟩
Article dans une revue hal-02652782v1

Influence of prior knowledge on perceptual grouping

S. Essid
Perception, 2008, 37, pp.95
Article dans une revue hal-01440575v1

On the Correlation of Automatic Audio and Visual Segmentations of Music Videos

Olivier Gillet , Slim Essid , Gael Richard
IEEE Transactions on Circuits and Systems for Video Technology, 2007, 17, ⟨10.1109/TCSVT.2007.890831⟩
Article dans une revue hal-02652635v1

Musical instrument recognition by pairwise classification strategies

Gael Richard , Slim Essid , Bertrand David
IEEE Transactions on Audio, Speech and Language Processing, 2006, 14 (4), pp.1401- 1412. ⟨10.1109/TSA.2005.860842⟩
Article dans une revue hal-00477671v1

Instrument recognition in polyphonic music based on automatic taxonomies

Slim Essid , Gael Richard , Bertrand David
IEEE Transactions on Audio, Speech and Language Processing, 2006, 14 (1), pp.68-80. ⟨10.1109/TSA.2005.860351⟩
Article dans une revue hal-00477670v1
Image document

ONLINE SPEAKER DIARIZATION OF MEETINGS GUIDED BY SPEECH SEPARATION

Elio Gruttadauria , Mathieu Fontaine , Slim Essid
IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2024, Seoul (Korea), South Korea
Communication dans un congrès hal-04419041v1
Image document

ON THE CHOICE OF THE OPTIMAL TEMPORAL SUPPORT FOR AUDIO CLASSIFICATION WITH PRE-TRAINED EMBEDDINGS

Aurian Quelennec , Michel Olvera , Geoffroy Peeters , Slim Essid
ICASSP, IEEE, Apr 2024, Séoul, South Korea
Communication dans un congrès hal-04360221v1
Image document

Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis

Victor Letzelter , Mathieu Fontaine , Mickaël Chen , Patrick Pérez , Slim Essid
Advances in neural information processing systems, Dec 2023, New Orleans, United States
Communication dans un congrès hal-04216055v1
Image document

Fine-tuning strategies for faster inference using speech self-supervised models: a comparative study

Salah Zaiem , Robin Algayres , Titouan Parcollet , Slim Essid , Mirco Ravanelli
ICASSP 2023 - International Conference on Acoustics, Speech, and Signal Processing, Jun 2023, Rhodes, Greece
Communication dans un congrès hal-04076307v1
Image document

Cosmopolite Sound Monitoring (CoSMo): A Study of Urban Sound Event Detection Systems Generalizing to Multiple Cities

Florian Angulo , Slim Essid , Geoffroy Peeters , Christophe Mietlicki
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2023, Rhodes Island, Greece. pp.1-5, ⟨10.1109/ICASSP49357.2023.10095833⟩
Communication dans un congrès hal-04093374v1
Image document

Automatic Data Augmentation for Domain Adapted Fine-Tuning of Self-Supervised Speech Representations

Salah Zaiem , Titouan Parcollet , Slim Essid
INTERSPEECH 2023, Aug 2023, Dublin (Ireland), Ireland. pp.67-71, ⟨10.21437/Interspeech.2023-1040⟩
Communication dans un congrès hal-04216177v1
Image document

Speech Self-Supervised Representation Benchmarking: Are We Doing it Right?

Salah Zaiem , Youcef Kemiche , Titouan Parcollet , Slim Essid , Mirco Ravanelli
INTERSPEECH 2023, Aug 2023, Dublin, Ireland. pp.2873-2877, ⟨10.21437/Interspeech.2023-1087⟩
Communication dans un congrès hal-04216175v1

One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models

Yasser Benigmim , Subhankar Roy , Slim Essid , Vicky Kalogeiton , Stéphane Lathuilière
IEEE/CVF Conference on Computer Vision and Pattern Recognition- Workshop on Generative Models for Computer Vision, 2023, vancouver, Canada
Communication dans un congrès hal-04205024v1
Image document

A Repetition-based Triplet Mining Approach for Music Segmentation

Morgan Buisson , Brian Mcfee , Slim Essid , Helene-Camille Crayencour
International Society for Music Information Retrieval (ISMIR), Nov 2023, Milan, Italy
Communication dans un congrès hal-04202766v1
Image document

Opinions in Interactions : New Annotations of the SEMAINE Database

Valentin Barrière , Chloé Clavel , Slim Essid
LREC, Jun 2022, Marseille, France
Communication dans un congrès hal-04276012v1
Image document

Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning

Salah Zaiem , Titouan Parcollet , Slim Essid
Interspeech 2022, Sep 2022, Incheon, South Korea. pp.669-673, ⟨10.21437/interspeech.2022-10191⟩
Communication dans un congrès hal-03817736v1
Image document

Latent and Adversarial Data Augmentation for Sound Event Detection and Classification

David Perera , Slim Essid , Gaël Richard
International workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France
Communication dans un congrès hal-03782827v1
Image document

Impact de perturbations internes sur l'entraînement de réseaux profonds pour la détection d'évènements sonores

David Perera , Slim Essid , Gael Richard
Colloque Francophone de Traitement du Signal et des Images (GRETSI), Sep 2022, Nancy, France
Communication dans un congrès hal-03759651v1
Image document

Learning Multi-Level Representations for Hierarchical Music Structure Analysis

Morgan Buisson , Brian Mcfee , Slim Essid , Helene-Camille Crayencour
International Society for Music Information Retrieval (ISMIR), Dec 2022, Bengaluru, India
Communication dans un congrès hal-03780032v1

Conditional Independence for Pretext Task Selection in Self-Supervised Speech Representation Learning

Salah Zaiem , Titouan Parcollet , Slim Essid
Interspeech 2021, Aug 2021, Brno, Czech Republic. pp.2851-2855, ⟨10.21437/interspeech.2021-1027⟩
Communication dans un congrès hal-03601265v1
Image document

Distributed speech separation in spatially unconstrained microphone arrays

Nicolas Furnon , Romain Serizel , Irina Illina , Slim Essid
ICASSP 2021 - 46th International Conference on Acoustics, Speech, and Signal Processing, Jun 2021, Toronto / Virtual, Canada. ⟨10.1109/ICASSP39728.2021.9414758⟩
Communication dans un congrès hal-02985794v3
Image document

NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF

Giorgia Cantisani , Slim Essid , Gael Richard
2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021, Toronto (virtual conference), Canada. ⟨10.1109/ICASSP39728.2021.9413841⟩
Communication dans un congrès hal-02978978v4
Image document

Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes

Nicolas Furnon , Romain Serizel , Slim Essid , Irina Illina
EUSIPCO 2021 - 29th European Signal Processing Conference, IEEE, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616358⟩
Communication dans un congrès hal-03259801v1
Image document

User-guided one-shot deep model adaptation for music source separation

Giorgia Cantisani , Alexey Ozerov , Slim Essid , Gael Richard
2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), IEEE, Oct 2021, New Paltz, NY, United States
Communication dans un congrès hal-03219350v3
Image document

DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays

Nicolas Furnon , Romain Serizel , Irina Illina , Slim Essid
ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain
Communication dans un congrès hal-02389159v3

Decoding auditory attention in polyphonic music based on EEG: a new dataset and a preliminary study

Giorgia Cantisani , Slim Essid , Gael Richard
Auditory EEG Signal Processing (AESoP) symposium, Sep 2019, Leuven, Belgium
Communication dans un congrès hal-03175885v1
Image document

MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music

Giorgia Cantisani , Gabriel Trégoat , Slim Essid , Gael Richard
Speech, Music and Mind (SMM), Satellite Workshop of Interspeech 2019, Sep 2019, Vienna, Austria
Communication dans un congrès hal-02291882v3
Image document

Tracking beats and microtiming in afro-latin american music using conditional random fields and deep learning

Magdalena Fuentes , Lucas S Maia , Martín Rocamora , Luiz W P Biscainho , Hélène C Crayencour
ISMIR, Nov 2019, Delft, Netherlands
Communication dans un congrès hal-02419361v1
Image document

SAMBASET: a dataset of historical samba de Enredo recordings for computational music analysis

Lucas S Maia , Magdalena Fuentes , Luiz W P Biscainho , Martín Rocamora , Slim Essid
The 20th International Society for Music Information Retrieval Conference, Nov 2019, Delft, Netherlands
Communication dans un congrès hal-02943462v1
Image document

From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining

Alexandre Garcia , Pierre Colombo , Slim Essid , Florence d'Alché-Buc , Chloé Clavel
2019 Conference on Empirical Methods in Natural Language Processing, Nov 2019, Hong-Kong, China
Communication dans un congrès hal-02371140v1
Image document

EEG-BASED DECODING OF AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC

Giorgia Cantisani , Slim Essid , Gael Richard
2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, NY, United States
Communication dans un congrès hal-02291896v1
Image document

A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning

Magdalena Fuentes , Brian Mcfee , Helene-Camille Crayencour , Slim Essid , Juan P. Bello
2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), May 2019, Brighton, United Kingdom. ⟨10.1109/icassp.2019.8682870⟩
Communication dans un congrès hal-02420403v1
Image document

IDENTIFY, LOCATE AND SEPARATE: AUDIO-VISUAL OBJECT EXTRACTION IN LARGE VIDEO COLLECTIONS USING WEAK SUPERVISION

Sanjeel Parekh , Alexey Ozerov , Slim Essid , Ngoc Duong , Patrick Pérez
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, United States
Communication dans un congrès hal-02380780v1
Image document

Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events

Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Q K Duong , Patrick Pérez
CVPR Workshop, 2018, Salt Lake city, United States
Communication dans un congrès hal-02713307v1

Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields

Valentin Barriere , Chloé Clavel , Slim Essid
ICASSP 2018 - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2018, Calgary, Canada. pp.4949-4953, ⟨10.1109/ICASSP.2018.8462160⟩
Communication dans un congrès hal-02943469v1
Image document

An ensemble learning approach to detect epileptic seizures from long intracranial EEG recordings

Jean-Baptiste Schiratti , Jean-Eudes Le Douget , Michel Le van Quyen , Slim Essid , Alexandre Gramfort
ICASSP 2018 - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP, Apr 2018, Calgary, Canada
Communication dans un congrès hal-01724272v1
Image document

ANALYSIS OF COMMON DESIGN CHOICES IN DEEP LEARNING SYSTEMS FOR DOWNBEAT TRACKING

Magdalena Fuentes , Brian Mcfee , Hélène C Crayencour , Slim Essid , Juan P Bello
The 19th International Society for Music Information Retrieval Conference, Sep 2018, Paris, France
Communication dans un congrès hal-02943467v1

Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization

Ayoub Hajlaoui , Mohamed Chetouani , Slim Essid
2018 26th European Signal Processing Conference (EUSIPCO), Sep 2018, Rome, France. pp.91-95, ⟨10.23919/EUSIPCO.2018.8553390⟩
Communication dans un congrès hal-02422892v1
Image document

MAIN MELODY EXTRACTION WITH SOURCE-FILTER NMF AND CRNN

Dogac Basaran , Slim Essid , Geoffroy Peeters
19th International Society for Music Information Retreival, Sep 2018, Paris, France
Communication dans un congrès hal-02019103v1
Image document

Structured Output Learning with Abstention: Application to Accurate Opinion Prediction

Alexandre Garcia , Slim Essid , Chloé Clavel , Florence d'Alché-Buc
35th International Conference on Machine Learning18., Jul 2018, Stockholm, Sweden
Communication dans un congrès hal-01950907v1

Overlapping sound event detection with supervised Nonnegative Matrix Factorization

Victor Bisot , Slim Essid , Gael Richard
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, France. pp.31-35, ⟨10.1109/ICASSP.2017.7951792⟩
Communication dans un congrès hal-02713341v1

Nonnegative Matrix Factorisation for multimodal data analysis

Slim Essid
Dipartimento di Elettronica, Informazione e Bioingegeria (DEIB), Politecnico di Milano, Feb 2017, Milan, Italy
Communication dans un congrès hal-02288528v1
Image document

Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification

Romain Serizel , Victor Bisot , Slim Essid , Gael Richard
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Communication dans un congrès hal-01484744v1

UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions

Atef Ben-Youssef , Chloé Clavel , Slim Essid , Miriam Bilac , Marine Chamoux
the 19th ACM International Conference, Nov 2017, Glasgow, France. pp.464-472, ⟨10.1145/3136755.3136814⟩
Communication dans un congrès hal-02943475v1
Image document

Motion informed audio source separation

Sanjeel Parekh , Slim Essid , Alexey Ozerov , Ngoc Duong , Patrick Pérez
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States
Communication dans un congrès hal-01447977v1

Guiding Audio Source Separation by Video Object Information

Sanjeel Parekh , Slim Essid , Alexey Ozerov , Quang-Khanh-Ngoc Duong , Patrick Perez
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2017, New Paltz, New York, United States
Communication dans un congrès hal-02287698v1
Image document

Nonnegative Feature Learning Methods for Acoustic Scene Classification

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
DCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany
Communication dans un congrès hal-01636627v1

Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields

Valentin Barriere , Chloé Clavel , Slim Essid
Interspeech 2017, Aug 2017, Stockholm, Sweden
Communication dans un congrès hal-02287607v1

EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation

Anne-Claire Conneau , Ayoub Hajlaoui , Mohamed Chetouani , Slim Essid
2017 25th European Signal Processing Conference (EUSIPCO), Aug 2017, Kos, Greece. pp.738-742, ⟨10.23919/EUSIPCO.2017.8081305⟩
Communication dans un congrès hal-02422947v1

Matrix Co-Factorisation and Applications to Music Analysis

Slim Essid
Machine Learning for Music Discovery Workshop, International Conference on Machine Learning (ICML) 2017, Aug 2017, Sydney, Australia
Communication dans un congrès hal-02287881v1
Image document

Leveraging deep neural networks with nonnegative representations for improved environmental sound classification

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
IEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan
Communication dans un congrès hal-01576857v1

Research on Nonnegative Matrix Factorisation at Telecom ParisTech

Slim Essid
Spotify Research Seminar, Aug 2016, New York, United States
Communication dans un congrès hal-02288525v1
Image document

Machine listening techniques as a complement to video image analysis in forensics

Romain Serizel , Victor Bisot , Slim Essid , Gael Richard
IEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩
Communication dans un congrès hal-01393959v1

Très brève introduction au Machine Learning

Slim Essid
Conference debat du Corps des Mines, Jan 2016, Paris, France
Communication dans un congrès hal-02287867v1

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification

Romain Serizel , Slim Essid , Gael Richard
ICASSP, Mar 2016, Shangai, China. pp.5470 - 5474
Communication dans un congrès hal-02288453v1
Image document

Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification

Romain Serizel , Slim Essid , Gael Richard
IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China
Communication dans un congrès hal-01393968v1

Acoustic scene classification with matrix factorization for unsupervised feature learning

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
ICASSP, Mar 2016, Shangai, China
Communication dans un congrès hal-02287267v1

Downbeat Detection with Conditional Random Fields and Deep Learned Features

Simon Durand , Slim Essid
International Society for Music Information Retrieval (ISMIR), Aug 2016, New York City, United States. pp.386-392
Communication dans un congrès hal-02288480v1

Audio and Brain Research at Telecom ParisTech

Slim Essid
Hearing Seminar of the Center for Computer Research in Music and Acoustics (CCRMA), Stanford University, Sep 2016, Stanford, United States
Communication dans un congrès hal-02287866v1
Image document

Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence

Romain Serizel , Slim Essid , Gael Richard
IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerne, Italy
Communication dans un congrès hal-01393964v1
Image document

SUPERVISED NONNEGATIVE MATRIX FACTORIZATION FOR ACOUSTIC SCENE CLASSIFICATION

Victor Bisot , Romain Serizel , Slim Essid , Gael Richard
IEEE international evaluation campaign on detection and classification of acousitc scenes and events (DCASE 2016), Sep 2016, Budapest, Hungary
Communication dans un congrès hal-02943480v1

Hog and Subband power distribution image features for acoustic scene classification

Victor Bisot , Slim Essid , Gael Richard
EUSIPCO, Sep 2015, Nice, France. pp.719-723
Communication dans un congrès hal-02287266v1

A conditional random field system for beat tracking

Thomas Fillon , C. Joder , Simon Durand , Slim Essid
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia
Communication dans un congrès hal-02288433v1

Nonnegative matrix Factorisation for Audiovisual Document Analysis

Slim Essid
Seminaire Traitement du Langage Parle, LIMSI, 2015, Orsay, France
Communication dans un congrès hal-02287882v1

Introduction à la factorisation en matrices positives

Slim Essid
Journée Télécom-UPS "Le numérique pour tous", May 2015, Paris, France
Communication dans un congrès hal-02287868v1
Image document

MELODY EXTRACTION BY CONTOUR CLASSIFICATION

Rachel M Bittner , Justin Salamon , Slim Essid , Juan P Bello
International Conference on Music Information Retrieval (ISMIR), Sep 2015, Malaga, Spain
Communication dans un congrès hal-02943532v1
Image document

Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes

Aymeric Masurelle , Slim Essid , Gael Richard
2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 14), May 2014, Florence, Italy
Communication dans un congrès hal-00990252v1

A tutorial on Nonnegative Matrix Factorisation with applications to audiovisual content analysis

Slim Essid , Alexey Ozerov
Tutorial at ICME 2014, Jul 2014, Chengdu, China
Communication dans un congrès hal-02287869v1

Assessment of new spectral features for eeg-based emotion recognition.

Anne-Claire Conneau , Slim Essid
International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy
Communication dans un congrès hal-02287334v1

Piecewise constant nonnegative matrix factorization

N. Seichepine , Slim Essid , C. Fevotte , O. Cappe
ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2014, Florence, France. pp.6721-6725, ⟨10.1109/ICASSP.2014.6854901⟩
Communication dans un congrès hal-02943536v1
Image document

Exploring new features for music classification

Rémi Foucard , Slim Essid , Gael Richard , Mathieu Lagrange
WIAMIS, Jul 2013, Paris, France. ⟨10.1109/WIAMIS.2013.6616154⟩
Communication dans un congrès hal-01126767v1

Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization

N. Seichepine , Slim Essid , C. Fevotte , O. Cappe
ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2013, Vancouver, France. pp.3537-3541, ⟨10.1109/ICASSP.2013.6638316⟩
Communication dans un congrès hal-02943543v1

Multimodal Signal Analysis at Telecom ParisTech

Slim Essid
Seminaire scienti\unmatchedfb01que de Technicolor R&D, Dec 2013, Rennes, France
Communication dans un congrès hal-02288526v1
Image document

Non-negative Tensor Factorization for Single-Channel EEG Artifact Rejection

Cécilia Damon , Antoine Liutkus , Alexandre Gramfort , Slim Essid
MLSP, Sep 2013, Southampton, United Kingdom. ⟨10.1109/MLSP.2013.6661983⟩
Communication dans un congrès hal-00959103v1

Co-factorisation douce en matrices non-négatives. Application au regroupement multimodal de locuteurs

Nicolas Seichepine , Slim Essid , Cédric Févotte , Olivier Cappé
GRETSI, Sep 2013, Brest, France
Communication dans un congrès hal-02286798v1
Image document

MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS

Aymeric Masurelle , Slim Essid , Gael Richard
International Workshop on Image and Audio Analysis for Multimedia Interactive Services WIAMIS, Nov 2013, Paris, France. pp.1-4, ⟨10.1109/WIAMIS.2013.6616151⟩
Communication dans un congrès hal-00904461v1

Etiquetage automatique de l'audio : une approche de boosting régressif basée sur une fusion souple d'annotateurs

Rémi Foucard , Slim Essid , Mathieu Lagrange , Gael Richard
Coresa, 2013, NA, France
Communication dans un congrès hal-01106754v1

Probabilistic dance performance alignment by fusion of multimodal features

Angelique Dremeau , Slim Essid
IEEE Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP), May 2013, Vancouver, Canada
Communication dans un congrès hal-02288353v1
Image document

Non-negative matrix factorization for single-channel EEG artifact rejection

Cécilia Damon , Antoine Liutkus , Alexandre Gramfort , Slim Essid
ICASSP, 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6637836⟩
Communication dans un congrès hal-00958775v1

Nonnegative Tensor Factorization for Single-Channel EEG Artifact Rejection

Cécilia Damon , Antoine Liutkus , Alexandre Gramfort , Slim Essid
IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom
Communication dans un congrès hal-02288386v1

Analysis of dance movements using gaussian processes

Antoine Liutkus , Angélique Drémeau , Dimitrios Alexiadis , Slim Essid , Petros Daras
the 20th ACM international conference, Oct 2012, Nara, France. pp.1375, ⟨10.1145/2393347.2396492⟩
Communication dans un congrès hal-02943555v1
Image document

A regressive boosting approach to automatic audio tagging based on soft annotator fusion

Rémi Foucard , Slim Essid , Mathieu Lagrange , Gael Richard
IEEE ICASSP, Mar 2012, Kyoto, Japan. ⟨10.1109/ICASSP.2012.6287820⟩
Communication dans un congrès hal-01132529v1

Decomposing the video editing structure of a talk-show using nonnegative matrix factorization

Slim Essid , C. Fevotte
2012 19th IEEE International Conference on Image Processing (ICIP 2012), Sep 2012, Orlando, France. pp.3105-3108, ⟨10.1109/ICIP.2012.6467557⟩
Communication dans un congrès hal-02943553v1

A SINGLE-CLASS SVM BASED ALGORITHM FOR COMPUTING AN IDENTIFIABLE NMF

Slim Essid
IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2012, Kyoto, Japan
Communication dans un congrès hal-02278688v1

AN ADVANCED VIRTUAL DANCE PERFORMANCE EVALUATOR

Slim Essid , Dimitrios Alexiadis , Robin Tournemenne , Marc Gowing , Philip Kelly
IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2012, Kyoto, Japan
Communication dans un congrès hal-02288313v1

A multimodal dance corpus for research into real-time interaction between humans in online virtual environments

Slim Essid , Xinyu Lin , Marc Gowing , Georgios Kordelas , Anil Aksay
ICMI WORKSHOP ON MULTIMODAL CORPORA FOR MACHINE LEARNING, Nov 2011, Alicante, Spain
Communication dans un congrès hal-02278689v1

Multi-scale temporal fusion by boosting for music classification

Rémi Foucard , Slim Essid , Mathieu Lagrange , Gael Richard
ISMIR, 2011, Miami, United States. pp.663-668
Communication dans un congrès hal-00639097v1

Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation

Sébastien Gulluni , Slim Essid , Olivier Buisson , Gael Richard
AES Conference, 2011, Ilmenau, Germany
Communication dans un congrès hal-02713989v1

Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment

Cyril Joder , Slim Essid , Gael Richard
ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, France. pp.397-400, ⟨10.1109/ICASSP.2011.5946424⟩
Communication dans un congrès hal-02714059v1

Enhanced visualisation of dance performance from automatically synchronised multimodal recordings

Marc Gowing , Xinyu Lin , Qianni Zhang , Philip Kell , Noel O'Connor
The 19th ACM international conference, Nov 2011, Scottsdale, France. pp.667, ⟨10.1145/2072298.2072414⟩
Communication dans un congrès hal-02943617v1

Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment

Cyril Joder , Slim Essid , Gael Richard
2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2011, New Paltz, France. pp.121-124, ⟨10.1109/ASPAA.2011.6082330⟩
Communication dans un congrès hal-02943613v1

An audio-driven virtual dance-teaching assistant

Slim Essid , Yves Grenier , Mounira Maazaoui , Gael Richard , Robin Tournemenne
the 19th ACM international conference, Nov 2011, Scottsdale, France. pp.675, ⟨10.1145/2072298.2072416⟩
Communication dans un congrès hal-02713825v1

AN INTERACTIVE SYSTEM FOR ELECTRO-ACOUSTIC MUSIC ANALYSIS

Sébastien Gulluni , Slim Essid , Olivier Buisson , Gael Richard
ISMIR, 2011, Miami, United States
Communication dans un congrès hal-02713906v1

A comparative study of tonal acoustic features for a symbolic level music-to-score alignment

Cyril Joder , Slim Essid , Gael Richard
2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Mar 2010, Dallas, France. pp.409-412, ⟨10.1109/ICASSP.2010.5495784⟩
Communication dans un congrès hal-02747785v1

Approche hiérarchique pour un alignement musique-sur-partition efficace

Cyril Joder , Slim Essid , Gael Richard
Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France
Communication dans un congrès hal-02943620v1

A MULTIMODAL APPROACH TO INITIALISATION FOR TOP-DOWN SPEAKER DIARIZATION OF TELEVISION SHOWS

Simon Bozonnet , Félicien Vallet , Nicholas Evans , Slim Essid , Gael Richard
Eusipco, 2010, aalborg, Denmark
Communication dans un congrès hal-02747730v1

Descripteurs visuels robustes pour l'identification de locuteurs dans des émissions televisées de talk-shows

Vallet Félicien , Slim Essid , Jean Carrive , Gaël Richard
Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France
Communication dans un congrès hal-02943621v1

YAAFE, AN EASY TO USE AND EFFICIENT AUDIO FEATURE EXTRACTION SOFTWARE

Benoît Mathieu , Slim Essid , Thomas Fillon , Jacques Prado , Gael Richard
ISMIR, 2010, Utrecht, Netherlands
Communication dans un congrès hal-02747689v1

Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows

Félicien Vallet , Slim Essid , Jean Carrive , Gael Richard
2010 17th IEEE International Conference on Image Processing (ICIP 2010), Sep 2010, Hong Kong, France. pp.1469-1472, ⟨10.1109/ICIP.2010.5653393⟩
Communication dans un congrès hal-02747558v1

A conditional random field viewpoint of symbolic audio-to-score matching

Cyril Joder , Slim Essid , Gael Richard
the international conference, Oct 2010, Firenze, France. pp.871, ⟨10.1145/1873951.1874100⟩
Communication dans un congrès hal-02747590v1

AN IMPROVED HIERARCHICAL APPROACH FOR MUSIC-TO-SYMBOLIC SCORE ALIGNMENT

Cyril Joder , Slim Essid , Gael Richard
ISMIR, 2010, Utrecht, Netherlands
Communication dans un congrès hal-02747659v1

Incorporating prior knowledge on the digital media creation process into audio classifiers

M. Lardeur , S. Essid , Gael Richard , M. Haller , T. Sikora
ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, France. pp.1653-1656, ⟨10.1109/ICASSP.2009.4959918⟩
Communication dans un congrès hal-03117240v1

Incorporating prior knowledge on the digital media creation process into audio classifiers

M. Lardeur , Slim Essid , Guy Richard , M. Haller , T. Sikora
ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, France. pp.1653-1656, ⟨10.1109/ICASSP.2009.4959918⟩
Communication dans un congrès hal-02943669v1
Image document

Interactive Segmentation of Electro-Acoustic Music

Sébastien Gulluni , Slim Essid , Olivier Buisson , Gael Richard
2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD), Sep 2009, Bled, Slovenia
Communication dans un congrès hal-02943665v1
Image document

Étude des descripteurs acoustiques pour l'alignement temporel audio-sur-partition musicale

Cyril Joder , Slim Essid , Gaël Richard
GRETSI, Sep 2009, Dijon, France
Communication dans un congrès hal-02943624v1

Étude des descripteurs acoustiques pour l'alignement temporel audio-sur-partition musicale

Cyril Joder , Slim Essid , Gael Richard
Colloque GRETSI, 2009, dijon, France
Communication dans un congrès hal-03117111v1
Image document

ALIGNMENT KERNELS FOR AUDIO CLASSIFICATION WITH APPLICATION TO MUSIC INSTRUMENT RECOGNITION

Cyril Joder , Slim Essid , Gaël Richard
16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland
Communication dans un congrès hal-02943674v1
Image document

ON THE ROBUSTNESS OF AUDIO FEATURES FOR MUSICAL INSTRUMENT CLASSIFICATION

S Wegener , M Haller , J J Burred , T Sikora , Slim Essid
16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland
Communication dans un congrès hal-02943672v1

Rushes Video Summarization using a Collaborative Approach

Emilie Dumont , Bernard Mérialdo , Slim Essid , Werner Bailer , Herwig Rehatschek
TRECVID 2008, ACM International Conference on Multimedia Information Retrieval, 2008, Vancouver, Canada
Communication dans un congrès hal-01987824v1

A Collaborative Approach to Video Summarization

Emilie Dumont , Bernard Mérialdo , Slim Essid , Werner Bailer , Daragh Byrne
SAMT 2008, 3rd International Conference on Semantic and Digital Media Technologies, 2008, Koblenz, Germany
Communication dans un congrès hal-01987822v1
Image document

TOWARDS POLYPHONIC MUSICAL INSTRUMENTS RECOGNITION

Gael Richard , Pierre Leveau , Laurent Daudet , Slim Essid , Bertrand David
19th INTERNATIONAL CONGRESS ON ACOUSTICS, Sep 2007, Madrid, Spain
Communication dans un congrès hal-02943678v1

Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams

Gael Richard , Mathieu Ramona , Slim Essid
2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2007, Honolulu, France. pp.II-461-II-464, ⟨10.1109/ICASSP.2007.366272⟩
Communication dans un congrès hal-02943676v1
Image document

On the usefulness of differentiated transient/steady-state processing in machine recognition of musical instruments

Slim Essid , Pierre Leveau , Gael Richard , Laurent Daudet , Bertrand David
AES 118th convention, May 2005, Barcelona, Spain
Communication dans un congrès hal-02946881v1

Instrument recognition in polyphonic music

Slim Essid , Guy Richard , B. David
(ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., Mar 2005, Philadelphia, United States. pp.245-248, ⟨10.1109/ICASSP.2005.1415692⟩
Communication dans un congrès hal-02946873v1
Image document

MUSICAL INSTRUMENT RECOGNITION BASED ON CLASS PAIRWISE FEATURE SELECTION

Slim Essid , Gael Richard , Bertrand David
International Conference on Music Information Retrieval (ISMIR), Oct 2004, Barcelona, Spain
Communication dans un congrès hal-02946907v1
Image document

MUSICAL INSTRUMENT RECOGNITION ON SOLO PERFORMANCES

Slim Essid , Gaël Richard , Bertrand David
European Signal Processing Conference (EUSIPCO, Sep 2004, Vienna, Austria
Communication dans un congrès hal-02946903v1
Image document

Efficient musical instrument recognition on solo performance music using basic features

Slim Essid , Gael Richard , Bertrand David
AES 25th conference, Jun 2004, London, United Kingdom
Communication dans un congrès hal-02946911v1
Image document

Modèles Sinusoïdaux Étendus pour le Codage Audio

Remy Boyer , Slim Essid , Karim Abed-Meraim , Nicolas Moreau
Dix-neuvième colloque sur le Traitement du Signal et des Images, Sep 2003, Paris, France
Communication dans un congrès hal-02946917v1

Transient modeling with a Frequency-Transform Subspace Algorithm and "Transient + Sinusoidal" scheme

Remy Boyer , Slim Essid
IEEE Conference on Digital Signal Processing (DSP), Jul 2002, Santorini, Greece
Communication dans un congrès hal-01251630v1

Non-stationary modeling techniques adapted to low bitrate audio coding

Remy Boyer , Slim Essid , Nicolas Moreau
Int. Conf. on Signal Processing (ICSP), Aug 2002, Beijing, China
Communication dans un congrès hal-01251615v1

Dynamic temporal segmentation in parametric non-stationary modeling for percussive musical signals

Remy Boyer , Slim Essid , Nicolas Moreau
IEEE International Conference on Multimedia and Expo (ICME), Aug 2002, Lausane, Switzerland
Communication dans un congrès hal-01251622v1

Exploration de techniques modernes de modélisation adaptées à du codage audio bas-débit

Remy Boyer , Slim Essid , Nicolas Moreau
7èmes Journées d'Etudes et d'Echanges : Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2001, Dijon, France
Communication dans un congrès hal-02946929v1
Image document

Multimodal Music Recording Remastering

Giorgia Cantisani , Slim Essid , Gael Richard
DMRN+13: Digital Music Research Network One-day Workshop 2018, Dec 2018, London, United Kingdom
Poster de conférence hal-03187638v1

Multiview Approaches to Event Detection and Scene Analysis

Slim Essid , Sanjeel Parekh , Ngoc Q. K. Duong , Romain Serizel , Alexey Ozerov
Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, 2017
Chapitre d'ouvrage hal-02287697v1
Image document

Multiview approaches to event detection and scene analysis

Slim Essid , Sanjeel Parekh , Ngoc Q. K. Duong , Romain Serizel , Alexey Ozerov
Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩
Chapitre d'ouvrage hal-01620341v1
Image document

Acoustic Features for Environmental Sound Analysis

Romain Serizel , Victor Bisot , Slim Essid , Gael Richard
Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩
Chapitre d'ouvrage hal-01575619v1

Fusion of Multimodal Information in Music Content Analysis

Slim Essid , Gael Richard
Multimodal Music Processing, Dagstuhl Follow-Ups,, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik,, 2012
Chapitre d'ouvrage hal-02653102v1

Feature Extraction for Multimedia Analysis

Rachid Benmokhtar , Huet Benoit , Gael Richard , Thierry Declerck , Slim Essid
Multimedia Semantics: Metadata, Analysis and Interaction, Wiley, 2011
Chapitre d'ouvrage hal-02653016v1

Traitement des modalites "audio" et "parole"

Gilles Adda , Gérard Chollet , Slim Essid , Thomas Fillon , Martine Garnier-Rizet
Marine Campedel et Pierre Hoogstel. Sémantique et multimodalité en analyse de l'information, Hermes/Lavoisier, 2011, 978-2-7462-3139-9
Chapitre d'ouvrage hal-02943616v1

Machine Learning Techniques for Multimedia Analysis

Slim Essid , Marine Campedel , Gael Richard , Tomas Piatrik , Rachid Benmokhtar
Multimedia Semantics: Metadata, Analysis and Interaction, 2011
Chapitre d'ouvrage hal-02943615v1

High-level TV talk show structuring centered on speakers' interventions

Félicien Vallet , Slim Essid , Jean Carrive , Gael Richard
TV Content Analysis: Techniques and Applications, CRC Press, Taylor Francis LLC, 2011
Chapitre d'ouvrage hal-02653090v1
Image document

Classification automatique des signaux audio-fréquences : reconnaissance des instruments de musique

Slim Essid
Traitement du signal et de l'image [eess.SP]. Université Pierre et Marie Curie - Paris VI, 2005. Français. ⟨NNT : ⟩
Thèse pastel-00002738v1