Skip to Main content

Co-authors

Export Publications

Export the displayed publications:
Number of documents

127

Slim Essid


Journal articles17 documents

  • Zhiyao Duan, Slim Essid, Cynthia Liem, Gael Richard, Gaurav Sharma. Audiovisual Analysis of Music Performances: Overview of an Emerging Field. IEEE Signal Processing magazine, 2019, 36 (1), pp.63-73. ⟨hal-02287983⟩
  • Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Duong, Patrick Pérez, et al.. Weakly Supervised Representation Learning for Audio-Visual Scene Analysis. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2019. ⟨hal-02399993⟩
  • Atef Ben Youssef, Giovanna Varni, Slim Essid, Chloé Clavel. On-the-fly Detection of User Engagement Decrease in Spontaneous Human-Robot Interaction. International Journal of Social Robotics, 2019. ⟨hal-02288044⟩
  • Atef Ben Youssef, Chloé Clavel, Slim Essid. Early Detection of User Engagement Breakdown in Spontaneous Human-Humanoid Interaction. IEEE Transactions on Affective Computing , 2019. ⟨hal-02288043⟩
  • Zhiyao Duan, Slim Essid, Cynthia Liem, Gael Richard, Gaurav Sharma. Audio-Visual Analysis of Music Performances. IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, In press. ⟨hal-01893410⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Feature Learning with Matrix Factorization Applied to Acoustic Scene Classification. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2017, 25 (6), pp.1216 - 1229. ⟨10.1109/TASLP.2017.2690570⟩. ⟨hal-01362864v2⟩
  • Aymeric Masurelle, Ahmed Sekkat, Slim Essid, Gael Richard. TPT-Dance&Actions : un corpus multimodal d'activités humaines. Revue Traitement du Signal (Presse universitaire de Grenoble), 2015, ⟨10.3166/TS.32.443-475⟩. ⟨hal-02704820⟩
  • Nicolas Seichepine, Slim Essid, Cédric Févotte, Olivier Cappé. Soft Nonnegative Matrix Co-Factorization. IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, 2014, 62 (22), pp.5940-5949. ⟨10.1109/TSP.2014.2360141⟩. ⟨hal-01116863⟩
  • Cyril Joder, Slim Essid, Gael Richard. Learning Optimal Features for Polyphonic Audio-to-Score Alignment. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2013. ⟨hal-02704714⟩
  • Félicien Vallet, Slim Essid, Jean Carrive. A Multimodal Approach to Speaker Diarization on TV Talk-Shows. IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2013, 15 (3), pp.509-520. ⟨10.1109/TMM.2012.2233724⟩. ⟨hal-02943545⟩
  • Slim Essid, Cédric Févotte. Smooth Nonnegative Matrix Factorization for Unsupervised Audiovisual Document Structuring. IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2013, 15 (2), pp.415-425. ⟨10.1109/TMM.2012.2228474⟩. ⟨hal-02943541⟩
  • Slim Essid, Marc Gowing, Georgios Kordelas, Anil Aksay, P. Kelly, et al.. A multi-modal dance corpus for research into interaction between humans in virtual environments. Journal on Multimodal User Interfaces, Springer, 2012, pp.1-14. ⟨10.1007/s12193-012-0109-5⟩. ⟨hal-02286487⟩
  • Cyril Joder, Slim Essid, Gael Richard. A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2011. ⟨hal-02653026⟩
  • Cyril Joder, Slim Essid, Gael Richard. Temporal Integration for Audio Classification With Application to Musical Instrument Classification. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2009, 17, ⟨10.1109/TASL.2008.2007613⟩. ⟨hal-02652782⟩
  • Olivier Gillet, Slim Essid, Gael Richard. On the Correlation of Automatic Audio and Visual Segmentations of Music Videos. IEEE Transactions on Circuits and Systems for Video Technology, Institute of Electrical and Electronics Engineers, 2007, 17, ⟨10.1109/TCSVT.2007.890831⟩. ⟨hal-02652635⟩
  • Gael Richard, Slim Essid, Bertrand David. Musical instrument recognition by pairwise classification strategies. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006, 14 (4), pp.1401- 1412. ⟨10.1109/TSA.2005.860842⟩. ⟨hal-00477671⟩
  • Slim Essid, Gael Richard, Bertrand David. Instrument recognition in polyphonic music based on automatic taxonomies. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006, 14 (1), pp.68-80. ⟨10.1109/TSA.2005.860351⟩. ⟨hal-00477670⟩

Conference papers96 documents

  • Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid. DNN-Based Distributed Multichannel Mask Estimation for Speech Enhancement in Microphone Arrays. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02389159v3⟩
  • Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Duong, Patrick Pérez, et al.. IDENTIFY, LOCATE AND SEPARATE: AUDIO-VISUAL OBJECT EXTRACTION IN LARGE VIDEO COLLECTIONS USING WEAK SUPERVISION. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, United States. ⟨hal-02380780⟩
  • Giorgia Cantisani, Slim Essid, Gael Richard. EEG-BASED DECODING OF AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC. 2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, NY, United States. ⟨hal-02291896⟩
  • Giorgia Cantisani, Gabriel Trégoat, Slim Essid, Gael Richard. MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music. Speech, Music and Mind (SMM), Satellite Workshop of Interspeech 2019, Sep 2019, Vienna, Austria. ⟨hal-02291882⟩
  • Magdalena Fuentes, Brian Mcfee, Helene-Camille Crayencour, Slim Essid, Juan Bello. A Music Structure Informed Downbeat Tracking System Using Skip-chain Conditional Random Fields and Deep Learning. ICASSP, May 2019, Brighton, United Kingdom. ⟨10.1109/icassp.2019.8682870⟩. ⟨hal-02420403⟩
  • Alexandre Garcia, Pierre Colombo, Slim Essid, Florence d'Alché-Buc, Chloe Clavel. From the Token to the Review: A Hierarchical Multimodal approach to Opinion Mining. 2019 Conference on Empirical Methods in Natural Language Processing, Nov 2019, Hong-Kong, China. ⟨hal-02371140⟩
  • Lucas Maia, Magdalena Fuentes, Luiz Biscainho, Martín Rocamora, Slim Essid. SAMBASET: A DATASET OF HISTORICAL SAMBA DE ENREDO RECORDINGS FOR COMPUTATIONAL MUSIC ANALYSIS. The 20th International Society for Music Information Retrieval Conference, Nov 2019, Delft, Netherlands. ⟨hal-02943462⟩
  • Magdalena Fuentes, Lucas Maia, Martín Rocamora, Luiz Biscainho, Hélène Crayencour, et al.. TRACKING BEATS AND MICROTIMING IN AFRO-LATIN AMERICAN MUSIC USING CONDITIONAL RANDOM FIELDS AND DEEP LEARNING. ISMIR, Nov 2019, Delft, Netherlands. ⟨hal-02419361⟩
  • Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid. Multi-task Feature Learning for EEG-based Emotion Recognition Using Group Nonnegative Matrix Factorization. 2018 26th European Signal Processing Conference (EUSIPCO), Sep 2018, Rome, France. pp.91-95, ⟨10.23919/EUSIPCO.2018.8553390⟩. ⟨hal-02422892⟩
  • Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Duong, Patrick Pérez, et al.. Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. CVPR Workshop, 2018, Salt Lake city, United States. ⟨hal-02713307⟩
  • Alexandre Garcia, Slim Essid, Chloé Clavel, Florence D'alché-Buc. Structured Output Learning with Abstention: Application to Accurate Opinion Prediction. 35th International Conference on Machine Learning18., Jul 2018, Stockholm, Sweden. ⟨hal-01950907⟩
  • Valentin Barriere, Chloe Clavel, Slim Essid. Attitude Classification in Adjacency Pairs of a Human-Agent Interaction with Hidden Conditional Random Fields. ICASSP 2018 - 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2018, Calgary, Canada. pp.4949-4953, ⟨10.1109/ICASSP.2018.8462160⟩. ⟨hal-02943469⟩
  • Magdalena Fuentes, Brian Mcfee, Hélène Crayencour, Slim Essid, Juan Bello. ANALYSIS OF COMMON DESIGN CHOICES IN DEEP LEARNING SYSTEMS FOR DOWNBEAT TRACKING. The 19th International Society for Music Information Retrieval Conference, Sep 2018, Paris, France. ⟨hal-02943467⟩
  • Jean-Baptiste Schiratti, Jean-Eudes Le Douget, Michel Le van Quyen, Slim Essid, Alexandre Gramfort. An ensemble learning approach to detect epileptic seizures from long intracranial EEG recordings. International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. ⟨hal-01724272⟩
  • Dogac Basaran, Slim Essid, Geoffroy Peeters. MAIN MELODY EXTRACTION WITH SOURCE-FILTER NMF AND CRNN. 19th International Society for Music Information Retreival, Sep 2018, Paris, France. ⟨hal-02019103⟩
  • Anne-Claire Conneau, Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid. EMOEEG: A new multimodal dataset for dynamic EEG-based emotion recognition with audiovisual elicitation. 2017 25th European Signal Processing Conference (EUSIPCO), Aug 2017, Kos, Greece. pp.738-742, ⟨10.23919/EUSIPCO.2017.8081305⟩. ⟨hal-02422947⟩
  • Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Duong, Patrick Pérez, et al.. Motion informed audio source separation. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States. ⟨hal-01447977⟩
  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States. ⟨hal-01484744⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Leveraging deep neural networks with nonnegative representations for improved environmental sound classification. IEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan. ⟨hal-01576857⟩
  • Victor Bisot, Slim Essid, Gael Richard. Overlapping sound event detection with supervised Nonnegative Matrix Factorization. 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, France. pp.31-35, ⟨10.1109/ICASSP.2017.7951792⟩. ⟨hal-02713341⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Nonnegative Feature Learning Methods for Acoustic Scene Classification. DCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany. ⟨hal-01636627⟩
  • Sanjeel Parekh, Slim Essid, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Patrick Perez, et al.. Guiding Audio Source Separation by Video Object Information. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2017, New Paltz, New York, United States. ⟨hal-02287698⟩
  • Anne-Claire Conneau, Ayoub Hajlaoui, Mohamed Chetouani, Slim Essid. EMOEEG: a New Multimodal Dataset for Dynamic EEG-based Emotion Recognition with Audiovisual Elicitation. The European Signal Processing Conference (EUSIPCO), 2017, Kos island, Greece. ⟨hal-02288498⟩
  • Slim Essid. Nonnegative Matrix Factorisation for multimodal data analysis. Dipartimento di Elettronica, Informazione e Bioingegeria (DEIB), Politecnico di Milano, Feb 2017, Milan, Italy. ⟨hal-02288528⟩
  • Slim Essid. Matrix Co-Factorisation and Applications to Music Analysis. Machine Learning for Music Discovery Workshop, International Conference on Machine Learning (ICML) 2017, Aug 2017, Sydney, Australia. ⟨hal-02287881⟩
  • Valentin Barriere, Chloé Clavel, Slim Essid. Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields. Interspeech 2017, Aug 2017, Stockholm, Sweden. ⟨hal-02287607⟩
  • Atef Ben-Youssef, Chloé Clavel, Slim Essid, Miriam Bilac, Marine Chamoux, et al.. UE-HRI: a new dataset for the study of user engagement in spontaneous human-robot interactions. the 19th ACM International Conference, Nov 2017, Glasgow, France. pp.464-472, ⟨10.1145/3136755.3136814⟩. ⟨hal-02943475⟩
  • Romain Serizel, Slim Essid, Gael Richard. Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerne, Italy. ⟨hal-01393964⟩
  • Romain Serizel, Slim Essid, Gael Richard. Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker Identification. IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China. ⟨hal-01393968⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. SUPERVISED NONNEGATIVE MATRIX FACTORIZATION FOR ACOUSTIC SCENE CLASSIFICATION. IEEE international evaluation campaign on detection and classification of acousitc scenes and events (DCASE 2016), Sep 2016, Budapest, Hungary. ⟨hal-02943480⟩
  • Romain Serizel, Slim Essid, Gael Richard. Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification. ICASSP, Mar 2016, Shangai, China. pp.5470 - 5474. ⟨hal-02288453⟩
  • Victor Bisot, Romain Serizel, Slim Essid, Gael Richard. Acoustic scene classification with matrix factorization for unsupervised feature learning. ICASSP, Mar 2016, Shangai, China. ⟨hal-02287267⟩
  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Machine listening techniques as a complement to video image analysis in forensics. IEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩. ⟨hal-01393959⟩
  • Simon Durand, Slim Essid. Downbeat Detection with Conditional Random Fields and Deep Learned Features. International Society for Music Information Retrieval (ISMIR), Aug 2016, New York City, United States. pp.386-392. ⟨hal-02288480⟩
  • Slim Essid. Très brève introduction au Machine Learning. Conference debat du Corps des Mines, Jan 2016, Paris, France. ⟨hal-02287867⟩
  • Slim Essid. Research on Nonnegative Matrix Factorisation at Telecom ParisTech. Spotify Research Seminar, Aug 2016, New York, United States. ⟨hal-02288525⟩
  • Slim Essid. Audio and Brain Research at Telecom ParisTech. Hearing Seminar of the Center for Computer Research in Music and Acoustics (CCRMA), Stanford University, Sep 2016, Stanford, United States. ⟨hal-02287866⟩
  • Victor Bisot, Slim Essid, Gael Richard. Hog and Subband power distribution image features for acoustic scene classification. EUSIPCO, Sep 2015, Nice, France. pp.719-723. ⟨hal-02287266⟩
  • Rachel Bittner, Justin Salamon, Slim Essid, Juan Bello. MELODY EXTRACTION BY CONTOUR CLASSIFICATION. International Conference on Music Information Retrieval (ISMIR), Sep 2015, Malaga, Spain. ⟨hal-02943532⟩
  • Slim Essid. Nonnegative matrix Factorisation for Audiovisual Document Analysis. Seminaire Traitement du Langage Parle, LIMSI, 2015, Orsay, France. ⟨hal-02287882⟩
  • Slim Essid. Introduction à la factorisation en matrices positives. Journée Télécom-UPS "Le numérique pour tous", May 2015, Paris, France. ⟨hal-02287868⟩
  • Thomas Fillon, C. Joder, Simon Durand, Slim Essid. A conditional random field system for beat tracking. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2015, Brisbane, Australia. ⟨hal-02288433⟩
  • Aymeric Masurelle, Slim Essid, Gael Richard. Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes. 2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 14), May 2014, Florence, Italy. ⟨hal-00990252⟩
  • Slim Essid, Alexey Ozerov. A tutorial on Nonnegative Matrix Factorisation with applications to audiovisual content analysis. Tutorial at ICME 2014, Jul 2014, Chengdu, China. ⟨hal-02287869⟩
  • Anne-Claire Conneau, Slim Essid. Assessment of new spectral features for eeg-based emotion recognition.. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), May 2014, Florence, Italy. ⟨hal-02287334⟩
  • N. Seichepine, Slim Essid, C. Fevotte, O. Cappe. Piecewise constant nonnegative matrix factorization. ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2014, Florence, France. pp.6721-6725, ⟨10.1109/ICASSP.2014.6854901⟩. ⟨hal-02943536⟩
  • Rémi Foucard, Slim Essid, Mathieu Lagrange, Gael Richard. Etiquetage automatique de l'audio : une approche de boosting régressif basée sur une fusion souple d'annotateurs. Coresa, 2013, NA, France. ⟨hal-01106754⟩
  • Aymeric Masurelle, Slim Essid, Gael Richard. MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDS. International Workshop on Image and Audio Analysis for Multimedia Interactive Services WIAMIS, Nov 2013, Paris, France. pp.1-4, ⟨10.1109/WIAMIS.2013.6616151⟩. ⟨hal-00904461⟩
  • Rémi Foucard, Slim Essid, Gael Richard, Mathieu Lagrange. Exploring new features for music classification. WIAMIS, Jul 2013, Paris, France. ⟨10.1109/WIAMIS.2013.6616154⟩. ⟨hal-01126767⟩
  • Slim Essid. Multimodal Signal Analysis at Telecom ParisTech. Seminaire scienti\unmatchedfb01que de Technicolor R&D, Dec 2013, Rennes, France. ⟨hal-02288526⟩
  • Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid. Nonnegative Tensor Factorization for Single-Channel EEG Artifact Rejection. IEEE International Workshop on Machine Learning for Signal Processing, Sep 2013, Southampton, United Kingdom. ⟨hal-02288386⟩
  • N. Seichepine, Slim Essid, C. Fevotte, O. Cappe. Soft nonnegative matrix co-factorizationwith application to multimodal speaker diarization. ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2013, Vancouver, France. pp.3537-3541, ⟨10.1109/ICASSP.2013.6638316⟩. ⟨hal-02943543⟩
  • Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid. Non-negative Tensor Factorization for Single-Channel EEG Artifact Rejection. MLSP, Sep 2013, Southampton, United Kingdom. ⟨10.1109/MLSP.2013.6661983⟩. ⟨hal-00959103⟩
  • Cécilia Damon, Antoine Liutkus, Alexandre Gramfort, Slim Essid. Non-negative matrix factorization for single-channel EEG artifact rejection. ICASSP, 2013, Vancouver, Canada. ⟨10.1109/ICASSP.2013.6637836⟩. ⟨hal-00958775⟩
  • Angelique Dremeau, Slim Essid. Probabilistic dance performance alignment by fusion of multimodal features. IEEE Int’l Conf. on Acoustics, Speech and Signal Processing (ICASSP), May 2013, Vancouver, Canada. ⟨hal-02288353⟩
  • Nicolas Seichepine, Slim Essid, Cédric Févotte, Olivier Cappé. Co-factorisation douce en matrices non-négatives. Application au regroupement multimodal de locuteurs. GRETSI, Sep 2013, Brest, France. ⟨hal-02286798⟩
  • Rémi Foucard, Slim Essid, Mathieu Lagrange, Gael Richard. A regressive boosting approach to automatic audio tagging based on soft annotator fusion. IEEE ICASSP, Mar 2012, Kyoto, Japan. ⟨10.1109/ICASSP.2012.6287820⟩. ⟨hal-01132529⟩
  • Antoine Liutkus, Angélique Drémeau, Dimitrios Alexiadis, Slim Essid, Petros Daras. Analysis of dance movements using gaussian processes. the 20th ACM international conference, Oct 2012, Nara, France. pp.1375, ⟨10.1145/2393347.2396492⟩. ⟨hal-02943555⟩
  • Slim Essid. A SINGLE-CLASS SVM BASED ALGORITHM FOR COMPUTING AN IDENTIFIABLE NMF. IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2012, Kyoto, Japan. ⟨hal-02278688⟩
  • Slim Essid, C. Fevotte. Decomposing the video editing structure of a talk-show using nonnegative matrix factorization. 2012 19th IEEE International Conference on Image Processing (ICIP 2012), Sep 2012, Orlando, France. pp.3105-3108, ⟨10.1109/ICIP.2012.6467557⟩. ⟨hal-02943553⟩
  • Slim Essid, Dimitrios Alexiadis, Robin Tournemenne, Marc Gowing, Philip Kelly, et al.. AN ADVANCED VIRTUAL DANCE PERFORMANCE EVALUATOR. IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2012, Kyoto, Japan. ⟨hal-02288313⟩
  • Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard. AN INTERACTIVE SYSTEM FOR ELECTRO-ACOUSTIC MUSIC ANALYSIS. ISMIR, 2011, Miami, United States. ⟨hal-02713906⟩
  • Rémi Foucard, Slim Essid, Mathieu Lagrange, Gael Richard. Multi-scale temporal fusion by boosting for music classification. ISMIR, 2011, Miami, United States. pp.663-668. ⟨hal-00639097⟩
  • Slim Essid, Yves Grenier, Mounira Maazaoui, Gael Richard, Robin Tournemenne. An audio-driven virtual dance-teaching assistant. the 19th ACM international conference, Nov 2011, Scottsdale, France. pp.675, ⟨10.1145/2072298.2072416⟩. ⟨hal-02713825⟩
  • Cyril Joder, Slim Essid, Gael Richard. Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment. ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, France. pp.397-400, ⟨10.1109/ICASSP.2011.5946424⟩. ⟨hal-02714059⟩
  • Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard. Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation. AES Conference, 2011, Ilmenau, Germany. ⟨hal-02713989⟩
  • Cyril Joder, Slim Essid, Gael Richard. Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment. 2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2011, New Paltz, France. pp.121-124, ⟨10.1109/ASPAA.2011.6082330⟩. ⟨hal-02943613⟩
  • Slim Essid, Xinyu Lin, Marc Gowing, Georgios Kordelas, Anil Aksay, et al.. A multimodal dance corpus for research into real-time interaction between humans in online virtual environments. ICMI WORKSHOP ON MULTIMODAL CORPORA FOR MACHINE LEARNING, Nov 2011, Alicante, Spain. ⟨hal-02278689⟩
  • Marc Gowing, Xinyu Lin, Qianni Zhang, Philip Kell, Noel O'Connor, et al.. Enhanced visualisation of dance performance from automatically synchronised multimodal recordings. The 19th ACM international conference, Nov 2011, Scottsdale, France. pp.667, ⟨10.1145/2072298.2072414⟩. ⟨hal-02943617⟩
  • Cyril Joder, Slim Essid, Gael Richard. Approche hiérarchique pour un alignement musique-sur-partition efficace. Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France. ⟨hal-02943620⟩
  • Vallet Félicien, Slim Essid, Jean Carrive, Gaël Richard. Descripteurs visuels robustes pour l'identification de locuteurs dans des émissions televisées de talk-shows. Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France. ⟨hal-02943621⟩
  • Cyril Joder, Slim Essid, Gael Richard. AN IMPROVED HIERARCHICAL APPROACH FOR MUSIC-TO-SYMBOLIC SCORE ALIGNMENT. ISMIR, 2010, Utrecht, Netherlands. ⟨hal-02747659⟩
  • Cyril Joder, Slim Essid, Gael Richard. A conditional random field viewpoint of symbolic audio-to-score matching. the international conference, Oct 2010, Firenze, France. pp.871, ⟨10.1145/1873951.1874100⟩. ⟨hal-02747590⟩
  • Félicien Vallet, Slim Essid, Jean Carrive, Gael Richard. Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows. 2010 17th IEEE International Conference on Image Processing (ICIP 2010), Sep 2010, Hong Kong, France. pp.1469-1472, ⟨10.1109/ICIP.2010.5653393⟩. ⟨hal-02747558⟩
  • Cyril Joder, Slim Essid, Gael Richard. A comparative study of tonal acoustic features for a symbolic level music-to-score alignment. 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Mar 2010, Dallas, France. pp.409-412, ⟨10.1109/ICASSP.2010.5495784⟩. ⟨hal-02747785⟩
  • Benoît Mathieu, Slim Essid, Thomas Fillon, Jacques Prado, Gael Richard. YAAFE, AN EASY TO USE AND EFFICIENT AUDIO FEATURE EXTRACTION SOFTWARE. ISMIR, 2010, Utrecht, Netherlands. ⟨hal-02747689⟩
  • Simon Bozonnet, Félicien Vallet, Nicholas Evans, Slim Essid, Gael Richard, et al.. A MULTIMODAL APPROACH TO INITIALISATION FOR TOP-DOWN SPEAKER DIARIZATION OF TELEVISION SHOWS. Eusipco, 2010, aalborg, Denmark. ⟨hal-02747730⟩
  • M. Lardeur, Slim Essid, G. Richard, M. Haller, T. Sikora. Incorporating prior knowledge on the digital media creation process into audio classifiers. ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, France. pp.1653-1656, ⟨10.1109/ICASSP.2009.4959918⟩. ⟨hal-02943669⟩
  • Sébastien Gulluni, Slim Essid, Olivier Buisson, Gael Richard. Interactive Segmentation of Electro-Acoustic Music. 2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD), Sep 2009, Bled, Slovenia. ⟨hal-02943665⟩
  • Cyril Joder, Slim Essid, Gaël Richard. Étude des descripteurs acoustiques pour l'alignement temporel audio-sur-partition musicale. GRETSI, Sep 2009, Dijon, France. ⟨hal-02943624⟩
  • Cyril Joder, Slim Essid, Gaël Richard. ALIGNMENT KERNELS FOR AUDIO CLASSIFICATION WITH APPLICATION TO MUSIC INSTRUMENT RECOGNITION. 16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland. ⟨hal-02943674⟩
  • S Wegener, M Haller, J Burred, T Sikora, Slim Essid, et al.. ON THE ROBUSTNESS OF AUDIO FEATURES FOR MUSICAL INSTRUMENT CLASSIFICATION. 16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland. ⟨hal-02943672⟩
  • Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Daragh Byrne, et al.. A Collaborative Approach to Video Summarization. SAMT 2008, 3rd International Conference on Semantic and Digital Media Technologies, 2008, Koblenz, Germany. ⟨hal-01987822⟩
  • Emilie Dumont, Bernard Mérialdo, Slim Essid, Werner Bailer, Herwig Rehatschek, et al.. Rushes Video Summarization using a Collaborative Approach. TRECVID 2008, ACM International Conference on Multimedia Information Retrieval, 2008, Vancouver, Canada. ⟨hal-01987824⟩
  • Gael Richard, Pierre Leveau, Laurent Daudet, Slim Essid, Bertrand David. TOWARDS POLYPHONIC MUSICAL INSTRUMENTS RECOGNITION. 19th INTERNATIONAL CONGRESS ON ACOUSTICS, Sep 2007, Madrid, Spain. ⟨hal-02943678⟩
  • Gael Richard, Mathieu Ramona, Slim Essid. Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams. 2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2007, Honolulu, France. pp.II-461-II-464, ⟨10.1109/ICASSP.2007.366272⟩. ⟨hal-02943676⟩
  • Slim Essid, G. Richard, B. David. Instrument recognition in polyphonic music. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005., Mar 2005, Philadelphia, United States. pp.245-248, ⟨10.1109/ICASSP.2005.1415692⟩. ⟨hal-02946873⟩
  • Slim Essid, Pierre Leveau, Gael Richard, Laurent Daudet, Bertrand David. On the usefulness of differentiated transient/steady-state processing in machine recognition of musical instruments. AES 118th convention, May 2005, Barcelona, Spain. ⟨hal-02946881⟩
  • Slim Essid, Gaël Richard, Bertrand David. MUSICAL INSTRUMENT RECOGNITION ON SOLO PERFORMANCES. European Signal Processing Conference (EUSIPCO, Sep 2004, Vienna, Austria. ⟨hal-02946903⟩
  • Slim Essid, Gael Richard, Bertrand David. Efficient musical instrument recognition on solo performance music using basic features. AES 25th conference, Jun 2004, London, United Kingdom. ⟨hal-02946911⟩
  • Slim Essid, Gael Richard, Bertrand David. MUSICAL INSTRUMENT RECOGNITION BASED ON CLASS PAIRWISE FEATURE SELECTION. International Conference on Music Information Retrieval (ISMIR), Oct 2004, Barcelona, Spain. ⟨hal-02946907⟩
  • Remy Boyer, Slim Essid, Karim Abed-Meraim, Nicolas Moreau. Modèles Sinusoïdaux Étendus pour le Codage Audio. Dix-neuvième colloque sur le Traitement du Signal et des Images, Sep 2003, Paris, France. ⟨hal-02946917⟩
  • Remy Boyer, Slim Essid, Nicolas Moreau. Dynamic temporal segmentation in parametric non-stationary modeling for percussive musical signals. IEEE International Conference on Multimedia and Expo (ICME), Aug 2002, Lausane, Switzerland. ⟨hal-01251622⟩
  • Remy Boyer, Slim Essid, Nicolas Moreau. Non-stationary modeling techniques adapted to low bitrate audio coding. Int. Conf. on Signal Processing (ICSP), Aug 2002, Beijing, China. ⟨hal-01251615⟩
  • Remy Boyer, Slim Essid. Transient modeling with a Frequency-Transform Subspace Algorithm and "Transient + Sinusoidal" scheme. IEEE Conference on Digital Signal Processing (DSP), Jul 2002, Santorini, Greece. ⟨hal-01251630⟩
  • Remy Boyer, Slim Essid, Nicolas Moreau. Exploration de techniques modernes de modélisation adaptées à du codage audio bas-débit. 7èmes Journées d'Etudes et d'Echanges : Compression et Représentation des Signaux Audiovisuels (CORESA), Oct 2001, Dijon, France. ⟨hal-02946929⟩

Book sections8 documents

  • Romain Serizel, Victor Bisot, Slim Essid, Gael Richard. Acoustic Features for Environmental Sound Analysis. Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩. ⟨hal-01575619⟩
  • Slim Essid, Sanjeel Parekh, Ngoc Duong, Romain Serizel, Alexey Ozerov, et al.. Multiview Approaches to Event Detection and Scene Analysis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, 2017. ⟨hal-02287697⟩
  • Slim Essid, Sanjeel Parekh, Ngoc Duong, Romain Serizel, Alexey Ozerov, et al.. Multiview approaches to event detection and scene analysis. Tuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer, pp.243-276, 2017, 978-3319634494. ⟨10.1007/978-3-319-63450-0_9⟩. ⟨hal-01620341⟩
  • Slim Essid, Gael Richard. Fusion of Multimodal Information in Music Content Analysis. Multimodal Music Processing, Dagstuhl Follow-Ups,, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik,, 2012. ⟨hal-02653102⟩
  • Slim Essid, Marine Campedel, Gael Richard, Tomas Piatrik, Rachid Benmokhtar, et al.. Machine Learning Techniques for Multimedia Analysis. Multimedia Semantics: Metadata, Analysis and Interaction, 2011. ⟨hal-02943615⟩
  • Félicien Vallet, Slim Essid, Jean Carrive, Gael Richard. High-level TV talk show structuring centered on speakers' interventions. TV Content Analysis: Techniques and Applications, CRC Press, Taylor Francis LLC, 2011. ⟨hal-02653090⟩
  • Rachid Benmokhtar, Huet Benoit, Gael Richard, Thierry Declerck, Slim Essid. Feature Extraction for Multimedia Analysis. Multimedia Semantics: Metadata, Analysis and Interaction, Wiley, 2011. ⟨hal-02653016⟩
  • Gilles Adda, Gérard Chollet, Slim Essid, Thomas Fillon, Martine Garnier-Rizet, et al.. Traitement des modalites "audio" et "parole". Marine Campedel et Pierre Hoogstel. Sémantique et multimodalité en analyse de l'information, Hermes/Lavoisier, 2011. ⟨hal-02943616⟩

Patents2 documents

  • Sanjeel Parekh, Alexey Ozerov, Quang-Khanh-Ngoc Duong, Gael Richard, Slim Essid, et al.. Procédé de traitement d'un signal audio et dispositif électronique correspondant, produit-programme lisible par ordinateur non transitoire et support d'informations lisible par ordinateur. France, Patent n° : EP3392882 A1. 2018. ⟨hal-02651234⟩
  • Quang-Khanh-Ngoc Duong, Alexey Ozerov, Sanjeel Parekh, Slim Essid, Gael Richard, et al.. Procédé de classification et de localisation d'événements audiovisuels et appareil correspondant, produit-programme lisible par ordinateur et support d'informations lisible par ordinateur. France, Patent n° : EP3540634. 2018. ⟨hal-02651256⟩

Preprints, Working Papers, ...1 document

  • Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Duong, Patrick Pérez, et al.. Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision. 2018. ⟨hal-01914532⟩

Reports2 documents

  • Antoine Liutkus, Angélique Drémeau, Dimitrios Alexiadis, Slim Essid, Petros Daras. Analysis of dance movements using Gaussian processes. [Research Report] 2012, pp.10. ⟨hal-00718791v2⟩
  • Slim Essid, Cédric Févotte. Nonnegative matrix factorization for unsupervised audiovisual document structuring. 2011. ⟨hal-00605886⟩

Theses1 document

  • Slim Essid. Classification automatique des signaux audio-fréquences : reconnaissance des instruments de musique. Traitement du signal et de l'image [eess.SP]. Université Pierre et Marie Curie - Paris VI, 2005. Français. ⟨pastel-00002738⟩