Slim Essid
77
Documents
Présentation
Publications
- 77
- 10
- 9
- 8
- 8
- 7
- 4
- 4
- 4
- 4
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 2
- 3
- 6
- 6
- 8
- 6
- 2
- 1
- 4
- 3
- 11
- 8
- 5
- 2
- 3
- 2
- 1
- 3
|
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysisAdvances in neural information processing systems, Dec 2023, New Orleans, United States
Communication dans un congrès
hal-04216055v1
|
|
Latent and Adversarial Data Augmentation for Sound Event Detection and ClassificationInternational workshop on Detection and Classiffication of Acoustic Scenes and Events (DCASE), Nov 2022, Nancy, France
Communication dans un congrès
hal-03782827v1
|
|
Impact de perturbations internes sur l'entraînement de réseaux profonds pour la détection d'évènements sonoresColloque Francophone de Traitement du Signal et des Images (GRETSI), Sep 2022, Nancy, France
Communication dans un congrès
hal-03759651v1
|
|
NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Jun 2021, Toronto (virtual conference), Canada. ⟨10.1109/ICASSP39728.2021.9413841⟩
Communication dans un congrès
hal-02978978v4
|
|
User-guided one-shot deep model adaptation for music source separation2021 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), IEEE, Oct 2021, New Paltz, NY, United States
Communication dans un congrès
hal-03219350v3
|
|
MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic musicSpeech, Music and Mind (SMM), Satellite Workshop of Interspeech 2019, Sep 2019, Vienna, Austria
Communication dans un congrès
hal-02291882v3
|
Decoding auditory attention in polyphonic music based on EEG: a new dataset and a preliminary studyAuditory EEG Signal Processing (AESoP) symposium, Sep 2019, Leuven, Belgium
Communication dans un congrès
hal-03175885v1
|
|
|
EEG-BASED DECODING OF AUDITORY ATTENTION TO A TARGET INSTRUMENT IN POLYPHONIC MUSIC2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, NY, United States
Communication dans un congrès
hal-02291896v1
|
|
IDENTIFY, LOCATE AND SEPARATE: AUDIO-VISUAL OBJECT EXTRACTION IN LARGE VIDEO COLLECTIONS USING WEAK SUPERVISIONIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2019, New Paltz, United States
Communication dans un congrès
hal-02380780v1
|
|
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual EventsCVPR Workshop, 2018, Salt Lake city, United States
Communication dans un congrès
hal-02713307v1
|
|
Motion informed audio source separationIEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States
Communication dans un congrès
hal-01447977v1
|
|
Supervised Group Nonnegative Matrix Factorisation With Similarity Constraints And Applications To Speaker IdentificationIEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Mar 2017, New Orleans, United States
Communication dans un congrès
hal-01484744v1
|
Overlapping sound event detection with supervised Nonnegative Matrix Factorization2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Mar 2017, New Orleans, France. pp.31-35, ⟨10.1109/ICASSP.2017.7951792⟩
Communication dans un congrès
hal-02713341v1
|
|
|
Nonnegative Feature Learning Methods for Acoustic Scene ClassificationDCASE 2017 - Workshop on Detection and Classification of Acoustic Scenes and Events, Nov 2017, Munich, Germany
Communication dans un congrès
hal-01636627v1
|
Guiding Audio Source Separation by Video Object InformationIEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2017, New Paltz, New York, United States
Communication dans un congrès
hal-02287698v1
|
|
|
Leveraging deep neural networks with nonnegative representations for improved environmental sound classificationIEEE International Workshop on Machine Learning for Signal Processing MLSP, Sep 2017, Tokyo, Japan
Communication dans un congrès
hal-01576857v1
|
Acoustic scene classification with matrix factorization for unsupervised feature learningICASSP, Mar 2016, Shangai, China
Communication dans un congrès
hal-02287267v1
|
|
|
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergenceIEEE International Workshop on Machine Learning for Signal Processing (MLSP 2016), Sep 2016, Salerne, Italy
Communication dans un congrès
hal-01393964v1
|
Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identificationICASSP, Mar 2016, Shangai, China. pp.5470 - 5474
Communication dans un congrès
hal-02288453v1
|
|
|
Group Non-Negative Matrix Factorisation With Speaker And Session Similarity Constraints For Speaker IdentificationIEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2016, Shangai, China
Communication dans un congrès
hal-01393968v1
|
|
SUPERVISED NONNEGATIVE MATRIX FACTORIZATION FOR ACOUSTIC SCENE CLASSIFICATIONIEEE international evaluation campaign on detection and classification of acousitc scenes and events (DCASE 2016), Sep 2016, Budapest, Hungary
Communication dans un congrès
hal-02943480v1
|
|
Machine listening techniques as a complement to video image analysis in forensicsIEEE International Conference on Image Processing, Sep 2016, Phoenix, AZ, United States. pp.948-952, ⟨10.1109/ICIP.2016.7532497⟩
Communication dans un congrès
hal-01393959v1
|
Hog and Subband power distribution image features for acoustic scene classificationEUSIPCO, Sep 2015, Nice, France. pp.719-723
Communication dans un congrès
hal-02287266v1
|
|
|
Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes2014 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 14), May 2014, Florence, Italy
Communication dans un congrès
hal-00990252v1
|
|
Exploring new features for music classificationWIAMIS, Jul 2013, Paris, France. ⟨10.1109/WIAMIS.2013.6616154⟩
Communication dans un congrès
hal-01126767v1
|
Etiquetage automatique de l'audio : une approche de boosting régressif basée sur une fusion souple d'annotateursCoresa, 2013, NA, France
Communication dans un congrès
hal-01106754v1
|
|
|
MULTIMODAL CLASSIFICATION OF DANCE MOVEMENTS USING BODY JOINT TRAJECTORIES AND STEP SOUNDSInternational Workshop on Image and Audio Analysis for Multimedia Interactive Services WIAMIS, Nov 2013, Paris, France. pp.1-4, ⟨10.1109/WIAMIS.2013.6616151⟩
Communication dans un congrès
hal-00904461v1
|
|
A regressive boosting approach to automatic audio tagging based on soft annotator fusionIEEE ICASSP, Mar 2012, Kyoto, Japan. ⟨10.1109/ICASSP.2012.6287820⟩
Communication dans un congrès
hal-01132529v1
|
A multimodal dance corpus for research into real-time interaction between humans in online virtual environmentsICMI WORKSHOP ON MULTIMODAL CORPORA FOR MACHINE LEARNING, Nov 2011, Alicante, Spain
Communication dans un congrès
hal-02278689v1
|
|
Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignmentICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), May 2011, Prague, France. pp.397-400, ⟨10.1109/ICASSP.2011.5946424⟩
Communication dans un congrès
hal-02714059v1
|
|
Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment2011 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct 2011, New Paltz, France. pp.121-124, ⟨10.1109/ASPAA.2011.6082330⟩
Communication dans un congrès
hal-02943613v1
|
|
Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music AnnotationAES Conference, 2011, Ilmenau, Germany
Communication dans un congrès
hal-02713989v1
|
|
AN INTERACTIVE SYSTEM FOR ELECTRO-ACOUSTIC MUSIC ANALYSISISMIR, 2011, Miami, United States
Communication dans un congrès
hal-02713906v1
|
|
An audio-driven virtual dance-teaching assistantthe 19th ACM international conference, Nov 2011, Scottsdale, France. pp.675, ⟨10.1145/2072298.2072416⟩
Communication dans un congrès
hal-02713825v1
|
|
Multi-scale temporal fusion by boosting for music classificationISMIR, 2011, Miami, United States. pp.663-668
Communication dans un congrès
hal-00639097v1
|
|
A MULTIMODAL APPROACH TO INITIALISATION FOR TOP-DOWN SPEAKER DIARIZATION OF TELEVISION SHOWSEusipco, 2010, aalborg, Denmark
Communication dans un congrès
hal-02747730v1
|
|
Approche hiérarchique pour un alignement musique-sur-partition efficaceCompression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France
Communication dans un congrès
hal-02943620v1
|
|
A comparative study of tonal acoustic features for a symbolic level music-to-score alignment2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010, Mar 2010, Dallas, France. pp.409-412, ⟨10.1109/ICASSP.2010.5495784⟩
Communication dans un congrès
hal-02747785v1
|
|
YAAFE, AN EASY TO USE AND EFFICIENT AUDIO FEATURE EXTRACTION SOFTWAREISMIR, 2010, Utrecht, Netherlands
Communication dans un congrès
hal-02747689v1
|
|
Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows2010 17th IEEE International Conference on Image Processing (ICIP 2010), Sep 2010, Hong Kong, France. pp.1469-1472, ⟨10.1109/ICIP.2010.5653393⟩
Communication dans un congrès
hal-02747558v1
|
|
A conditional random field viewpoint of symbolic audio-to-score matchingthe international conference, Oct 2010, Firenze, France. pp.871, ⟨10.1145/1873951.1874100⟩
Communication dans un congrès
hal-02747590v1
|
|
AN IMPROVED HIERARCHICAL APPROACH FOR MUSIC-TO-SYMBOLIC SCORE ALIGNMENTISMIR, 2010, Utrecht, Netherlands
Communication dans un congrès
hal-02747659v1
|
|
Descripteurs visuels robustes pour l'identification de locuteurs dans des émissions televisées de talk-showsCompression et Représentation des Signaux Audiovisuels (CORESA), Oct 2010, Lyon, France
Communication dans un congrès
hal-02943621v1
|
|
|
Interactive Segmentation of Electro-Acoustic Music2nd International Workshop on Machine Learning and Music (MML - ECML - PKDD), Sep 2009, Bled, Slovenia
Communication dans un congrès
hal-02943665v1
|
Incorporating prior knowledge on the digital media creation process into audio classifiersICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, Apr 2009, Taipei, France. pp.1653-1656, ⟨10.1109/ICASSP.2009.4959918⟩
Communication dans un congrès
hal-03117240v1
|
|
|
Étude des descripteurs acoustiques pour l'alignement temporel audio-sur-partition musicaleGRETSI, Sep 2009, Dijon, France
Communication dans un congrès
hal-02943624v1
|
Étude des descripteurs acoustiques pour l'alignement temporel audio-sur-partition musicaleColloque GRETSI, 2009, dijon, France
Communication dans un congrès
hal-03117111v1
|
|
|
ON THE ROBUSTNESS OF AUDIO FEATURES FOR MUSICAL INSTRUMENT CLASSIFICATION16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland
Communication dans un congrès
hal-02943672v1
|
|
ALIGNMENT KERNELS FOR AUDIO CLASSIFICATION WITH APPLICATION TO MUSIC INSTRUMENT RECOGNITION16th European Signal Processing Conference, Aug 2008, Lausanne, Switzerland
Communication dans un congrès
hal-02943674v1
|
|
TOWARDS POLYPHONIC MUSICAL INSTRUMENTS RECOGNITION19th INTERNATIONAL CONGRESS ON ACOUSTICS, Sep 2007, Madrid, Spain
Communication dans un congrès
hal-02943678v1
|
Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Apr 2007, Honolulu, France. pp.II-461-II-464, ⟨10.1109/ICASSP.2007.366272⟩
Communication dans un congrès
hal-02943676v1
|
|
|
On the usefulness of differentiated transient/steady-state processing in machine recognition of musical instrumentsAES 118th convention, May 2005, Barcelona, Spain
Communication dans un congrès
hal-02946881v1
|
|
MUSICAL INSTRUMENT RECOGNITION BASED ON CLASS PAIRWISE FEATURE SELECTIONInternational Conference on Music Information Retrieval (ISMIR), Oct 2004, Barcelona, Spain
Communication dans un congrès
hal-02946907v1
|
|
Efficient musical instrument recognition on solo performance music using basic featuresAES 25th conference, Jun 2004, London, United Kingdom
Communication dans un congrès
hal-02946911v1
|
|
MUSICAL INSTRUMENT RECOGNITION ON SOLO PERFORMANCESEuropean Signal Processing Conference (EUSIPCO, Sep 2004, Vienna, Austria
Communication dans un congrès
hal-02946903v1
|
|
Multimodal Music Recording RemasteringDMRN+13: Digital Music Research Network One-day Workshop 2018, Dec 2018, London, United Kingdom
Poster de conférence
hal-03187638v1
|
|
Acoustic Features for Environmental Sound AnalysisTuomas Virtanen; Mark D. Plumbley; Dan Ellis. Computational Analysis of Sound Scenes and Events, Springer International Publishing AG, pp.71-101, 2017, 978-3-319-63449-4. ⟨10.1007/978-3-319-63450-0_4⟩
Chapitre d'ouvrage
hal-01575619v1
|
Fusion of Multimodal Information in Music Content AnalysisMultimodal Music Processing, Dagstuhl Follow-Ups,, Schloss Dagstuhl--Leibniz-Zentrum fuer Informatik,, 2012
Chapitre d'ouvrage
hal-02653102v1
|
|
Feature Extraction for Multimedia AnalysisMultimedia Semantics: Metadata, Analysis and Interaction, Wiley, 2011
Chapitre d'ouvrage
hal-02653016v1
|
|
Machine Learning Techniques for Multimedia AnalysisMultimedia Semantics: Metadata, Analysis and Interaction, 2011
Chapitre d'ouvrage
hal-02943615v1
|
|
High-level TV talk show structuring centered on speakers' interventionsTV Content Analysis: Techniques and Applications, CRC Press, Taylor Francis LLC, 2011
Chapitre d'ouvrage
hal-02653090v1
|
|
EEG-based Decoding of Auditory Attention to a Target Instrument for Neuro-steered Music Source Separation2021
Pré-publication, Document de travail
hal-04349308v2
|
|
Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision2018
Pré-publication, Document de travail
hal-01914532v1
|