- 17
- 6
- 6
- 4
- 3
- 2
- 2
- 1
Sahar Ghannay
41
Documents
Identifiants chercheurs
- sahar-ghannay
- 0000-0002-7531-2522
- IdRef : 220420521
Présentation
Associate professort at LIMSI, CNRS, Université Paris-Saclay
------------------------------------------------------------
Team: [ILES](https://www.limsi.fr/en/research/iles/home)
Email: sahar.ghannay@limsi.fr
Website: <https://saharghannay.github.io>
Short Bio
---------
Sahar Ghannay is an associate professor at Université Paris-Saclay, in the CNRS, [LISN](https://www.lisn.upsaclay.fr) research center, since September 2018.
She received a PhD in Computer Science from Le Mans University on Septembre 2017. Her thesis work is part of the ANR [VERA](https://anr.fr/Project-ANR-12-BS02-0006) (AdVanced ERror Analysis for speech recognition) project. During her PhD, she spent a few months as @ visiting researcher at Apple within the Siri Speech team.
As a postdoctoral researcher at [LIUM](https://lium.univ-lemans.fr/), she worked on neural end-to-end systems for the detection of named entities, speech understanding, as part of the Chist-Era [M2CR](https://projets-lium.univ-lemans.fr/m2cr/) (Multimodal Multilingual Continuous Representation for Human Language Understanding) project.
Her main research interests are continuous representations learning and their application to natural language processing and speech recognition tasks, semantic information extraction form spoken and writen language and dialog system.
CV
==
Education
---------
- PHD in computer science, at LIUM, Le Mans université, 2017
- MS in computer science, Le Mans université, 2013
- BS in computer science, Le Mans université and université de sfax, 2011
Work Experience
----------------
- 2018 - now: Associate Professort at LISN, CNRS, Université Paris-Saclay
- 2017-2018: Post-doc at LIUM, Le Mans université
- 2017 (4 moths): Internship at Apple within the Siri Speech team at Cupertino
- April 2013-Sept. 2014: research engineer
Publications
- 5
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 17
- 15
- 13
- 5
- 5
- 5
- 3
- 3
- 3
- 3
- 3
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 2
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 1
- 17
- 7
- 5
- 3
- 5
- 1
- 1
- 1
- 6
- 4
- 2
- 3
- 8
- 1
- 5
- 3
- 5
- 3
- 1
RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model qualityBriefings in Bioinformatics, 2024, 25 (2), pp.bbae064. ⟨10.1093/bib/bbae064⟩
Article dans une revue
hal-04508073v1
|
|
|
A study of continuous space word and sentence representations applied to ASR error detectionSpeech Communication, 2020
Article dans une revue
hal-02501943v1
|
|
New Semantic Task for the French Spoken Language Understanding MEDIA BenchmarkThe 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024, Torino, Italy
Communication dans un congrès
hal-04523286v1
|
|
Small Language Models are Good Too: An Empirical Study of Zero-Shot ClassificationLREC-COLING 2024, May 2024, TURIN, Italy
Communication dans un congrès
hal-04519930v1
|
|
mALBERT: Is a Compact Multilingual BERT Model Still Worth It?The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, May 2024, Torino, Italy
Communication dans un congrès
hal-04520797v1
|
|
Projet Gender Equality Monitor (GEM)18e Conférence en Recherche d'Information et Applications, 16e Rencontres Jeunes Chercheurs en RI, 30e Conférence sur le Traitement Automatique des Langues Naturelles, 25e Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, 2023, Paris, France. pp.21-21
Communication dans un congrès
hal-04208588v1
|
Specialized Semantic Enrichment of Speech Representations2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), Jun 2023, Rhodes Island, France. pp.1-5, ⟨10.1109/ICASSPW59220.2023.10193452⟩
Communication dans un congrès
hal-04425710v1
|
|
|
Continual self-supervised domain adaptation for end-to-end speaker diarizationIEEE Spoken Language Technology Workshop (SLT 2022), IEEE Speech and Language Processing Technical Committee, Jan 2023, Doha, Qatar. à paraître
Communication dans un congrès
hal-03824546v1
|
|
Analyzing BERT Cross-lingual Transfer Capabilities in Continual Sequence LabelingFirst Workshop on Performance and Interpretability Evaluations of Multimodal, Multipurpose, Massive-Scale Models, Oct 2022, virtual, South Korea
Communication dans un congrès
hal-03824597v1
|
|
Benchmarking Transformers-based models on French Spoken Language Understanding tasksINTERSPEECH 2022, Sep 2022, Incheon, South Korea
Communication dans un congrès
hal-03715340v2
|
|
Evaluating the carbon footprint of NLP methods: a survey and analysis of existing toolsEMNLP, Workshop SustaiNLP, Nov 2021, Punta Cana, Dominican Republic
Communication dans un congrès
hal-03435068v1
|
|
OVERLAP-AWARE LOW-LATENCY ONLINE SPEAKER DIARIZATION BASED ON END-TO-END LOCAL SEGMENTATIONIEEE Automatic Speech Recognition and Unserstanding Workshop, Dec 2021, Cartagena, Colombia
Communication dans un congrès
hal-03375330v1
|
A Comparison of Metric Learning Loss Functions for End-To-End Speaker VerificationInternational Conference on Statistical Language and Speech Processing, Oct 2020, Cardiff, United Kingdom. pp.137-148, ⟨10.1007/978-3-030-59430-5_11⟩
Communication dans un congrès
hal-02989334v1
|
|
|
A Metric Learning Approach to Misogyny CategorizationWorkshop on Representation Learning for NLP, Jul 2020, Online, France. pp.89-94, ⟨10.18653/v1/2020.repl4nlp-1.12⟩
Communication dans un congrès
hal-02989293v1
|
|
What is best for Spoken Language Understanding: Small but Task-dependant Embeddings or Huge but Out-of-domain Embeddings?45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), May 2020, Barcelona, Spain. pp.8114-8118, ⟨10.1109/ICASSP40776.2020.9053278⟩
Communication dans un congrès
hal-02503694v1
|
Error analysis applied to end-to end spoken language understanding45th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), May 2020, Barcelona, Spain. pp.8514-8518, ⟨10.1109/ICASSP40776.2020.9054455⟩
Communication dans un congrès
hal-02465899v1
|
|
|
Neural Networks approaches focused on French Spoken Language Understanding: application to the MEDIA Evaluation TaskIn Proceedings of The 28th International Conference on Computational Linguistics (COLING’2020), 2020, Dec 2020, Barcelona (online), Spain
Communication dans un congrès
hal-03007482v1
|
|
A Cooking Knowledge Graph and Benchmark for Question Answering Evaluation in Lifelong Learning ScenariosInternational Conference on Applications of Natural Language to Information Systems, Elisabeth Métais and Farid Meziane and Helmut Horacek and Philipp Cimiano, Jun 2020, Saarbrücken, Germany
Communication dans un congrès
hal-03006228v1
|
|
Experiments from LIMSI at the French Named Entity Recognition Coarse-grained taskConference and Labs of the Evaluation Forum, Sep 2020, Thessaloniki, Greece
Communication dans un congrès
hal-04395545v1
|
|
Lifelong learning and task-oriented dialogue system: what does it mean?International Workshop on Spoken Dialogue Systems Technology, Apr 2019, Siracusa, Italy
Communication dans un congrès
hal-02301089v1
|
|
End-to-end named entity and semantic concept extraction from speechIEEE Spoken Language Technology Workshop, Dec 2018, Athens, Greece
Communication dans un congrès
hal-01987740v2
|
|
Simulating ASR errors for training SLU systemsLREC 2018, May 2018, Miyazaki, Japan
Communication dans un congrès
hal-01715923v1
|
|
Simulation d'erreurs de reconnaissance automatique dans un cadre de compréhension de la paroleXXXIIe Journées d'Etudes sur la Parole (JEP 2018), Jun 2018, Aix-en-Provence, France
Communication dans un congrès
hal-01757770v1
|
Représentations de phrases dans un espace continu spécifiques à la tâche de détection d'erreursXXXIIe Journées d'Etudes sur la Parole (JEP 2018), Jun 2018, Aix-en-Provence, France
Communication dans un congrès
hal-01757774v1
|
|
Task Specific Sentence Embeddings for ASR Error DetectionInterspeech 2018, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2211⟩
Communication dans un congrès
hal-01870864v1
|
|
|
Enriching confusion networks for post-processingStatistical Language and Speech Processing 2017, Oct 2017, Le Mans, France
Communication dans un congrès
hal-01585768v1
|
|
ASR error management for improving spoken language understandingInterspeech 2017, Aug 2017, Stockholm, Sweden
Communication dans un congrès
hal-01526298v1
|
Evaluation of acoustic word embeddingsRepEval@ACL 2016: The 1st Workshop on Evaluating Vector-Space Representations for NLP, 2016, Berlin, Germany
Communication dans un congrès
hal-01433181v1
|
|
Acoustic word embeddings for ASR error detectionInterspeech 2016, 2016, San Francisco (CA, USA), Unknown Region
Communication dans un congrès
hal-01433176v1
|
|
Recent improvements on error detection for automatic speech recognition1st International Workshop on Multimodal Media Data Analytics (MMDA 2016), in Conjunction with the 22nd European Conference on Artificial Intelligence, 2016, The Hague The, Netherlands
Communication dans un congrès
hal-01433168v1
|
|
Utilisation des représentations continues des mots et des paramètres prosodiques pour la détection d’erreurs dans les transcriptions automatiques de la parole31ème Journées d’Études sur la Parole, 2016, Paris, France
Communication dans un congrès
hal-01450277v1
|
|
Word embedding evaluation and combination10th edition of the Language Resources and Evaluation Conference (LREC 2016), 2016, Portorož, Slovenia
Communication dans un congrès
hal-01433185v1
|
|
Which ASR errors are hard to detect?Workshop Errors by Humans and Machines in multimedia, multimodal and multilingual data processing (ERRARE 2015), 2015, Sinaia, Romania
Communication dans un congrès
hal-01433201v1
|
|
Word embeddings combination and neural networks for robustness in ASR error detection2015 European Signal Processing Conference (EUSIPCO 2015), 2015, Nice, France
Communication dans un congrès
hal-01433210v1
|
|
Combining continous word representation and prosodic features for ASR error prediction3rd International Conference on Statistical Language and Speech Processing (SLSP 2015), 2015, Budapest, Hungary
Communication dans un congrès
hal-01433203v1
|
|
Using Hypothesis Selection Based Features for Confusion Network MT System CombinationThird Workshop on Hybrid Approaches to Translation (HyTra), EACL 2014, 2014, Gothenburg, Sweden
Communication dans un congrès
hal-01433229v1
|
|
RNAdvisor: a comprehensive benchmarking tool for the measure and prediction of RNA structural model quality2024
Pré-publication, Document de travail
hal-04437940v1
|
|
State-of-the-RNArt: benchmarking current methods for RNA 3D structure prediction2024
Pré-publication, Document de travail
hal-04437967v1
|
LIMSI_UPV at SemEval-2020 Task 9: Recurrent Convolutional Neural Network for Code-mixed Sentiment Analysis2021
Pré-publication, Document de travail
hal-03294371v1
|
Semantic enrichment towards efficient speech representationsLISN. 2023
Rapport
hal-04425932v1
|
|
Étude sur les représentations continues de mots appliquées à la détection automatique des erreurs de reconnaissance de la paroleInformatique et langage [cs.CL]. Université du Maine, 2017. Français. ⟨NNT : 2017LEMA1019⟩
Thèse
tel-01661491v1
|