Md Sahidullah

52
Documents

Publications

Cross-corpora spoken language identification with domain diversification and generalization

Spandan Dey , Md Sahidullah , Goutam Saha
Computer Speech and Language, 2023, 81 (June 2023), pp.101489. ⟨10.1016/j.csl.2023.101489⟩
Journal article hal-03984643v1

Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings

Shakeel Sheikh , Md Sahidullah , Fabrice Hirsch , Slim Ouni
International Journal of Speech Technology, 2023, ⟨10.1007/s10772-023-10032-1⟩
Journal article hal-03629758v2

Analysis of constant-Q filterbank based representations for speech emotion recognition

Premjeet Singh , Shefali Waldekar , Md Sahidullah , Goutam Saha
Digital Signal Processing, 2022, 130, pp.103712. ⟨10.1016/j.dsp.2022.103712⟩
Journal article hal-03846173v1

Robust acoustic domain identification with its application to speaker diarization

A Kishore Kumar , Shefali Waldekar , Md Sahidullah , Goutam Saha
International Journal of Speech Technology, 2022, 25 (December), pp.933-945. ⟨10.1007/s10772-022-09990-9⟩
Journal article hal-03719697v1

Machine Learning for Stuttering Identification: Review, Challenges & Future Directions

Shakeel Sheikh , Md Sahidullah , Fabrice Hirsch , Slim Ouni
Neurocomputing, 2022, 514 (2022), pp.17. ⟨10.1016/j.neucom.2022.10.015⟩
Journal article hal-03634072v2

An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Spandan Dey , Md Sahidullah , Goutam Saha
ACM Transactions on Asian and Low-Resource Language Information Processing, 2022, 21 (6), pp.1-45. ⟨10.1145/3523179⟩
Journal article hal-03616853v1

Privacy and utility of x-vector based speaker anonymization

Brij Mohan Lal Srivastava , Mohamed Maouche , Md Sahidullah , Emmanuel Vincent , Aurélien Bellet
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2022, ⟨10.1109/TASLP.2022.3190741⟩
Journal article hal-03197376v3

ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

Andreas Nautsch , Xin Wang , Nicholas Evans , Tomi Kinnunen , Ville Vestman
IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (2), pp.252-265. ⟨10.1109/TBIOM.2021.3059479⟩
Journal article hal-03236124v1

Optimizing Multi-Taper Features for Deep Speaker Verification

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
IEEE Signal Processing Letters, 2021, 28, pp.2187 - 2191. ⟨10.1109/LSP.2021.3122796⟩
Journal article hal-03394152v1

Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework

Nirmalya Sen , Md Sahidullah , Hemant Patil , Shyamal Kumar das Mandal , Sreenivasa Krothapalli Rao
International Journal of Speech Technology, 2021, 24, pp.1067-1088. ⟨10.1007/s10772-021-09862-8⟩
Journal article hal-03232723v1

Speech Frame Selection for Spoofing Detection with an Application to Partially Spoofed Audio-Data

Kishore A. Kumar , Dipjyoti Paul , Monisankha Pal , Md Sahidullah , Goutam Saha
International Journal of Speech Technology, 2021, ⟨10.1007/s10772-020-09785-w⟩
Journal article hal-03008912v1

Optimization of data-driven filterbank for automatic speaker verification

Susanta Sarangi , Md Sahidullah , Goutam Saha
Digital Signal Processing, 2020, 104, ⟨10.1016/j.dsp.2020.102795⟩
Journal article hal-02900353v1

ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech

Xin Wang , Junichi Yamagishi , Massimiliano Todisco , Héctor Delgado , Andreas Nautsch
Computer Speech and Language, 2020, 64, pp.101114. ⟨10.1016/j.csl.2020.101114⟩
Journal article hal-02945493v1

Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals

Tomi Kinnunen , Héctor Delgado , Nicholas Evans , Kong-Aik Lee , Ville Vestman
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, 28, pp.2195-2210. ⟨10.1109/TASLP.2020.3009494⟩
Journal article hal-02900931v1

Quality Measures for Speaker Verification with Short Utterances

Arnab Poddar , Md Sahidullah , Goutam Saha
Digital Signal Processing, 2019, 88, pp.66-79. ⟨10.1016/j.dsp.2019.01.023⟩
Journal article hal-01998376v1

Voice Mimicry Attacks Assisted by Automatic Speaker Verification

Ville Vestman , Tomi Kinnunen , Rosa González Hautamäki , Md Sahidullah
Computer Speech and Language, 2019, 59, pp.36-54. ⟨10.1016/j.csl.2019.05.005⟩
Journal article hal-02161773v1

Robust Stuttering Detection via Multi-task and Adversarial Learning

Shakeel Sheikh , Md Sahidullah , Fabrice Hirsch , Slim Ouni
EUSIPCO 2022 - 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia
Conference paper hal-03629785v1

Spoofing-Aware Speaker Verification with Unsupervised Domain Adaptation

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
Odyssey 2022 – The Speaker and Language Recognition Workshop, Jun 2022, Beijing, China. pp.85-91, ⟨10.21437/Odyssey.2022-12⟩
Conference paper hal-03796438v1

Learnable Nonlinear Compression for Robust Speaker Verification

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing, May 2022, Singapore, Singapore. ⟨10.1109/ICASSP43922.2022.9747185⟩
Conference paper hal-03616852v1

Baselines and Protocols for Household Speaker Recognition

Alexey Sholokhov , Xuechen Liu , Md Sahidullah , Tomi Kinnunen
The Speaker and Language Recognition Workshop (Odyssey 2022), Jun 2022, Beijing, China. pp.185-192, ⟨10.21437/Odyssey.2022-26⟩
Conference paper hal-03846180v1

Baseline Systems for the First Spoofing-Aware Speaker Verification Challenge: Score and Embedding Fusion

Hye-Jin Shim , Hemlata Tak , Xuechen Liu , Hee-Soo Heo , Jee-Weon Jung
Odyssey 2022 - The Speaker and Language Recognition Workshop, Jun 2022, Beijing, China
Conference paper hal-03652819v1

End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge

Shakeel A Sheikh , Md Sahidullah , Fabrice Hirsch , Slim Ouni
ACM Multimedia 2022 Computational Paralinguistics Challenge (ComParE), Oct 2022, Lisbon, Portugal
Conference paper hal-03728331v1

Data Quality as Predictor of Voice Anti-Spoofing Generalization

Bhusan Chettri , Rosa González Hautamäki , Md Sahidullah , Tomi Kinnunen
INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-1180⟩
Conference paper hal-03261131v1

Modeling and training strategies for language recognition systems

Raphaël Duroselle , Md Sahidullah , Denis Jouvet , Irina Illina
INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-277⟩
Conference paper hal-03264085v1

StutterNet: Stuttering Detection Using Time Delay Neural Network

Shakeel Ahmad Sheikh , Md Sahidullah , Fabrice Hirsch , Slim Ouni
EUSIPCO 2021 - 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616063⟩
Conference paper hal-03227223v1

Parameterized Channel Normalization for Far-field Deep Speaker Verification

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
ASRU 2021 - IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia
Conference paper hal-03359174v1

UIAI System for Short-Duration Speaker Verification Challenge 2020

Md Sahidullah , Achintya Kumar Sarkar , Ville Vestman , Xuechen Liu , Romain Serizel
SLT 2021 - IEEE Spoken Language Technology Workshop, IEEE, Jan 2021, Shenzhen / Virtual, China. ⟨10.1109/SLT48900.2021.9383596⟩
Conference paper hal-02907037v2

Optimized Power Normalized Cepstral Coefficients Towards Robust Deep Speaker Verification

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
ASRU 2021 - IEEE Automatic Speech Recognition and Understanding Workshop, Dec 2021, Cartagena, Colombia
Conference paper hal-03359173v1

Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages

Spandan Dey , Goutam Saha , Md Sahidullah
EUSIPCO 2021 - 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9616273⟩
Conference paper hal-03223314v1

Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing

Tomi Kinnunen , Andreas Nautsch , Md Sahidullah , Nicholas Evans , Xin Wang
INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-1522⟩
Conference paper hal-03261467v1

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

Junichi Yamagishi , Xin Wang , Massimiliano Todisco , Md Sahidullah , Jose Patino
ASVspoof 2021 Workshop - Automatic Speaker Verification and Spoofing Countermeasures Challenge, Sep 2021, Virtual, France
Conference paper hal-03360794v1

Language recognition on unknown conditions: the LORIA-Inria-MULTISPEECH system for AP20-OLR Challenge

Raphaël Duroselle , Md Sahidullah , Denis Jouvet , Irina Illina
INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨10.21437/Interspeech.2021-276⟩
Conference paper hal-03228823v2

Learnable MFCCs for Speaker Verification

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
ISCAS 2021 - IEEE International Symposium on Circuits and Systems, May 2021, Daegu, South Korea. ⟨10.1109/ISCAS51556.2021.9401593⟩
Conference paper hal-03139532v1

Non-linear frequency warping using constant-Q transformation for speech emotion recognition

Premjeet Singh , Goutam Saha , Md Sahidullah
ICCCI 2021 - International Conference on Computer Communication and Informatics, Jan 2021, Coimbatore, India. ⟨10.1109/ICCCI50826.2021.9402569⟩
Conference paper hal-03134015v1

Domain-Dependent Speaker Diarization for the Third DIHARD Challenge

Kishore A. Kumar , Shefali Waldekar , Goutam Saha , Md Sahidullah
DIHARD 2021 - 3rd Speech Diarization Challenge Workshop, Jan 2021, Virtual, France
Conference paper hal-03117843v1

Deep scattering network for speech emotion recognition

Premjeet Singh , Goutam Saha , Md Sahidullah
EUSIPCO 2021 - 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨10.23919/EUSIPCO54536.2021.9615958⟩
Conference paper hal-03218278v1

Benchmarking and challenges in security and privacy for voice biometrics

Jean-Francois Bonastre , Hector Delgado , Nicholas Evans , Tomi Kinnunen , Kong Aik Lee
SPSC 2021 - 1st ISCA Symposium on Security and Privacy in Speech Communication, ISCA, Nov 2021, Magdeburg, Germany. ⟨10.21437/SPSC.2021-11⟩
Conference paper hal-03346196v1

Evaluating Voice Conversion-based Privacy Protection against Informed Attackers

Brij Mohan Lal Srivastava , Nathalie Vauquier , Md Sahidullah , Aurélien Bellet , Marc Tommasi
ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2020, Barcelona, Spain. pp.2802-2806
Conference paper hal-02355115v2

A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings

Xuechen Liu , Md Sahidullah , Tomi Kinnunen
INTERSPEECH 2020, Oct 2020, Shanghai, China
Conference paper hal-02909105v1

ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection

Massimiliano Todisco , Xin Wang , Ville Vestman , Md Sahidullah , Héctor Delgado
INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Conference paper hal-02172099v1

Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection

Tomi Kinnunen , Rosa González Hautamäki , Ville Vestman , Md Sahidullah
ICASSP 2019 – 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom
Conference paper hal-02051701v1

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

Kong Aik Lee , Ville Hautamäki , Tomi Kinnunen , Hitoshi Yamamoto , Koji Okabe
INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria
Conference paper hal-02280151v1

Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems

Fuming Fang , Junichi Yamagishi , Isao Echizen , Md Sahidullah , Tomi Kinnunen
WIFS 2018 - IEEE International Workshop on Information Forensics and Security, Dec 2018, Hong Kong, Hong Kong SAR China
Conference paper hal-01889910v1

Audiovisual Synchrony Detection with Optimized Audio Features

Sami Sieranoja , Md Sahidullah , Tomi Kinnunen , Jukka Komulainen , Abdenour Hadid
ICSIP 2018 - 3rd International Conference on Signal and Image Processing, Jul 2018, Shenzhen, China
Conference paper hal-01889918v1

ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements

Héctor Delgado , Massimiliano Todisco , Md Sahidullah , Nicholas Evans , Tomi Kinnunen
Odyssey 2018 - The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d'Olonne, France
Conference paper hal-01880206v1

Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion

Massimiliano Todisco , Héctor Delgado , Kong Aik Lee , Md Sahidullah , Nicholas Evans
Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2289⟩
Conference paper hal-01889934v1

t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification

Tomi Kinnunen , Kong Aik Lee , Héctor Delgado , Nicholas Evans , Massimiliano Todisco
Speaker Odyssey 2018 - The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d’Olonne, France
Conference paper hal-01880306v1

Introduction to Voice Presentation Attack Detection and Recent Advances

Md Sahidullah , Héctor Delgado , Massimiliano Todisco , Tomi Kinnunen , Nicholas Evans
Sébastien Marcel; Mark S. Nixon; Julian Fierrez; Nicholas Evans. Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, Springer, pp.321-361, 2019, Advances in Computer Vision and Pattern Recognition, 978-3-319-92626-1. ⟨10.1007/978-3-319-92627-8_15⟩
Book chapter hal-01974528v1