Skip to Main content
Number of documents

30

My Resume


Journal articles6 documents

  • Nirmalya Sen, Md Sahidullah, Hemant Patil, Shyamal Kumar das Mandal, Sreenivasa Krothapalli Rao, et al.. Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. International Journal of Speech Technology, Springer Verlag, In press, ⟨10.1007/s10772-021-09862-8⟩. ⟨hal-03232723⟩
  • Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, et al.. ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Computer Speech and Language, Elsevier, 2020, 64, pp.101114. ⟨10.1016/j.csl.2020.101114⟩. ⟨hal-02945493⟩
  • Tomi Kinnunen, Héctor Delgado, Nicholas Evans, Kong-Aik Lee, Ville Vestman, et al.. Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification: Fundamentals. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 28, pp.2195 - 2210. ⟨10.1109/TASLP.2020.3009494⟩. ⟨hal-02900931⟩
  • Susanta Sarangi, Md Sahidullah, Goutam Saha. Optimization of data-driven filterbank for automatic speaker verification. Digital Signal Processing, Elsevier, 2020, 104, ⟨10.1016/j.dsp.2020.102795⟩. ⟨hal-02900353⟩
  • Ville Vestman, Tomi Kinnunen, Rosa Hautamäki, Md Sahidullah. Voice Mimicry Attacks Assisted by Automatic Speaker Verification. Computer Speech and Language, Elsevier, 2019, 59, pp.36-54. ⟨10.1016/j.csl.2019.05.005⟩. ⟨hal-02161773⟩
  • Arnab Poddar, Md Sahidullah, Goutam Saha. Quality Measures for Speaker Verification with Short Utterances. Digital Signal Processing, Elsevier, 2019, 88, pp.66-79. ⟨10.1016/j.dsp.2019.01.023⟩. ⟨hal-01998376⟩

Conference papers18 documents

  • Spandan Dey, Goutam Saha, Md Sahidullah. Cross-Corpora Language Recognition: A Preliminary Investigation with Indian Languages. EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin / Virtual, Ireland. ⟨hal-03223314⟩
  • Premjeet Singh, Goutam Saha, Md Sahidullah. Deep scattering network for speech emotion recognition. EUSIPCO 2021 – 29th European Signal Processing Conference, Aug 2021, Dublin, Ireland. ⟨hal-03218278⟩
  • Premjeet Singh, Goutam Saha, Md Sahidullah. Non-linear frequency warping using constant-Q transformation for speech emotion recognition. 2021 International Conference on Computer Communication and Informatics (ICCCI -2021), Jan 2021, Coimbatore, India. ⟨10.1109/ICCCI50826.2021.9402569⟩. ⟨hal-03134015⟩
  • A. Kishore Kumar, Shefali Waldekar, Goutam Saha, Md Sahidullah. Domain-Dependent Speaker Diarization for the Third DIHARD Challenge. The Third DIHARD Speech Diarization Challenge Workshop, Jan 2021, Virtual, France. ⟨hal-03117843⟩
  • Tomi Kinnunen, Andreas Nautsch, Md Sahidullah, Nicholas Evans, Xin Wang, et al.. Visualizing Classifier Adjacency Relations: A Case Study in Speaker Verification and Voice Anti-Spoofing. INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨hal-03261467⟩
  • Xuechen Liu, Md Sahidullah, Tomi Kinnunen. Learnable MFCCs for Speaker Verification. 2021 IEEE International Symposium on Circuits and Systems (ISCAS), May 2021, Daegu, South Korea. ⟨10.1109/ISCAS51556.2021.9401593⟩. ⟨hal-03139532⟩
  • Md Sahidullah, Achintya Kumar Sarkar, Ville Vestman, Xuechen Liu, Romain Serizel, et al.. UIAI System for Short-Duration Speaker Verification Challenge 2020. SLT 2021 - IEEE Spoken Language Technology Workshop, IEEE, Jan 2021, Shenzhen / Virtual, China. ⟨10.1109/SLT48900.2021.9383596⟩. ⟨hal-02907037v2⟩
  • Bhusan Chettri, Rosa Hautamäki, Md Sahidullah, Tomi Kinnunen. Data Quality as Predictor of Voice Anti-Spoofing Generalization. INTERSPEECH 2021, Aug 2021, Brno, Czech Republic. ⟨hal-03261131⟩
  • Xuechen Liu, Md Sahidullah, Tomi Kinnunen. A Comparative Re-Assessment of Feature Extractors for Deep Speaker Embeddings. INTERSPEECH 2020, Oct 2020, Shanghai, China. ⟨hal-02909105⟩
  • Brij Mohan Lal Srivastava, Nathalie Vauquier, Md Sahidullah, Aurélien Bellet, Marc Tommasi, et al.. Evaluating Voice Conversion-based Privacy Protection against Informed Attackers. ICASSP 2020 - 45th International Conference on Acoustics, Speech, and Signal Processing, IEEE Signal Processing Society, May 2020, Barcelona, Spain. pp.2802-2806. ⟨hal-02355115v2⟩
  • Massimiliano Todisco, Xin Wang, Ville Vestman, Md Sahidullah, Héctor Delgado, et al.. ASVspoof 2019: Future Horizons in Spoofed and Fake Audio Detection. INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria. ⟨hal-02172099⟩
  • Tomi Kinnunen, Rosa Hautamäki, Ville Vestman, Md Sahidullah. Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. ICASSP 2019 – 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom. ⟨hal-02051701⟩
  • Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, et al.. I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. INTERSPEECH 2019 - 20th Annual Conference of the International Speech Communication Association, Sep 2019, Graz, Austria. ⟨hal-02280151⟩
  • Tomi Kinnunen, Kong Aik Lee, Héctor Delgado, Nicholas Evans, Massimiliano Todisco, et al.. t-DCF: a Detection Cost Function for the Tandem Assessment of Spoofing Countermeasures and Automatic Speaker Verification. Speaker Odyssey 2018 The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d’Olonne, France. ⟨hal-01880306⟩
  • Massimiliano Todisco, Héctor Delgado, Kong Aik Lee, Md Sahidullah, Nicholas Evans, et al.. Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion. Interspeech 2018 - 19th Annual Conference of the International Speech Communication Association, Sep 2018, Hyderabad, India. ⟨10.21437/Interspeech.2018-2289⟩. ⟨hal-01889934⟩
  • Héctor Delgado, Massimiliano Todisco, Md Sahidullah, Nicholas Evans, Tomi Kinnunen, et al.. ASVspoof 2017 Version 2.0: meta-data analysis and baseline enhancements. Odyssey 2018 - The Speaker and Language Recognition Workshop, Jun 2018, Les Sables d'Olonne, France. ⟨hal-01880206⟩
  • Fuming Fang, Junichi Yamagishi, Isao Echizen, Md Sahidullah, Tomi Kinnunen. Transforming acoustic characteristics to deceive playback spoofing countermeasures of speaker verification systems. WIFS 2018 - IEEE International Workshop on Information Forensics and Security, Dec 2018, Hong Kong, Hong Kong SAR China. ⟨hal-01889910⟩
  • Sami Sieranoja, Md Sahidullah, Tomi Kinnunen, Jukka Komulainen, Abdenour Hadid. Audiovisual Synchrony Detection with Optimized Audio Features. ICSIP 2018 - 3rd International Conference on Signal and Image Processing, Jul 2018, Shenzhen, China. ⟨hal-01889918⟩

Book sections1 document

  • Md Sahidullah, Héctor Delgado, Massimiliano Todisco, Tomi Kinnunen, Nicholas Evans, et al.. Introduction to Voice Presentation Attack Detection and Recent Advances. Sébastien Marcel; Mark S. Nixon; Julian Fierrez; Nicholas Evans. Handbook of Biometric Anti-Spoofing: Presentation Attack Detection, Springer, pp.321-361, 2019, Advances in Computer Vision and Pattern Recognition, 978-3-319-92626-1. ⟨10.1007/978-3-319-92627-8_15⟩. ⟨hal-01974528⟩

Preprints, Working Papers, ...4 documents

  • A. Kishore Kumar, Shefali Waldekar, Goutam Saha, Md Sahidullah. ABSP System for The Third DIHARD Challenge. 2021. ⟨hal-03130955⟩
  • Brij Mohan Lal Srivastava, Mohamed Maouche, Md Sahidullah, Emmanuel Vincent, Aurélien Bellet, et al.. Privacy and utility of x-vector based speaker anonymization. 2021. ⟨hal-03197376⟩
  • Md Sahidullah, Jose Patino, Samuele Cornell, Ruiqing Yin, Sunit Sivasankaran, et al.. The Speed Submission to DIHARD II: Contributions & Lessons Learned. 2019. ⟨hal-02352840v2⟩
  • Tomi Kinnunen, Rosa González Hautamäki, Ville Vestman, Md Sahidullah. Can We Use Speaker Recognition Technology to Attack Itself? Enhancing Mimicry Attacks Using Automatic Target Speaker Selection. 2018. ⟨hal-01937767⟩

Reports1 document

  • Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, et al.. I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences. [Research Report] I4U Consortium. 2019. ⟨hal-02174317⟩