Hervé Bredin, Ruiqing yin, Juan Manuel Coria, Gregory Gelly, Pavel Korshunov, et al.. Pyannote.audio: neural building blocks for speaker diarization. IEEE International Conference on Acoustics, Speech, and Signal Processing, May 2020, Barcelona, Spain. ⟨hal-02995345⟩
Hervé Bredin, Gregory Gelly. Improving Speaker Diarization of TV Series using Talking-Face Detection and Clustering. ACM Multimedia 2016, ACM, Jan 2016, Amsterdam, Netherlands. ⟨hal-01836453⟩