Can Cui
4
Documents
Présentation
**PhD Student · Inria · CDD**
Thesis in preparation: Joint embedded speech separation, diarization and recognition for the automatic generation of meeting minutes.
The automatic generation of meeting minutes thus involves solving a set of tasks: i) segmenting the signal according to the number of speakers and who is speaking at each time (diarisation), ii) separating overlapping speech signals \[3\] and enhancing them with respect to ambient noise and reverberation, iii) ensuring the robustness of ASR with respect to diarization errors and signal distortions introduced by separation and enhancement, and iv) removing disfluencies from the word-for-word transcription in order to obtain readable minutes. The objective of this PhD is to design a system which can jointly address the first three tasks given a single-channel or a multichannel signal and which can be embedded in a device with limited computing power (for example a mobile phone), while being able to compete with current Cloud-based technologies.
**AI Intern · Orange · Stage**
Direction: Cognitive Computing Competence Center (CCIC) within the Information System Department (DSI)
Subject: Detection of weak signals within customer conversations in a call center
Mission: Analysis of voice conversations
Activities: Transformation and construction of abstract and extractive summaries; Classifications of conversations with regard to the emotions within the conversations; Extraction of themes from conversations
**Sorbonne University**
Master in language and computer science
**Sorbonne University**
Master in Linguistics
Publications
|
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2023), Dec 2023, Taipei, Taiwan. ⟨10.1109/ASRU57964.2023.10389729⟩
Communication dans un congrès
hal-04235774v1
|
|
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature AnalysisRencontre des Jeunes Chercheurs en Parole 2023 - 10E Edition, Nov 2023, Grenoble, France
Poster de conférence
hal-04321252v1
|
|
Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications2024
Pré-publication, Document de travail
hal-04495886v1
|
|
End-to-end Joint Rich and Normalized ASR with a limited amount of rich training data2023
Pré-publication, Document de travail
hal-04304642v1
|