Julien Abadji
2 documents
Researcher identifiers
- julien-abadji
- 0000-0003-4006-8045
About
Interested in software engineering for the generation and filtering of very large multilingual corpora.
Publications
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus. Thirteenth Language Resources and Evaluation Conference - LREC 2022, Jun 2022, Marseille, France
Conference paper
hal-03536361v1
Ungoliant: An Optimized Pipeline for the Generation of a Very Large-Scale Multilingual Web Corpus. CMLC 2021 - 9th Workshop on Challenges in the Management of Large Corpora, Jul 2021, Limerick / Virtual, Ireland. ⟨10.14618/ids-pub-10468⟩
Conference paper
hal-03301590v1