Search - HAL open archive

146 results

MELHISSA: a multilingual entity linking architecture for historical press articles

Elvys Linhares Pontes , Luis Adrián Cabrera-Diego , Jose G. Moreno , Emanuela Boros , Ahmed Hamdi , et al.
International Journal on Digital Libraries, 2022, 23 (2), pp.133-160. ⟨10.1007/s00799-021-00319-6⟩
Journal article hal-03885133v1

Enhancing Table of Contents Extraction by System Aggregation

Thi-Tuyet-Hai Nguyen , Antoine Doucet , Mickaël Coustaty
The 14th IAPR International Conference on Document Analysis and Recognition (ICDAR2017), Nov 2017, Kyoto, Japan. pp.242-247, ⟨10.1109/ICDAR.2017.48⟩
Conference paper hal-02568946v1

Identification of Microblogs Prominent Users during Events by Learning Temporal Sequences of Features

Imen Bizid , Nibal Nayef , Patrice Boursier , Sami Faiz , Antoine Doucet
The 24th ACM International Conference on Information and Knowledge Management (CIKM2015), Oct 2015, Melbourne, Australia. pp.1715-1718, ⟨10.1145/2806416.2806612⟩
Conference paper hal-01287168v1

ESAIR '15: Proceedings of the Eighth Workshop on Exploiting Semantic Annotations in Information Retrieval

Krisztian Balog , Jeffrey Dalton , Antoine Doucet , Yusra Ibrahim
Balog, Krisztian and Dalton, Jeffrey and Doucet, Antoine and Ibrahim, Yusra. 24th ACM International Conference on Information and Knowledge Management (CIKM2015), Oct 2015, Melbourne, Australia. ACM, 2015, 978-1-4503-3790-8
Proceedings hal-01294128v1

The PARSEME Shared Task on Automatic Identification of Verbal Multiword Expressions

Agata Savary , Carlos Ramisch , Silvio Ricardo Cordeiro , Federico Sangati , Veronika Vincze , et al.
MWE 2017 - Proceedings of the 13th Workshop on Multiword Expressions, Apr 2017, Valencia, Spain. pp.31-47
Conference paper hal-01504624v1

Utilisation de Séquences Fréquentes Maximales en Recherche d'Information

Antoine Doucet
7th International Conference on the Statistical Analysis of Textual Data (JADT-2004), 2004, Belgium. pp.334-345
Conference paper hal-00324774v1

Data Mining Meets Collocations Discovery

Helena Ahonen-Myka , Antoine Doucet
Inquiries into Words, Constraints and Contexts, Festschrift in the Honour of Kimmo Koskenniemi, CSLI Publications, Center for the Study of Language and Information, University of Stanford, pp.194-203, 2005
Book chapter hal-00324775v1

A method to calculate probability and expected document frequency

Antoine Doucet
SIGIR Forum, 2005, pp.33-40
Journal article hal-00239233v1

ICDAR 2011 Book Structure Extraction Competition

Antoine Doucet , Gabriella Kazai , Jean-Luc Meunier
Eleventh International Conference on Document Analysis and Recognition (ICDAR'2011), Sep 2011, Beijing, China. pp.1501-1505, ⟨10.1109/ICDAR.2011.298⟩
Conference paper hal-01069019v1

Term Association Analysis for Named Entity Filtering

Oskar Gross , Antoine Doucet , Hannu Toivonen
Voorhees, Ellen M. and Buckland, Lori P. Twentieth Text REtrieval Conference, TREC 2012, National Institute of Standards and Technology (NIST), pp.10, 2012
Book chapter hal-01071724v1

Improving Skin-Disease Classification Based on Customized Loss Function Combined With Balanced Mini-Batch Logic and Real-Time Image Augmentation

Tri-Cong Pham , Antoine Doucet , Chi-Mai Luong , Cong-Thanh Tran , Van-Dung Hoang
IEEE Access, 2020, 8, pp.150725-150737. ⟨10.1109/ACCESS.2020.3016653⟩
Journal article hal-03026932v1

Report on INEX 2013

Patrice Bellot , Antoine Doucet , Shlomo Geva , Sairam Gurajada , Jaap Kamps , et al.
SIGIR Forum, 2013, 47 (2), pp.21-32. ⟨10.1145/2568388.2568393⟩
Journal article hal-01447807v1

Neural Networks for Multi-Word Expression Detection

Natalia Klyueva , Antoine Doucet , Milan Straka
Proceedings of the 13th Workshop on Multiword Expressions (MWE 2017), Apr 2017, Valencia, Spain. pp.60-65, ⟨10.18653/v1/W17-1707⟩
Conference paper hal-03025446v1

Can Cross-domain Term Extraction Benefit from Cross-lingual Transfer?

Tran Thi Hong Hanh , Matej Martinc , Antoine Doucet , Senja Pollak
25th International Conference on Discovery Science (DS 2022), Oct 2022, Montpellier, France. ⟨10.1007/978-3-031-18840-4_26⟩
Conference paper hal-04351020v1

Large Scale Analysis of Semantic and Temporal Aspects in Cultural Heritage Collection's Search

Yasunobu Sumikawa , Adam Jatowt , Antoine Doucet , Jean-Philippe Moreux
2019 ACM/IEEE Joint Conference on Digital Libraries (JCDL), Jun 2019, Champaign, United States. pp.77-86, ⟨10.1109/jcdl.2019.00021⟩
Conference paper hal-03025966v1

Adaptive Edit-Distance and Regression Approach for Post-OCR Text Correction

Thi-Tuyet-Hai Nguyen , Mickaël Coustaty , Antoine Doucet , Adam Jatowt , Nhu-Van Nguyen
ICADL 2018: Maturity and Innovation in Digital Libraries, pp.278-289, 2018, ⟨10.1007/978-3-030-04257-8_29⟩
Book chapter hal-02364664v1

Report on INEX 2011

Patrice Bellot , Timothy Chappell , Antoine Doucet , Shlomo Geva , Jaap Kamps , et al.
SIGIR Forum, 2012, 46 (1), pp.33-42. ⟨10.1145/2215676.2215679⟩
Journal article hal-01072069v1

DataTourism : Designing an Architecture to Process Tourism Data

Fayrouz Soualah-Alila , Mickaël Coustaty , Nicolas Rempulski , Antoine Doucet
IFITT and ENTER 2016 Conferences, Feb 2016, Bilbao, Spain. pp.751-763
Conference paper hal-01238379v1

A survey on bipartite graphs embedding

Edward Giamphy , Jean-Loup Guillaume , Antoine Doucet , Kevin Sanchis
Social Network Analysis and Mining, 2023, 13 (1), pp.54. ⟨10.1007/s13278-023-01058-z⟩
Journal article hal-04089238v1

A Multilingual Dataset for Named Entity Recognition, Entity Linking and Stance Detection in Historical Newspapers

Ahmed Hamdi , Elvys Linhares Pontes , Emanuela Boros , Thi Tuyet Hai Nguyen , Günter Hackl , et al.
SIGIR '21: The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, Jul 2021, Virtual Event, Canada. pp.2328-2334, ⟨10.1145/3404835.3463255⟩
Conference paper hal-03418387v1

Adapting Transformers for Detecting Emergency Events on Social Media

Emanuela Boros , Gaël Lejeune , Mickaël Coustaty , Antoine Doucet
14th International Conference on Knowledge Discovery and Information Retrieval, Oct 2022, Valletta, Malta. pp.300-306, ⟨10.5220/0011559800003335⟩
Conference paper hal-03861202v1

Tracking news stories in short messages in the era of infodemic

Guillaume Bernard , Cyrille Suire , Cyril Faucher , Antoine Doucet , Paolo Rosso
Conference and Labs of the Evaluation Forum (CLEF 2022), Università di Bologna, Italy, Sep 2022, Bologna, Italy. pp.18-32, ⟨10.1007/978-3-031-13643-6_2⟩
Conference paper hal-03727200v1

Étude comparative de méthodes de classification multilingue appliquées à l'épidémiologie

Stephen Mutuvi , Emanuela Boros , Antoine Doucet , Gaël Lejeune , Adam Jatowt , et al.
COnférence en Recherche d'Informations et Applications - CORIA 2021, French Information Retrieval Conference, Apr 2021, Grenoble (virtual), France. ⟨10.5281/zenodo.4734472⟩
Conference paper hal-03320343v1

Tentative d'approche multilingue en extraction d'information

Gaël Lejeune , Nadine Lucas , Antoine Doucet
JADT Journées internationales d'Analyse statistique des Données Textuelles, Jun 2010, Rome, Italy. pp.1259-1267
Conference paper hal-01067147v1

EXTIRP: baseline retrieval from Wikipedia

Miro Lehtonen , Antoine Doucet
Proceedings of the Fifth Annual Workshop of the Initiative for the Evaluation of XML retrieval (INEX 2006), Dagstuhl Castle, Germany, December 18-20 2006, Springer, pp.119-124, 2007, Lecture Notes in Computer Science
Book chapter hal-00324995v1

Non-Contiguous Word Sequences for Information Retrieval

Antoine Doucet , Helena Ahonen-Myka
42nd annual meeting of the Association for Computational Linguistics (ACL-2004), Workshop on Multiword Expressions: Integrating Processing, 2004, Spain. pp.88-95
Conference paper hal-00324779v1

Neural Machine Translation with BERT for Post-OCR Error Detection and Correction

Thi Tuyet Hai Nguyen , Adam Jatowt , Nhu-Van Nguyen , Mickaël Coustaty , Antoine Doucet
JCDL '20: The ACM/IEEE Joint Conference on Digital Libraries in 2020, Aug 2020, Virtual Event, China. pp.333-336, ⟨10.1145/3383583.3398605⟩
Conference paper hal-03026937v1

Post-OCR Error Detection by Generating Plausible Candidates

Thi-Tuyet-Hai Nguyen , Adam Jatowt , Mickaël Coustaty , Nhu-Van Nguyen , Antoine Doucet
2019 International Conference on Document Analysis and Recognition (ICDAR), Sep 2019, Sydney, Australia. pp.876-881, ⟨10.1109/ICDAR.2019.00145⟩
Conference paper hal-02518252v1

Archive TimeLine Summarization (ATLS): Conceptual Framework for Timeline Generation over Historical Document Collections

Nicolas Gutehrlé , Antoine Doucet , Adam Jatowt
Proceedings of the 6th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, Oct 2022, Gyeongju, Republic of Korea. pp.13-23
Conference paper hal-03962124v1

A Quantitative Analysis of Noise Impact on Document Ranking

Edward Giamphy , Kevin Sanchis , Gohar Dashyan , Jean-Loup Guillaume , Ahmed Hamdi , et al.
IEEE Conference on Systems, Man, and Cybernetics, Oct 2023, Honolulu, United States
Conference paper hal-04284004v1