    Number of documents

    71

    Laurent Romary


    Inria, Directeur de Recherche - Senior Researcher

    • DARIAH EU infrastructure, director
    • ISO/TC 37, chair (ISO TC 37/SC 4/WG 2, convenor)

    Automatic Language Modelling and ANAlysis & Computational Humanities   

    Journal articles (8 documents)

    • Andrea Bertino, Luca Foppiano, Laurent Romary, Pierre Mounier. Leveraging Concepts in Open Access Publications. Journal of Data Mining and Digital Humanities, Episciences.org, 2020, 2019. ⟨hal-01981922v3⟩
    • Detlef Reineke, Laurent Romary. Bridging the gap between SKOS and TBX. edition - Die Fachzeitschrift für Terminologie, Deutscher Terminologie-Tag e.V. (DTT), 2019, Begriffssysteme und ihre Darstellung, 19 (2). ⟨hal-02398820⟩
    • Laurent Romary, Charles Riondet. Towards multiscale archival digital data. Umanistica digitale, AIUCD - Associazione per l’Informatica Umanistica e la Cultura Digitale, 2019, ⟨10.6092/issn.2532-8816/9045⟩. ⟨hal-01586389⟩
    • Charles Riondet, Laurent Romary. The Standardization Survival Kit: for a Wider Use of Metadata Standards within Arts and Humanities. Archives et Bibliothèques de Belgique - Archief- en Bibliotheekwezen in België, Archief, 2018, Trust and Understanding: the value of metadata in a digitally joined-up world, ed. by R. Depoortere, T. Gheldof, D. Styven and J. Van Der Eycken, 106, pp.55-62. ⟨hal-02124679⟩
    • Jack Bowers, Laurent Romary. Bridging the Gaps between Digital Humanities, Lexicography, and Linguistics: A TEI Dictionary for the Documentation of Mixtepec-Mixtec. Dictionaries: Journal of the Dictionary Society of North America, Dictionary Society of North America, 2018, 39 (2), pp.79-106. ⟨hal-01968871⟩
    • Laurent Romary, Charles Riondet. EAD-ODD: A solution for project-specific EAD schemes. Archival Science, Springer Verlag, 2018, ⟨10.1007/s10502-018-9290-y⟩. ⟨hal-01737568v2⟩
    • Jack Bowers, Laurent Romary. Deep encoding of etymological information in TEI. Journal of the Text Encoding Initiative, TEI Consortium, 2017, ⟨10.4000/jtei.1643⟩. ⟨hal-01296498v2⟩
    • Jennifer Edmond, Frank Fischer, Michael Mertens, Laurent Romary. The DARIAH ERIC: Redefining Research Infrastructure for the Arts and Humanities in the Digital Age. ERCIM News, ERCIM, 2017, Digital Humanities. ⟨hal-01588665⟩

    Conference papers (34 documents)

    • Pedro Javier Ortiz Suárez, Laurent Romary, Benoît Sagot. A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages. ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle, United States. ⟨hal-02863875v2⟩
    • Pedro Javier Ortiz Suárez, Yoann Dupont, Benjamin Muller, Laurent Romary, Benoît Sagot. Establishing a New State-of-the-Art for French Named Entity Recognition. LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. ⟨hal-02617950v2⟩
    • Fahad Khan, Laurent Romary, Ana Salgado, Jack Bowers, Mohamed Khemakhem, et al.. Modelling Etymology in LMF/TEI: The Grande Dicionário Houaiss da Língua Portuguesa Dictionary as a Use Case. LREC 2020 - 12th Language Resources and Evaluation Conference, May 2020, Marseille, France. ⟨hal-02618067⟩
    • Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, et al.. Les modèles de langue contextuels Camembert pour le français : impact de la taille et de l'hétérogénéité des données d'entrainement. JEP-TALN-RECITAL 2020 - 33ème Journées d’Études sur la Parole, 27ème Conférence sur le Traitement Automatique des Langues Naturelles, 22ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, Jun 2020, Nancy, France. pp.54-65. ⟨hal-02784755v3⟩
    • Mohamed Khemakhem, Simon Gabay, Béatrice Joyeux-Prunel, Laurent Romary, Léa Saint-Raymond, et al.. Information Extraction Workflow for Digitised Entry-based Documents. DARIAH Annual event 2020, May 2020, Zagreb, Croatia. ⟨hal-02508549⟩
    • Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, et al.. CamemBERT: a Tasty French Language Model. ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Jul 2020, Seattle, United States. ⟨hal-02889805⟩
    • Mohamed Khemakhem, Ioana Galleron, Geoffrey Williams, Laurent Romary, Pedro Javier Ortiz Suárez. How OCR Performance can Impact on the Automatic Extraction of Dictionary Content Structures. 19th annual Conference and Members’ Meeting of the Text Encoding Initiative Consortium (TEI) - What is text, really? TEI and beyond, Sep 2019, Graz, Austria. ⟨hal-02263276⟩
    • Laurent Romary. The place of lexicography in (computer) science. The Future of Academic Lexicography: Linguistic Knowledge Codification in the Era of Big Data and AI, Frieda Steurs; Dirk Geeraerts; Niels Schiller; Marian Klamer; Iztok Kosem, Nov 2019, Leiden, Netherlands. ⟨hal-02358218⟩
    • Sheena Bassett, Leon Wessels, Steven Krauwer, Bente Maegaard, Hella Hollander, et al.. Connecting the Humanities through Research Infrastructures. 4th Digital Humanities in the Nordic Countries (DHN 2019), Mar 2019, Copenhagen, Denmark. ⟨hal-02047512⟩
    • Jack Bowers, Laurent Romary. TEI and the Mixtepec-Mixtec corpus: data integration, annotation and normalization of heterogeneous data for an under-resourced language. 6th International Conference on Language Documentation and Conservation (ICLDC), Feb 2019, Honolulu, United States. ⟨hal-02075475⟩
    • Anas Khan, Hervé Bohbot, Francesca Frontini, Mohamed Khemakhem, Laurent Romary. Historical Dictionaries as Digital Editions and Connected Graphs: the Example of Le Petit Larousse Illustré. Digital Humanities 2019, Jul 2019, Utrecht, Netherlands. ⟨hal-02111199⟩
    • Pedro Javier Ortiz Suárez, Benoît Sagot, Laurent Romary. Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures. 7th Workshop on the Challenges in the Management of Large Corpora (CMLC-7), Jul 2019, Cardiff, United Kingdom. ⟨10.14618/IDS-PUB-9021⟩. ⟨hal-02148693⟩
    • Lucie Rondeau Du Noyer, Simon Gabay, Mohamed Khemakhem, Laurent Romary. Scaling up Automatic Structuring of Manuscript Sales Catalogues. TEI 2019: What is text, really? TEI and beyond, Sep 2019, Graz, Austria. ⟨hal-02272962⟩
    • Hervé Bohbot, Francesca Frontini, Fahad Khan, Mohamed Khemakhem, Laurent Romary. Nénufar: Modelling a Diachronic Collection of Dictionary Editions as a Computational Lexical Resource. ELEX 2019: smart lexicography, Oct 2019, Sintra, Portugal. ⟨hal-02272978⟩
    • Jack Bowers, Mohamed Khemakhem, Laurent Romary. TEI Encoding of a Classical Mixtec Dictionary Using GROBID-Dictionaries. ELEX 2019: Smart Lexicography, Oct 2019, Sintra, Portugal. ⟨hal-02264033⟩
    • Pedro Javier Ortiz Suárez, Laurent Romary, Benoît Sagot. Preparing the Dictionnaire Universel for Automatic Enrichment. 10th International Conference on Historical Lexicography and Lexicology (ICHLL), Jun 2019, Leeuwarden, Netherlands. ⟨hal-02131598⟩
    • Luca Foppiano, Laurent Romary, Masashi Ishii, Mikiko Tanifuji. Automatic Identification and Normalisation of Physical Measurements in Scientific Literature. DocEng '19 - ACM Symposium on Document Engineering 2019, Sep 2019, Berlin, Germany. pp.1-4, ⟨10.1145/3342558.3345411⟩. ⟨hal-02294424v2⟩
    • Laurent Romary, Mohamed Khemakhem, Fahad Khan, Jack Bowers, Nicoletta Calzolari, et al.. LMF Reloaded. AsiaLex 2019: Past, Present and Future, Jun 2019, Istanbul, Turkey. ⟨hal-02118319⟩
    • Hervé Bohbot, Francesca Frontini, Giancarlo Luxardo, Mohamed Khemakhem, Laurent Romary. Presenting the Nénufar Project: a Diachronic Digital Edition of the Petit Larousse Illustré. GLOBALEX 2018 - Globalex workshop at LREC2018, May 2018, Miyazaki, Japan. pp.1-6. ⟨hal-01728328⟩
    • Mohamed Khemakhem, Axel Herold, Laurent Romary. Enhancing Usability for Automatically Structuring Digitised Dictionaries. GLOBALEX workshop at LREC 2018, May 2018, Miyazaki, Japan. ⟨hal-01708137v2⟩
    • Hajer Maraoui, Kais Haddar, Laurent Romary. Segmentation tool for hadith corpus to generate TEI encoding. 4th International Conference on Advanced Intelligent Systems and Informatics (AISI’18), Sep 2018, Cairo, Egypt. ⟨hal-01794105⟩
    • Marie Puren, Charles Riondet, Laurent Romary, Dorian Seillier, Lionel Tadjou. The Standardization Survival Kit (SSK): Bringing best practices to research communities in the Humanities. Digital Humanities Benelux 2018, Jun 2018, Amsterdam, Netherlands. ⟨hal-01850075⟩
    • Luca Foppiano, Laurent Romary. entity-fishing: a DARIAH entity recognition and disambiguation service. Digital Scholarship in the Humanities, Sep 2018, Tokyo, Japan. ⟨hal-01812100⟩
    • Mohamed Khemakhem, Laurent Romary, Simon Gabay, Hervé Bohbot, Francesca Frontini, et al.. Automatically Encoding Encyclopedic-like Resources in TEI. The annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan. ⟨hal-01819505⟩
    • Jack Bowers, Laurent Romary. Encoding Mixtepec-Mixtec Etymology in TEI. TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan. ⟨hal-02003975⟩
    • David Lindemann, Mohamed Khemakhem, Laurent Romary. Retro-digitizing and Automatically Structuring a Large Bibliography Collection. European Association for Digital Humanities (EADH) Conference, EADH, Dec 2018, Galway, Ireland. ⟨hal-01941534⟩
    • Jack Bowers, Axel Herold, Laurent Romary. TEI-Lex0 Etym -towards terse(r) recommendations for the encoding of etymological information. TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan. ⟨hal-02075506⟩
    • Laurent Romary, Toma Tasovac. TEI Lex-0: A Target Format for TEI-Encoded Dictionaries and Lexical Resources. TEI Conference and Members' Meeting, Sep 2018, Tokyo, Japan. ⟨hal-02265312⟩
    • Andrea Bertino, Luca Foppiano, Laurent Romary, Pierre Mounier. Leveraging Concepts in Open Access Publications. PUBMET 2018 - 5th Conference on Scholarly Publishing in the Context of Open Science, Sep 2018, Zadar, Croatia. ⟨hal-01900303⟩
    • Mohamed Khemakhem, Carmen Brando, Laurent Romary, Frédérique Mélanie-Becquet, Jean-Luc Pinol. Fueling Time Machine: Information Extraction from Retro-Digitised Address Directories. JADH2018 "Leveraging Open Data", Sep 2018, Tokyo, Japan. ⟨hal-01814189⟩
    • Mohamed Khemakhem, Luca Foppiano, Laurent Romary. Automatic Extraction of TEI Structures in Digitized Lexical Resources using Conditional Random Fields. electronic lexicography, eLex 2017, Sep 2017, Leiden, Netherlands. ⟨hal-01508868v2⟩
    • Hajer Maraoui, Kais Haddar, Laurent Romary. Encoding prototype of Al-Hadith Al-Shareef in TEI. ICALP 2017 - The 6th International Conference on Arabic Language Processing, Oct 2017, Fes, Morocco. pp.14. ⟨hal-01574543⟩
    • Anne Baillot, Marie Puren, Charles Riondet, Dorian Seillier, Laurent Romary. Access to cultural heritage data. A challenge for digital humanities. Digital Humanities 2017, Aug 2017, Montréal, Canada. ⟨hal-01582176⟩
    • Stefan Pernes, Laurent Romary, Kara Warburton. TBX in ODD: Schema-agnostic specification and documentation for TermBase eXchange. LOTKS 2017- Workshop on Language, Ontology, Terminology and Knowledge Structures, Sep 2017, Montpellier, France. ⟨hal-01581440v2⟩

    Poster communications (6 documents)

    • Laurent Romary, Damien Biabiany, Klaus Illmayer, Marie Puren, Charles Riondet, et al.. SSK by example - Make your Arts and Humanities research go standard. DARIAH Annual Event, May 2019, Warsaw, Poland. ⟨hal-02151788⟩
    • Marie Puren, Charles Riondet, Laurent Romary, Dorian Seillier, Lionel Tadjou. The SSK. Make your Arts and Humanities research go standard. TEI inside !. TEI2018 - Annual TEI Conference and Members Meeting, Sep 2018, Tokyo, Japan. 2018. ⟨hal-01902702v2⟩
    • Hervé Bohbot, Alexandre Fauchere, Francesca Frontini, Agata Jackiewicz, Giancarlo Luxardo, et al.. A Diachronic Digital Edition of the Petit Larousse illustré. Journée d'étude CORLI : Traitements et standardisation des corpus multimodaux et web 2.0., May 2018, Paris, France. ⟨hal-01873805⟩
    • Aria Adli, Eric Engel, Laurent Romary, Fahime Same. A stand-off XML-TEI representation of reference annotation. DGfS 2018: 40. Jahrestagung der Deutschen Gesellschaft für Sprachwissenschaft, Mar 2018, Stuttgart, Germany. 2017. ⟨hal-01876327⟩
    • Marie Puren, Charles Riondet, Laurent Romary, Dorian Seillier, Lionel Tadjou. SSK by example. Make your Arts and Humanities research go standard. Digital Humanities 2018 : "Bridges/Puentes", Jun 2018, Mexico City, Mexico. ⟨hal-01848882⟩
    • Marie Puren, Charles Riondet, Dorian Seillier, Laurent Romary. The Standardization Survival Kit (SSK): For a wider use of standards within Arts and Humanities. Digital Humanities Benelux Conference 2017, Jul 2017, Utrecht, Netherlands. 2017. ⟨hal-01587687⟩

    Documents associated with scientific events (8 documents)

    • Laurent Romary. TEI guidelines: born to be open. ACDH-CH : Austrian Centre for Digital Humanities and Cultural Heritage Lectures, Jun 2020, Vienna, Austria. Lecture (6.1). ⟨hal-02864525⟩
    • Laurent Romary. The TEI as a modeling infrastructure: TEI beyond the TEI realms. Ringvorlesung Digital Humanities, Jul 2019, Paderborn, Germany. ⟨hal-02265036⟩
    • Laurent Romary. Open Access in France: how the call of Jussieu reflects our social, technical and political landscape. Open-Access-Tage, Sep 2018, Graz, Austria. ⟨hal-01881469⟩
    • Laurent Romary. Data Mining Technologies at the service of Open Knowledge. Ringvorlesung "Open Technology for an Open Society”, Jan 2018, Berlin, Germany. pp.1-65, 2018. ⟨hal-01708771⟩
    • Laurent Romary, Jennifer Edmond. Sustainability in DARIAH. Sustainability of Digital Research Infrastructures for the Arts and Humanities, Apr 2017, Berlin, Germany. pp.10. ⟨hal-01516487⟩
    • Laurent Romary. How to Open up? (Digital) Libraries at the Service of (Digital) Scholars. Fiesole Collection Development Retreat, Apr 2017, Villeneuve d'Ascq, France. ⟨hal-01513674⟩
    • Jack Bowers, Laurent Romary. Language Documentation and Standards in Digital Humanities: TEI and the documentation of Mixtepec-Mixtec. JADH 2017: Proceedings of the 7th Conference of Japanese Association for Digital Humanities "Creating Data through Collaboration", Sep 2017, Kyoto, Japan. ⟨hal-01744813⟩
    • Laurent Romary. The Text Encoding Initiative as an Infrastructure. French-Israeli Symposium on Digital Humanities, Feb 2017, Jerusalem, Israel. 2017. ⟨hal-01618017⟩

    Book sections (4 documents)

    • Jennifer Edmond, Laurent Romary. 3. Academic Publishing. Digital Technology and the Practices of Humanities Research, Open Book Publishers, pp.49-80, 2020, 978-1-78374-841-9. ⟨10.11647/OBP.0192.03⟩. ⟨hal-02464616⟩
    • Jennifer Edmond, Frank Fischer, Laurent Romary, Toma Tasovac. 9. Springing the Floor for a Different Kind of Dance: Building DARIAH as a Twenty-First-Century Research Infrastructure for the Arts and Humanities. Digital Technology and the Practices of Humanities Research, Open Book Publishers, pp.207-234, 2020, 978-1-78374-841-9. ⟨10.11647/OBP.0192.09⟩. ⟨hal-02464622⟩
    • Laurent Romary, Jennifer Edmond. A Tangential View on Impact for the Arts and Humanities through the Lens of the DARIAH-ERIC. Bente Maegaard; Riccardo Pozzo. Stay Tuned To The Future - Impact of the Research Infrastructures for Social Sciences and Humanities, Leo S. Olschki Editore, 2019, 978 88 222 6643 9. ⟨hal-02094713⟩
    • Tobias Blanke, Conny Kristel, Laurent Romary. Crowds for Clouds: Recent Trends in Humanities Research Infrastructures. Agiati Benardou; Erik Champion; Costis Dallas; Lorna Hughes. Cultural Heritage Digital Tools and Infrastructures, Routledge, 2018, 978-1-4724-4712-8. ⟨hal-01248562⟩

    Directions of work or proceedings (2 documents)

    • Laurent Romary, Andreas Degkwitz. IFLA Satellite Meeting - Digital Humanities – Opportunities and Risks: Connecting Libraries and Research. Andreas Degkwitz ; Laurent Romary. IFLA Satellite Meeting - Digital Humanities – Opportunities and Risks: Connecting Libraries and Research, Aug 2017, Berlin, Germany. 2017. ⟨hal-01643305⟩
    • Pierre Alliez, Laurent Bergerot, Jean-François Bernard, Clotilde Boust, George Bruseker, et al.. Digital 3D Objects in Art and Humanities: challenges of creation, interoperability and preservation. White paper: A result of the PARTHENOS Workshop held in Bordeaux at Maison des Sciences de l’Homme d’Aquitaine and at Archeovision Lab. (France), November 30th - December 2nd, 2016.. PARTHENOS. Digital 3D Objects in Art and Humanities: challenges of creation, interoperability and preservation, Nov 2016, Bordeaux, France. pp.71, 2017. ⟨hal-01526713v2⟩

    Other publications (2 documents)

    • Detlef Reineke, Laurent Romary. SKOS and TBX vocabularies. 2018. ⟨hal-01883377v3⟩
    • Laurent Romary, Charles Riondet. Ongoing maintenance and customization of archival standards using ODD (EAC-CPF revision proposal). 2017. ⟨hal-01677185⟩

    Preprints, Working Papers, ... (2 documents)

    • Erzsébet Tóth-Czifra, Laurent Romary. The Heritage Data Reuse Charter: from principles to research workflows. 2020. ⟨halshs-02475692⟩
    • Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, et al.. CamemBERT: a Tasty French Language Model. 2019. ⟨hal-02445946⟩

    Reports (4 documents)

    • Charles Riondet, Dorian Seillier, Lionel Tadjou, Laurent Romary. Standardization Survival Kit (Final). [Technical Report] Inria Paris. 2018. ⟨hal-01513531v2⟩
    • Laurent Romary, Piotr Banski, Jack Bowers, Emiliano Degl’innocenti, Matej Ďurčo, et al.. Report on Standardization (draft). [Technical Report] Deliverable 4.2, Inria. 2017. ⟨hal-01560563⟩
    • Charles Riondet, Laurent Romary, Annelies van Nispen, Kepa Joseba Rodriguez, Mike Bryant. Report on Standards. [Contract] D.11.4, Inria Paris. 2017. ⟨hal-01503235⟩
    • Loïc Bertrand, Sophie David, Serge X. Cohen, Susanna Holowati, Marie Puren, et al.. First edition of the web directory of IPERION CH instruments and databases. [Technical Report] D.2.3, IPANEMA. 2016. ⟨hal-02138440⟩

    Lectures (1 document)

    • Simon Gabay, Mohamed Khemakhem, Laurent Romary. Les catalogues et GROBID. Doctorat. Du catalogue aux humanités numériques : quelles méthodes pour quels résultats ?, Paris, France. 2018. ⟨cel-01951107⟩