Follow
Benoît Sagot
Benoît Sagot
Directeur de recherches at Inria, head of the ALMAnaCH team
Verified email at inria.fr - Homepage
Title
Cited by
Cited by
Year
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
15252023
What does BERT learn about the structure of language?
G Jawahar, B Sagot, D Seddah
57th Annual Meeting of the Association for Computational Linguistics (ACL …, 2019
14872019
CamemBERT: a Tasty French Language Model
L Martin, B Muller, PJ Ortiz Suárez, Y Dupont, L Romary, ...
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
11912020
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
PJ Ortiz Suárez, B Sagot, L Romary
Challenges in the Management of Large Corpora (CMLC-7) 2019, 9, 2019
458*2019
The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French
B Sagot
LREC 2010, 2010
303*2010
Building a free French wordnet from multilingual resources
B Sagot, D Fišer
Ontolex 2008, 2008
2532008
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
PJ Ortiz Suárez, L Romary, B Sagot
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
230*2020
Controllable sentence simplification
L Martin, B Sagot, E de la Clergerie, A Bordes
arXiv preprint arXiv:1910.02677, 2019
1752019
Coupling an annotated corpus and a morphosyntactic lexicon for state-of-the-art POS tagging with less human effort
P Denis, B Sagot
PACLIC 2009, 2009
1722009
MUSS: Multilingual unsupervised sentence simplification by mining paraphrases
L Martin, A Fan, E De La Clergerie, A Bordes, B Sagot
arXiv preprint arXiv:2005.00352, 2020
159*2020
Towards a cleaner document-oriented multilingual crawled corpus
J Abadji, PO Suarez, L Romary, B Sagot
arXiv preprint arXiv:2201.06642, 2022
1562022
ASSET: A Dataset for Tuning and Evaluation of Sentence Simplification Models with Multiple Rewriting Transformations
F Alva-Manchego, L Martin, A Bordes, C Scarton, B Sagot, L Specia
Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020
1452020
Universal dependencies 2.5
D Zeman, J Nivre, et al.
LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied …, 2020
1312020
When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models
B Muller, A Anastasopoulos, B Sagot, D Seddah
arXiv preprint arXiv:2010.12858, 2020
1302020
Quality at a glance: An audit of web-crawled multilingual datasets
J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ...
Transactions of the Association for Computational Linguistics 10, 50-72, 2022
1212022
The Lefff 2 syntactic lexicon for French: architecture, acquisition, use
B Sagot, L Clément, E de La Clergerie, P Boullier
LREC 2006, 2006
1132006
Influence of pre-annotation on POS-tagged corpus development
K Fort, B Sagot
The fourth ACL linguistic annotation workshop, 56-63, 2010
1062010
Between words and characters: A brief history of open-vocabulary modeling and tokenization in NLP
SJ Mielke, Z Alyafeai, E Salesky, C Raffel, M Dey, M Gallé, A Raja, C Si, ...
arXiv preprint arXiv:2112.10508, 2021
102*2021
Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging
P Denis, B Sagot
Language resources and evaluation 46 (4), 721-736, 2012
982012
Generative Spoken Dialogue Language Modeling
TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ...
arXiv preprint arXiv:2203.16502, 2022
892022
The system can't perform the operation now. Try again later.
Articles 1–20