Suche

Benoît Sagot

Le projet FREEM : ressources, outils et enjeux pour l’étude du français d’Ancien Régime
BERTrade: Using Contextual Embeddings to Parse Old French
From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Ungoliant: An Optimized Pipeline for the Generation of a Very Large-Scale Multilingual Web Corpus
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
CamemBERT: a Tasty French Language Model
Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
Les modèles de langue contextuels Camembert pour le Français : impact de la taille et de l'hétérogénéité des données d'entrainement
Establishing a New State-of-the-Art for French Named Entity Recognition
French Contextualized Word-Embeddings with a sip of CaBeRnet: a New French Balanced Reference Corpus
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Preparing the Dictionnaire Universel for Automatic Enrichment

Published with Wowchemy — the free, open source website builder that empowers creators.