Pedro Ortiz Suarez
Pedro Ortiz Suarez
Home
Veröffentlichungen
Vorträge
Projekte
Kontakt
CV
Hell
Dunkel
Automatisch
Deutsch
Deutsch
English
Español
Français
Benoît Sagot
Aktuellste
Le projet FREEM : ressources, outils et enjeux pour l’étude du français d’Ancien Régime
BERTrade: Using Contextual Embeddings to Parse Old French
From FreEM to D'AlemBERT: a Large Corpus and a Language Model for Early Modern French
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets
Ungoliant: An Optimized Pipeline for the Generation of a Very Large-Scale Multilingual Web Corpus
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
A Monolingual Approach to Contextualized Word Embeddings for Mid-Resource Languages
CamemBERT: a Tasty French Language Model
Building a User-Generated Content North-African Arabizi Treebank: Tackling Hell
Les modèles de langue contextuels Camembert pour le Français : impact de la taille et de l'hétérogénéité des données d'entrainement
Establishing a New State-of-the-Art for French Named Entity Recognition
French Contextualized Word-Embeddings with a sip of CaBeRnet: a New French Balanced Reference Corpus
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures
Preparing the Dictionnaire Universel for Automatic Enrichment
Zitieren
×