AntoineSimoulin / gpt-fr
Generative Pretrained Transformers for French
☆27 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for gpt-fr
- A French sequence-to-sequence pretrained model ☆57 · Updated 2 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model". ☆22 · Updated 3 years ago
- An extension package of 🤗 Datasets that provides support for executing arbitrary SQL queries on HF datasets ☆31 · Updated 9 months ago
- German small and large versions of GPT2. ☆20 · Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets. ☆58 · Updated 2 years ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub ☆43 · Updated 5 months ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️ ☆36 · Updated 2 years ago
- Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases". ☆97 · Updated last year
- ☆16 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆92 · Updated last year
- MAFAND-MT ☆54 · Updated 4 months ago
- NTREX -- News Test References for MT Evaluation ☆75 · Updated 5 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆151 · Updated 5 months ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆46 · Updated 3 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆75 · Updated 2 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 · Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of … ☆61 · Updated 4 years ago
- Inference code in PyTorch for GPT-like models, such as PAGnol, a family of models with up to 1.5B parameters, trained on datasets in Fren… ☆20 · Updated 2 years ago
- 🇧🇪 BelGPT-2: the 1st GPT model pretrained in French. ☆33 · Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆56 · Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆52 · Updated last year
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 · Updated last year
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy. ☆103 · Updated 7 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆62 · Updated 8 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 · Updated 2 years ago
- diagNNose is a Python library that provides a broad set of tools for analysing hidden activations of neural models. ☆81 · Updated last year
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆56 · Updated 5 months ago
- Ranking of fine-tuned HF models as base models. ☆35 · Updated last year
- ☆44 · Updated 2 years ago
- 🤗 Disaggregators: Curated data labelers for in-depth analysis. ☆65 · Updated last year