AntoineSimoulin / gpt-frLinks
Generative Pretrained Transformers for French
☆28Updated 2 years ago
Alternatives and similar repositories for gpt-fr
Users that are interested in gpt-fr are comparing it to the libraries listed below
Sorting:
- A french sequence to sequence pretrained model☆61Updated 2 years ago
- The French summarization dataset introduced in "BARThez: a Skilled Pretrained French Sequence-to-Sequence Model".☆22Updated 4 years ago
- German small and large versions of GPT2.☆20Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 2 years ago
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated last year
- A tiny BERT for low-resource monolingual models☆31Updated 9 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆36Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…☆16Updated 3 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆94Updated last year
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.☆36Updated 2 years ago
- GPT-2 French demo | Démo française de GPT-2☆68Updated 4 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆16Updated last year
- This is a neural spell checker☆65Updated 2 years ago
- Explainable Zero-Shot Topic Extraction☆62Updated 10 months ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆80Updated 11 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- Scripts to create speech corpora from open.bible☆13Updated 3 years ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.☆17Updated 3 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆36Updated 2 years ago