bminixhofer / gerpt2

German small and large versions of GPT2.

☆20

Alternatives and similar repositories for gerpt2

Users who are interested in gerpt2 are comparing it to the repositories listed below.

ccasimiro88 / TranslateAlignRetrieve
Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.
☆59 · Updated 2 years ago
infinitylogesh / mutate
A library to synthesize text datasets using Large Language Models (LLM)
☆151 · Updated 2 years ago
patil-suraj / onnx_transformers
Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.
☆127 · Updated 4 years ago
gsarti / t5-flax-gcp
Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP
☆58 · Updated 3 years ago
ofirpress / shortformer
Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.
☆147 · Updated 4 years ago
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆189 · Updated 4 years ago
wietsedv / gpt2-recycle
As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)
☆48 · Updated 4 years ago
allenai / gooaq
Question-answers, collected from Google
☆129 · Updated 4 years ago
CPJKU / wechsel
Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
☆85 · Updated last year
mhagiwara / xfspell
xfspell — the Transformer Spell Checker
☆189 · Updated 5 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆156 · Updated last year
google-research-datasets / Disfl-QA
A Benchmark Dataset for Understanding Disfluencies in Question Answering
☆64 · Updated 4 years ago
feralvam / easse
Easier Automatic Sentence Simplification Evaluation
☆162 · Updated 2 years ago
Georgetown-IR-Lab / ExtendedSumm
On Generating Extended Summaries of Long Documents
☆78 · Updated 4 years ago
midas-research / bhaav
Dataset of sentences from Hindi stories tagged with different emotion tags
☆11 · Updated 5 years ago
facebookresearch / muss
Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".
☆99 · Updated 2 years ago
facebookresearch / access
Code to reproduce the experiments from the paper.
☆102 · Updated 2 years ago
jwieting / paraphrastic-representations-at-scale
☆75 · Updated 4 years ago
anton-l / wav2vec-toolkit
A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models
☆31 · Updated 4 years ago
helboukkouri / character-bert
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
☆201 · Updated 2 years ago
lucidrains / charformer-pytorch
Implementation of the GBST block from the Charformer paper, in Pytorch
☆118 · Updated 4 years ago
thompsonb / prism
MT Evaluation in Many Languages via Zero-Shot Paraphrasing
☆102 · Updated last year
D2KLab / ZeSTE
Explainable Zero-Shot Topic Extraction
☆63 · Updated last year
simonepri / lm-scorer
📃 Language Model based sentences scoring library
☆309 · Updated 3 years ago
kaustubhdhole / natural-dont-know
Code for the paper: Saying No is An Art: Contextualized Fallback Responses for Unanswerable Dialogue Queries
☆19 · Updated 3 years ago
hellohaptik / HINT3
This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…
☆33 · Updated 4 years ago
MicrosoftTranslator / NTREX
NTREX -- News Test References for MT Evaluation
☆85 · Updated last year
tigerchen52 / GLADIS
GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)
☆18 · Updated last year
dbmdz / berts
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models
☆154 · Updated 2 years ago
amazon-science / contrastive-controlled-mt
Code and data for the IWSLT 2022 shared task on Formality Control for SLT
☆21 · Updated 2 years ago