richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)
☆330Updated last year
Alternatives and similar repositories for electra_pytorch
Users who are interested in electra_pytorch are comparing it to the libraries listed below.
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆228Updated 2 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated last year
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆314Updated last year
- ☆345Updated 4 years ago
- ☆468Updated 4 years ago
- Minimalist implementation of a BERT Sentence Classifier with PyTorch Lightning, Transformers and PyTorch-NLP.☆219Updated 2 years ago
- Adversarial Natural Language Inference Benchmark☆397Updated 3 years ago
- Enhancing the BERT training with Semi-supervised Generative Adversarial Networks☆228Updated 2 years ago
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆533Updated 3 years ago
- Pytorch Implementation of ALBERT (A Lite BERT for Self-supervised Learning of Language Representations)☆227Updated 4 years ago
- Understanding the Difficulty of Training Transformers☆329Updated 3 years ago
- New dataset☆306Updated 3 years ago
- GeDi: Generative Discriminator Guided Sequence Generation☆211Updated 2 months ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated 2 years ago
- ☆219Updated 5 years ago
- Flexible components pairing 🤗 Transformers with Pytorch Lightning☆610Updated 2 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated last year
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆295Updated 2 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆647Updated 2 years ago
- Neural Text Generation with Unlikelihood Training☆309Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 4 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020)☆345Updated 2 years ago
- Interpretable Evaluation for AI Systems☆367Updated 2 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"☆206Updated 3 years ago
- ☆211Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆204Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆311Updated 5 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆283Updated 2 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆153Updated 3 years ago