allenai / dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
☆540 · Nov 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for dont-stop-pretraining
Users who are interested in dont-stop-pretraining are comparing it to the libraries listed below.
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c… ☆359 · Feb 22, 2022 · Updated 3 years ago
- Longformer: The Long-Document Transformer ☆2,186 · Feb 8, 2023 · Updated 3 years ago
- Code for using and evaluating SpanBERT. ☆903 · Jul 25, 2023 · Updated 2 years ago
- Multi-Task Deep Neural Networks for Natural Language Understanding ☆2,257 · Mar 7, 2024 · Updated last year
- ACL2020 Tutorial: Open-Domain Question Answering ☆835 · Jan 1, 2021 · Updated 5 years ago
- Source code for "Train No Evil: Selective Masking for Task-Guided Pre-Training" ☆70 · Nov 25, 2022 · Updated 3 years ago
- Hyperparameter Search for AllenNLP ☆140 · Mar 6, 2025 · Updated 11 months ago
- Source code and dataset for ACL 2019 paper "ERNIE: Enhanced Language Representation with Informative Entities" ☆1,419 · Jan 10, 2024 · Updated 2 years ago
- BERT-related papers ☆2,042 · Aug 12, 2023 · Updated 2 years ago
- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821 ☆3,643 · Oct 16, 2024 · Updated last year
- LAnguage Model Analysis ☆1,392 · Jul 7, 2024 · Updated last year
- ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators ☆2,370 · Mar 23, 2024 · Updated last year
- Beyond Accuracy: Behavioral Testing of NLP models with CheckList ☆2,048 · Jan 9, 2024 · Updated 2 years ago
- The implementation of DeBERTa ☆2,191 · Sep 29, 2023 · Updated 2 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020) ☆347 · Dec 20, 2022 · Updated 3 years ago
- Shared repository for open-sourced projects from the Google AI Language team. ☆1,746 · Feb 5, 2026 · Updated last week
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o… ☆379 · Apr 21, 2023 · Updated 2 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987). ☆185 · Jun 12, 2023 · Updated 2 years ago
- [ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723 ☆731 · Aug 29, 2022 · Updated 3 years ago
- Library for Knowledge Intensive Language Tasks ☆963 · Mar 31, 2022 · Updated 3 years ago
- PyTorch original implementation of Cross-lingual Language Model Pretraining. ☆2,924 · Feb 14, 2023 · Updated 3 years ago
- Collection of papers and resources for data augmentation for NLP. ☆831 · Aug 12, 2022 · Updated 3 years ago
- Must-read Papers on pre-trained language models. ☆3,365 · Nov 6, 2022 · Updated 3 years ago
- Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo ☆3,106 · May 9, 2024 · Updated last year
- Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer ☆542 · Dec 10, 2021 · Updated 4 years ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ☆6,491 · Jan 14, 2026 · Updated last month
- KnowBert -- Knowledge Enhanced Contextual Word Representations ☆376 · Jun 2, 2020 · Updated 5 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and … ☆317 · May 28, 2020 · Updated 5 years ago
- ☆221 · Jun 8, 2020 · Updated 5 years ago
- Plug and Play Language Model implementation. Allows to steer topic and attributes of GPT-2 models. ☆1,155 · Feb 20, 2024 · Updated last year
- A PyTorch-based knowledge distillation toolkit for natural language processing ☆1,698 · May 8, 2023 · Updated 2 years ago
- Data augmentation for NLP ☆4,645 · Jun 24, 2024 · Updated last year
- A python tool for evaluating the quality of sentence embeddings. ☆2,107 · Mar 19, 2024 · Updated last year
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆650 · Jan 4, 2023 · Updated 3 years ago
- Variational Methods for Pretraining in Resource-limited Environments ☆174 · Jul 29, 2020 · Updated 5 years ago
- LUKE -- Language Understanding with Knowledge-based Embeddings ☆726 · Nov 19, 2023 · Updated 2 years ago
- MASS: Masked Sequence to Sequence Pre-training for Language Generation ☆1,123 · Nov 28, 2022 · Updated 3 years ago
- Understanding the Difficulty of Training Transformers ☆332 · May 31, 2022 · Updated 3 years ago
- This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference" ☆1,628 · Jun 12, 2023 · Updated 2 years ago