uds-lsv / bert-stable-fine-tuning
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆134Updated last year
Alternatives and similar repositories for bert-stable-fine-tuning:
Users that are interested in bert-stable-fine-tuning are comparing it to the libraries listed below
- Hyperparameter Search for AllenNLP☆134Updated last month
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- Implementation of Mixout with PyTorch☆74Updated 2 years ago
- A BART version of an open-domain QA model in a closed-book setup☆120Updated 4 years ago
- QED: A Framework and Dataset for Explanations in Question Answering☆115Updated 3 years ago
- This repository contains the code for "BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Representations".☆63Updated 4 years ago
- ☆74Updated 3 years ago
- Code and datasets of "Multilingual Extractive Reading Comprehension by Runtime Machine Translation"☆40Updated 6 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Implementation of Marge, Pre-training via Paraphrasing, in Pytorch☆75Updated 4 years ago
- For the code release of our arXiv paper "Revisiting Few-sample BERT Fine-tuning" (https://arxiv.org/abs/2006.05987).☆184Updated last year
- This repository contains the code for running the character-level Sandwich Transformers from our ACL 2020 paper on Improving Transformer …☆55Updated 4 years ago
- Fine-tune transformers with pytorch-lightning☆44Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- A library to conduct ranking experiments with transformers.☆161Updated last year
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago
- The implementation of the papers on dual learning of natural language understanding and generation. (ACL2019,2020; Findings of EMNLP 2020…☆66Updated 4 years ago
- ☆47Updated 4 years ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆361Updated 2 years ago
- ☆92Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated last year
- CharBERT: Character-aware Pre-trained Language Model (COLING2020)☆119Updated 4 years ago
- SUPERT: Unsupervised multi-document summarization evaluation & generation☆94Updated 2 years ago
- Anserini notebooks☆69Updated last year
- Fork of huggingface/pytorch-pretrained-BERT for BERT on STILTs☆106Updated 2 years ago
- Example code, data, and commands for the AllenNLP guide☆47Updated 3 years ago
- Few-shot NLP benchmark for unified, rigorous eval☆91Updated 2 years ago
- SacreROUGE is a library dedicated to the use and development of text generation evaluation metrics with an emphasis on summarization.☆140Updated 2 years ago