IntelLabs / academic-budget-bert
Repository containing code for the paper "How to Train BERT with an Academic Budget"
☆310Updated last year
Alternatives and similar repositories for academic-budget-bert:
Users who are interested in academic-budget-bert are comparing it to the libraries listed below.
- Understanding the Difficulty of Training Transformers☆328Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)☆325Updated last year
- Search Engines with Autoregressive Language models☆281Updated last year
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆429Updated 2 years ago
- ☆316Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆272Updated 2 years ago
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆371Updated 10 months ago
- Prune a model while finetuning or training.☆398Updated 2 years ago
- Interpretable Evaluation for AI Systems☆361Updated last year
- Adversarial Natural Language Inference Benchmark☆393Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them"☆202Updated 3 years ago
- ☆490Updated 11 months ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆133Updated last year
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆154Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated last year
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆293Updated 2 years ago
- Scalable training for dense retrieval models.☆273Updated last year
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆779Updated 8 months ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆221Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆361Updated 2 years ago
- Research code for pixel-based encoders of language (PIXEL)☆334Updated 10 months ago
- ☆460Updated 3 years ago
- Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.☆279Updated 6 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o…☆604Updated 2 years ago
- ☆294Updated 2 years ago
- FastFormers - highly efficient transformer models for NLU☆703Updated last year
- EMNLP 2021 - Pre-training architectures for dense retrieval☆244Updated 2 years ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆201Updated 2 years ago