IntelLabs / academic-budget-bert
Repository containing code for "How to Train BERT with an Academic Budget" paper
☆313Updated last year
Alternatives and similar repositories for academic-budget-bert
Users interested in academic-budget-bert are comparing it to the libraries listed below.
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Updated 2 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated!)☆329Updated last year
- Understanding the Difficulty of Training Transformers☆329Updated 2 years ago
- Interpretable Evaluation for AI Systems☆366Updated 2 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- ☆319Updated 3 years ago
- Code and data to support the paper "PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them"☆202Updated 3 years ago
- Prune a model while finetuning or training.☆402Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆154Updated last year
- Search Engines with Autoregressive Language models☆285Updated 2 years ago
- Efficient, check-pointed data loading for deep learning with massive data sets.☆207Updated last year
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆272Updated 2 years ago
- Code for the paper "Are Sixteen Heads Really Better than One?"☆171Updated 5 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆436Updated 3 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…☆313Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆393Updated 3 years ago
- Hyperparameter Search for AllenNLP☆139Updated 2 months ago
- ☆182Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations☆783Updated 11 months ago
- Code for the NAACL 2022 long paper "DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings"☆293Updated 2 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated 2 years ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆226Updated last year
- Code associated with the Don't Stop Pretraining ACL 2020 paper☆530Updated 3 years ago
- A framework for few-shot evaluation of autoregressive language models.☆103Updated 2 years ago
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆96Updated 2 years ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https://arxiv.org/abs/2212.01349)☆157Updated 2 years ago
- Research code for pixel-based encoders of language (PIXEL)☆335Updated last year
- Run Effective Large Batch Contrastive Learning Beyond GPU/TPU Memory Constraint☆387Updated last year