IntelLabs / academic-budget-bert
Repository containing code for the paper "How to Train BERT with an Academic Budget"
☆314Updated last year
Alternatives and similar repositories for academic-budget-bert
Users who are interested in academic-budget-bert are comparing it to the repositories listed below
- ☆321Updated 4 years ago
- Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)☆330Updated last year
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆397Updated 3 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated 2 years ago
- Interpretable Evaluation for AI Systems☆366Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them"☆203Updated 3 years ago
- PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…☆276Updated 2 years ago
- Search Engines with Autoregressive Language models☆291Updated 2 years ago
- This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…☆314Updated 4 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆310Updated 5 years ago
- ☆199Updated 3 years ago
- Understanding the Difficulty of Training Transformers☆329Updated 3 years ago
- New dataset☆306Updated 3 years ago
- ☆167Updated 6 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated last year
- A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/☆97Updated 2 years ago
- A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch☆227Updated 2 years ago
- [EMNLP 2021] Improving and Simplifying Pattern Exploiting Training☆153Updated 3 years ago
- An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…☆433Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- ☆246Updated 5 years ago
- Few-shot Learning of GPT-3☆352Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆137Updated last year
- GeDi: Generative Discriminator Guided Sequence Generation☆211Updated last month
- Hyperparameter Search for AllenNLP☆139Updated 4 months ago
- ☆345Updated 4 years ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆178Updated 3 years ago
- UnifiedQA: Crossing Format Boundaries With a Single QA System☆441Updated 3 years ago
- Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics☆201Updated 3 years ago