IntelLabs / academic-budget-bertLinks

Repository containing code for "How to Train BERT with an Academic Budget" paper

☆314

Alternatives and similar repositories for academic-budget-bert

Users that are interested in academic-budget-bert are comparing it to the libraries listed below

Sorting:

urvashik / knnlm
☆321Updated 4 years ago
richarddwang / electra_pytorch
Pretrain and finetune ELECTRA with fastai and huggingface. (Results of the paper replicated !)
☆330Updated last year
timoschick / dino
This repository contains the code for "Generating Datasets with Pretrained Language Models".
☆188Updated 3 years ago
facebookresearch / anli
Adversarial Natural Language Inference Benchmark
☆397Updated 3 years ago
JohnGiorgi / DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…
☆380Updated 2 years ago
neulab / ExplainaBoard
Interpretable Evaluation for AI Systems
☆366Updated 2 years ago
facebookresearch / PAQ
Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"
☆203Updated 3 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆276Updated 2 years ago
facebookresearch / SEAL
Search Engines with Autoregressive Language models
☆291Updated 2 years ago
lena-voita / the-story-of-heads
This is a repository with the code for the ACL 2019 paper "Analyzing Multi-Head Self-Attention: Specialized Heads Do the Heavy Lifting, t…
☆314Updated 4 years ago
google-research-datasets / tydiqa
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆310Updated 5 years ago
google-deepmind / xquad
☆199Updated 3 years ago
LiyuanLucasLiu / Transformer-Clinic
Understanding the Difficulty of Training Transformers
☆329Updated 3 years ago
facebookresearch / MLQA
New dataset
☆306Updated 3 years ago
google-research-datasets / boolean-questions
☆167Updated 6 years ago
helboukkouri / character-bert
Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"
☆201Updated last year
jayded / eraserbenchmark
A benchmark for understanding and evaluating rationales: http://www.eraserbenchmark.com/
☆97Updated 2 years ago
lucidrains / electra-pytorch
A simple and working implementation of Electra, the fastest way to pretrain language models from scratch, in Pytorch
☆227Updated 2 years ago
rrmenon10 / ADAPET
[EMNLP 2021] Improving and Simplifying Pattern Exploiting Training
☆153Updated 3 years ago
microsoft / fastseq
An efficient implementation of the popular sequence models for text generation, summarization, and translation tasks. https://arxiv.org/p…
☆433Updated 2 years ago
microsoft / xtreme-distil-transformers
XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale
☆155Updated last year
google-deepmind / pg19
☆246Updated 5 years ago
tonyzhaozh / few-shot-learning
Few-shot Learning of GPT-3
☆352Updated last year
uds-lsv / bert-stable-fine-tuning
On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines
☆137Updated last year
salesforce / GeDi
GeDi: Generative Discriminator Guided Sequence Generation
☆211Updated last month
allenai / allentune
Hyperparameter Search for AllenNLP
☆139Updated 4 months ago
allenai / naacl2021-longdoc-tutorial
☆345Updated 4 years ago
mega002 / lm-debugger
The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.
☆178Updated 3 years ago
allenai / unifiedqa
UnifiedQA: Crossing Format Boundaries With a Single QA System
☆441Updated 3 years ago
allenai / cartography
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
☆201Updated 3 years ago