allenai / bff

☆39

Alternatives and similar repositories for bff

Users who are interested in bff are comparing it to the repositories listed below.

shayne-longpre / a-pretrainers-guide
☆72 Updated 2 years ago
hadasah / btm
☆75 Updated last year
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58 Updated 2 years ago
allenai / catwalk
This project studies the performance and robustness of language models and task-adaptation methods.
☆150 Updated last year
hamishivi / EasyLM
Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75 Updated 11 months ago
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆133 Updated 6 months ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97 Updated last year
XiangLi1999 / AutoBencher
☆29 Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆107 Updated last year
EleutherAI / semantic-memorization
☆44 Updated 8 months ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116 Updated last month
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22 Updated 11 months ago
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90 Updated last year
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆49 Updated last year
jxmorris12 / bm25_pt
minimal pytorch implementation of bm25 (with sparse tensors)
☆104 Updated last year
McGill-NLP / length-generalization
Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023
☆136 Updated last year
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77 Updated 2 years ago
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆46 Updated last year
kernelmachine / silo-lm
SILO Language Models code repository
☆81 Updated last year
anadim / the-little-retrieval-test
☆34 Updated 2 years ago
RulinShao / massive-serve
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆22 Updated last month
CodeCreator / WebOrganizer
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
☆58 Updated 3 months ago
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers
☆131 Updated last year
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93 Updated 2 years ago
r-three / RAD
Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model
☆44 Updated last year
google-research / t5x_retrieval
☆100 Updated 2 years ago
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70 Updated last year
srush / LLM-Talk
☆51 Updated last year
google-deepmind / streamingqa
☆48 Updated last year
McGill-NLP / CHASE
Synthetic Data Generation for Evaluation
☆15 Updated 5 months ago