huggingface / datablationsLinks

Scaling Data-Constrained Language Models

☆342

Alternatives and similar repositories for datablations

Users that are interested in datablations are comparing it to the libraries listed below

Sorting:

p-lambda / dsir
DSIR large-scale data selection framework for language model training
☆266Updated last year
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆202Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆258Updated 2 years ago
haoliuhl / chain-of-hindsight
Simple next-token-prediction for RLHF
☆227Updated 2 years ago
LLM360 / amber-train
Pre-training code for Amber 7B LLM
☆169Updated last year
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆205Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219Updated 2 years ago
lucidrains / CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
☆178Updated last year
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆240Updated 9 months ago
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆225Updated 2 months ago
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆315Updated last year
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆277Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆108Updated 2 years ago
neulab / gemini-benchmark
☆150Updated last year
allenai / catwalk
This project studies the performance and robustness of language models and task-adaptation methods.
☆154Updated last year
allenai / WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
☆246Updated last year
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆169Updated 2 months ago
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆300Updated 9 months ago
mlfoundations / open_lm
A repository for research on medium sized language models.
☆520Updated 5 months ago
tianjunz / HIR
☆159Updated 2 years ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆100Updated last year
CarperAI / autocrit
A repository for transformer critique learning and generation
☆89Updated last year
bigscience-workshop / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆105Updated 2 years ago
google-deepmind / loft
LOFT: A 1 Million+ Token Long-Context Benchmark
☆218Updated 5 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆253Updated last year
Cohere-Labs-Community / parameter-efficient-moe
☆272Updated 2 years ago
SALT-NLP / demonstrated-feedback
☆129Updated last year
FranxYao / Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
☆478Updated last year
tomekkorbak / pretraining-with-human-feedback
Code accompanying the paper Pretraining Language Models with Human Preferences
☆180Updated last year
akoksal / LongForm
Reverse Instructions to generate instruction tuning data with corpus examples
☆216Updated last year