allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆257Updated 2 months ago
Related projects: ⓘ
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆248Updated 10 months ago
- The official evaluation suite and dynamic data release for MixEval.☆200Updated this week
- A repository for research on medium sized language models.☆469Updated last month
- ☆419Updated 2 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆264Updated 8 months ago
- LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processin…☆667Updated this week
- Multipack distributed sampler for fast padding-free training of LLMs☆170Updated last month
- GPT-2 (124M) quality in 5B tokens☆227Updated last week
- A comprehensive deep dive into the world of tokens☆212Updated 2 months ago
- awesome synthetic (text) datasets☆213Updated last week
- Scaling Data-Constrained Language Models☆310Updated this week
- RuLES: a benchmark for evaluating rule-following in language models☆209Updated this week
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆218Updated 5 months ago
- RewardBench: the first evaluation tool for reward models.☆352Updated last week
- Extract full next-token probabilities via language model APIs☆226Updated 6 months ago
- Scalable data pre processing and curation toolkit for LLMs☆461Updated this week
- An Open Source Toolkit For LLM Distillation☆284Updated last month
- batched loras☆327Updated last year
- Long context evaluation for large language models☆148Updated this week
- A puzzle to learn about prompting☆106Updated last year
- Fast bare-bones BPE for modern tokenizer training☆138Updated 3 weeks ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆493Updated this week
- Minimalistic large language model 3D-parallelism training☆1,116Updated this week
- Let's build better datasets, together!☆195Updated last month
- Sparse autoencoders☆297Updated last week
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆139Updated 3 weeks ago
- ☆409Updated 10 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆89Updated last week