allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆262Updated 6 months ago
Alternatives and similar repositories for fm-cheatsheet:
Users who are interested in fm-cheatsheet are comparing it to the libraries listed below:
- Manage scalable open LLM inference endpoints in Slurm clusters☆247Updated 6 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆253Updated last year
- A repository for research on medium sized language models.☆484Updated this week
- ☆484Updated last month
- Textbook on reinforcement learning from human feedback☆111Updated this week
- Scaling Data-Constrained Language Models☆330Updated 3 months ago
- ☆115Updated this week
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆297Updated last year
- A comprehensive deep dive into the world of tokens☆215Updated 6 months ago
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 2 months ago
- A puzzle to learn about prompting☆123Updated last year
- awesome synthetic (text) datasets☆253Updated 2 months ago
- Fast bare-bones BPE for modern tokenizer training☆141Updated 2 months ago
- Extract full next-token probabilities via language model APIs☆230Updated 10 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆215Updated this week
- Multipack distributed sampler for fast padding-free training of LLMs☆184Updated 5 months ago
- git extension for {collaborative, communal, continual} model development☆207Updated 2 months ago
- Let's build better datasets, together!☆244Updated 3 weeks ago
- Minimalistic 4D-parallelism distributed training framework for educational purposes☆644Updated this week
- Automatic Evals for Instruction-Tuned Models☆100Updated this week
- A simple unified framework for evaluating LLMs☆164Updated 3 weeks ago
- A bagel, with everything.☆315Updated 9 months ago
- experiments with inference on llama☆104Updated 7 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated 8 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆259Updated last week
- Evaluating LLMs with fewer examples☆141Updated 9 months ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆156Updated last week
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆90Updated last month
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆176Updated last month