allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆257 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for fm-cheatsheet
- Manage scalable open LLM inference endpoints in Slurm clusters ☆236 · Updated 4 months ago
- ☆451 · Updated 3 weeks ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆252 · Updated last year
- ☆101 · Updated 3 months ago
- Extract full next-token probabilities via language model APIs ☆229 · Updated 8 months ago
- The official evaluation suite and dynamic data release for MixEval. ☆224 · Updated last week
- A repository for research on medium sized language models. ☆479 · Updated this week
- Scaling Data-Constrained Language Models ☆321 · Updated last month
- A comprehensive deep dive into the world of tokens ☆214 · Updated 4 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆293 · Updated 11 months ago
- awesome synthetic (text) datasets ☆242 · Updated 3 weeks ago
- A puzzle to learn about prompting ☆121 · Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆211 · Updated last month
- Multipack distributed sampler for fast padding-free training of LLMs ☆178 · Updated 3 months ago
- A MAD laboratory to improve AI architecture designs 🧪 ☆95 · Updated 6 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆811 · Updated this week
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp… ☆216 · Updated 7 months ago
- A bibliography and survey of the papers surrounding o1 ☆754 · Updated this week
- Understand and test language model architectures on synthetic tasks. ☆162 · Updated 6 months ago
- ☆258 · Updated this week
- code for training & evaluating Contextual Document Embedding models ☆117 · Updated this week
- ☆91 · Updated last year
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆84 · Updated last week
- batched loras ☆336 · Updated last year
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024