allenai / fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
☆267 Updated last month
Alternatives and similar repositories for fm-cheatsheet
Users that are interested in fm-cheatsheet are comparing it to the libraries listed below
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day ☆257 Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆260 Updated 11 months ago
- A repository for research on medium sized language models. ☆498 Updated 2 weeks ago
- Scaling Data-Constrained Language Models ☆335 Updated 9 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples" ☆303 Updated last year
- A comprehensive deep dive into the world of tokens ☆224 Updated 11 months ago
- Multipack distributed sampler for fast padding-free training of LLMs ☆191 Updated 10 months ago
- A puzzle to learn about prompting ☆128 Updated 2 years ago
- ☆134 Updated 2 months ago
- ☆92 Updated last year
- Let's build better datasets, together! ☆259 Updated 6 months ago
- ☆520 Updated 7 months ago
- PyTorch building blocks for the OLMo ecosystem ☆234 Updated this week
- A MAD laboratory to improve AI architecture designs 🧪 ☆120 Updated 6 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆231 Updated 7 months ago
- A bagel, with everything. ☆321 Updated last year
- RuLES: a benchmark for evaluating rule-following in language models ☆226 Updated 3 months ago
- ☆200 Updated this week
- The official evaluation suite and dynamic data release for MixEval. ☆242 Updated 7 months ago
- git extension for {collaborative, communal, continual} model development ☆213 Updated 7 months ago
- ☆236 Updated 2 months ago
- Extract full next-token probabilities via language model APIs ☆247 Updated last year
- Fast bare-bones BPE for modern tokenizer training ☆159 Updated 2 months ago
- Understand and test language model architectures on synthetic tasks. ☆217 Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆205 Updated 2 weeks ago
- ☆541 Updated 9 months ago
- Evaluation suite for LLMs ☆349 Updated 2 months ago
- Implementation of the paper "Data Engineering for Scaling Language Models to 128K Context" ☆463 Updated last year
- experiments with inference on llama ☆104 Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆187 Updated last year