allenai / fm-cheatsheet

Website for hosting the Open Foundation Models Cheat Sheet.

☆267

Alternatives and similar repositories for fm-cheatsheet

Users who are interested in fm-cheatsheet are comparing it to the repositories listed below.

huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆273 Updated last year
llm-efficiency-challenge / neurips_llm_efficiency_challenge
NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day
☆256 Updated 2 years ago
mlfoundations / open_lm
A repository for research on medium-sized language models.
☆514 Updated 4 months ago
google-deepmind / mishax
☆142 Updated last month
srush / GPTWorld
A puzzle to learn about prompting
☆135 Updated 2 years ago
lm-sys / llm-decontaminator
Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"
☆311 Updated last year
imoneoi / multipack
Multipack distributed sampler for fast padding-free training of LLMs
☆201 Updated last year
justinchiu / openlogprobs
Extract full next-token probabilities via language model APIs
☆247 Updated last year
normster / llm_rules
RuLES: a benchmark for evaluating rule-following in language models
☆238 Updated 7 months ago
JinjieNi / MixEval
The official evaluation suite and dynamic data release for MixEval.
☆250 Updated 11 months ago
huggingface / data-is-better-together
Let's build better datasets, together!
☆262 Updated 10 months ago
huggingface / datablations
Scaling Data-Constrained Language Models
☆342 Updated 3 months ago
IBM / ModuleFormer
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆224 Updated last month
booydar / babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
☆215 Updated last month
center-for-humans-and-machines / transformer-heads
Toolkit for attaching, training, saving and loading of new heads for transformer models
☆289 Updated 7 months ago
mcleish7 / arithmetic
Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al. (NeurIPS 2024)
☆192 Updated last year
jxmorris12 / cde
code for training & evaluating Contextual Document Embedding models
☆198 Updated 5 months ago
SumanthRH / tokenization
A comprehensive deep dive into the world of tokens
☆226 Updated last year
sabetAI / BLoRA
batched loras
☆346 Updated 2 years ago
huggingface / cosmopedia
☆544 Updated 11 months ago
stanford-crfm / ecosystem-graphs
☆268 Updated 8 months ago
Aleph-Alpha-Research / scaling
Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…
☆64 Updated 3 weeks ago
Data-Provenance-Initiative / Data-Provenance-Collection
☆254 Updated 6 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆103 Updated last year
AblateIt / finetune-study
Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes.
☆82 Updated 2 years ago
ConsequentAI / fneval
Functional Benchmarks and the Reasoning Gap
☆89 Updated last year
davanstrien / awesome-synthetic-datasets
awesome synthetic (text) datasets
☆302 Updated 3 months ago
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆189 Updated last year
QuixiAI / spectrum
☆136 Updated 2 months ago
EleutherAI / nanoGPT-mup
The simplest, fastest repository for training/finetuning medium-sized GPTs.
☆166 Updated 3 months ago