xjdr-alt / entropixLinks

Entropy Based Sampling and Parallel CoT Decoding

☆3,398

Alternatives and similar repositories for entropix

Users that are interested in entropix are comparing it to the libraries listed below

Sorting:

KellerJordan / modded-nanogpt
NanoGPT (124M) in 3 minutes
☆2,851Updated last week
willccbb / verifiers
Verifiers for LLM Reinforcement Learning
☆1,577Updated this week
open-thought / system-2-research
System 2 Reasoning Link Collection
☆846Updated 4 months ago
NousResearch / DisTrO
Distributed Training Over-The-Internet
☆946Updated 2 months ago
codelion / optillm
Optimizing inference proxy for LLMs
☆2,645Updated this week
carlini / yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
☆1,022Updated 2 months ago
facebookresearch / lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
☆4,659Updated last week
trotsky1997 / MathBlackBox
☆1,028Updated 7 months ago
ridgerchu / matmulfreellm
Implementation for MatMul-free LM.
☆3,016Updated this week
karpathy / nano-llama31
nanoGPT style version of Llama 3.1
☆1,401Updated 11 months ago
facebookresearch / blt
Code for BLT research paper
☆1,740Updated 2 months ago
huggingface / search-and-learn
Recipes to scale inference-time compute of open models
☆1,108Updated 2 months ago
SakanaAI / self-adaptive-llms
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
☆1,129Updated 5 months ago
EleutherAI / cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
☆808Updated last week
huggingface / picotron
Minimalistic 4D-parallelism distributed training framework for education purpose
☆1,607Updated 2 weeks ago
bespokelabsai / curator
Synthetic data curation for post-training and structured data extraction
☆1,456Updated 2 weeks ago
EurekaLabsAI / ngram
The n-gram Language Model
☆1,437Updated 11 months ago
open-thought / reasoning-gym
procedural reasoning datasets
☆979Updated 2 weeks ago
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆897Updated 2 months ago
facebookresearch / coconut
Training Large Language Model to Reason in a Continuous Latent Space
☆1,199Updated 6 months ago
policy-gradient / GRPO-Zero
Implementing DeepSeek R1's GRPO algorithm from scratch
☆1,488Updated 3 months ago
argilla-io / distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…
☆2,816Updated this week
facebookresearch / large_concept_model
Large Concept Models: Language modeling in a sentence representation space
☆2,251Updated 5 months ago
pytorch / torchtune
PyTorch native post-training library
☆5,361Updated this week
mistralai / mistral-finetune
☆2,986Updated 10 months ago
huggingface / datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
☆2,504Updated this week
allenai / open-instruct
AllenAI's post-training codebase
☆3,077Updated this week
openai / transformer-debugger
☆4,087Updated last year
openai / SWELancer-Benchmark
This repo contains the dataset and code for the paper "SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software E…
☆1,434Updated last week
NousResearch / Hermes-Function-Calling
☆938Updated 10 months ago