xjdr-alt / entropixLinks
Entropy Based Sampling and Parallel CoT Decoding
☆3,383Updated 6 months ago
Alternatives and similar repositories for entropix
Users that are interested in entropix are comparing it to the libraries listed below
Sorting:
- NanoGPT (124M) in 3 minutes☆2,610Updated last week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,613Updated 2 months ago
- System 2 Reasoning Link Collection☆836Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆1,197Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,557Updated 2 weeks ago
- ☆2,952Updated 8 months ago
- procedural reasoning datasets☆770Updated this week
- Code for BLT research paper☆1,675Updated 2 weeks ago
- A library for mechanistic interpretability of GPT-style language models☆2,217Updated last week
- nanoGPT style version of Llama 3.1☆1,373Updated 10 months ago
- ☆1,024Updated 5 months ago
- Optimizing inference proxy for LLMs☆2,477Updated this week
- Agentic components of the Llama Stack APIs☆4,248Updated last month
- Minimalistic large language model 3D-parallelism training☆1,909Updated this week
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆879Updated last month
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,518Updated this week
- The n-gram Language Model☆1,421Updated 10 months ago
- AllenAI's post-training codebase☆2,993Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,735Updated this week
- Tools for merging pretrained large language models.☆5,774Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,405Updated last month
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆795Updated last month
- Distributed Training Over-The-Internet☆935Updated 3 weeks ago
- A bibliography and survey of the papers surrounding o1☆1,194Updated 6 months ago
- Synthetic data curation for post-training and structured data extraction☆1,372Updated this week
- A PyTorch native platform for training generative AI models☆3,891Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,014Updated last month
- Recipes to scale inference-time compute of open models☆1,090Updated 2 weeks ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,994Updated 9 months ago
- Democratizing Reinforcement Learning for LLMs☆3,330Updated 3 weeks ago