xjdr-alt / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆3,368Updated 6 months ago
Alternatives and similar repositories for entropix
Users that are interested in entropix are comparing it to the libraries listed below
Sorting:
- NanoGPT (124M) in 3 minutes☆2,546Updated 3 weeks ago
- System 2 Reasoning Link Collection☆833Updated 2 months ago
- ☆2,939Updated 8 months ago
- Optimizing inference proxy for LLMs☆2,233Updated this week
- Verifiers for LLM Reinforcement Learning☆953Updated this week
- Minimalistic large language model 3D-parallelism training☆1,870Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,700Updated this week
- Recipes to scale inference-time compute of open models☆1,071Updated last week
- Tools for merging pretrained large language models.☆5,721Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,524Updated last month
- ☆1,019Updated 5 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆1,484Updated 2 months ago
- The n-gram Language Model☆1,420Updated 9 months ago
- Code for BLT research paper☆1,587Updated this week
- Implementing DeepSeek R1's GRPO algorithm from scratch☆1,328Updated last month
- [ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling☆871Updated 2 weeks ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,379Updated this week
- procedural reasoning datasets☆580Updated this week
- nanoGPT style version of Llama 3.1☆1,367Updated 9 months ago
- Everything about the SmolLM2 and SmolVLM family of models☆2,361Updated last month
- Sky-T1: Train your own O1 preview model within $450☆3,245Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,537Updated this week
- Democratizing Reinforcement Learning for LLMs☆3,236Updated this week
- Distributed Training Over-The-Internet☆920Updated 5 months ago
- MobileLLM Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.☆1,296Updated 3 weeks ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆791Updated 2 weeks ago
- Fast State-of-the-Art Static Embeddings☆1,615Updated this week
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,010Updated 3 weeks ago
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆299Updated 3 weeks ago
- Bringing BERT into modernity via both architecture changes and scaling☆1,358Updated this week