xjdr-alt / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆3,355Updated 5 months ago
Alternatives and similar repositories for entropix:
Users that are interested in entropix are comparing it to the libraries listed below
- System 2 Reasoning Link Collection☆826Updated last month
- Optimizing inference proxy for LLMs☆2,167Updated this week
- NanoGPT (124M) in 3 minutes☆2,501Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.☆2,461Updated 3 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,062Updated 2 months ago
- Distributed Training Over-The-Internet☆901Updated 4 months ago
- Large Concept Models: Language modeling in a sentence representation space☆2,098Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆813Updated 3 weeks ago
- Code for BLT research paper☆1,513Updated this week
- The n-gram Language Model☆1,414Updated 8 months ago
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,514Updated 2 weeks ago
- procedural reasoning datasets☆565Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,404Updated 2 months ago
- nanoGPT style version of Llama 3.1☆1,356Updated 8 months ago
- [ICLR 2025] Automated Design of Agentic Systems☆1,258Updated 2 months ago
- Implementing DeepSeek R1's GRPO algorithm from scratch☆445Updated this week
- ☆1,015Updated 4 months ago
- ☆634Updated 4 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality☆3,817Updated 8 months ago
- ☆855Updated 7 months ago
- A library for making RepE control vectors☆579Updated 3 months ago
- Minimalistic 4D-parallelism distributed training framework for education purpose☆991Updated last month
- A benchmark to evaluate language models on questions I've previously asked them to solve.☆1,005Updated 2 months ago
- Recipes to scale inference-time compute of open models☆1,055Updated last month
- Bringing BERT into modernity via both architecture changes and scaling☆1,329Updated 3 weeks ago
- ☆2,703Updated last week
- Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""☆3,815Updated 4 months ago
- OO for LLMs☆702Updated this week
- Things you can do with the token embeddings of an LLM☆1,437Updated 3 weeks ago
- A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!☆1,040Updated 2 months ago