xjdr-alt / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆3,344 · Updated 4 months ago
Alternatives and similar repositories for entropix
Users who are interested in entropix are comparing it to the repositories listed below:
- NanoGPT (124M) in 3 minutes ☆2,417 · Updated last week
- System 2 Reasoning Link Collection ☆812 · Updated last week
- Optimizing inference proxy for LLMs ☆2,110 · Updated last week
- Training Large Language Model to Reason in a Continuous Latent Space ☆998 · Updated 2 months ago
- procedural reasoning datasets ☆534 · Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,481 · Updated last month
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,578 · Updated this week
- ☆1,011 · Updated 3 months ago
- Verifiers for LLM Reinforcement Learning ☆686 · Updated this week
- Large Concept Models: Language modeling in a sentence representation space ☆2,053 · Updated last month
- Distributed Training Over-The-Internet ☆891 · Updated 3 months ago
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide… ☆1,247 · Updated last month
- The n-gram Language Model ☆1,404 · Updated 7 months ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆990 · Updated last month
- Implementation for MatMul-free LM. ☆2,969 · Updated 4 months ago
- ☆2,892 · Updated 6 months ago
- Recipes to scale inference-time compute of open models ☆1,044 · Updated last month
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆2,271 · Updated this week
- Official repository for our work on micro-budget training of large-scale diffusion models. ☆1,361 · Updated 2 months ago
- Code for BLT research paper ☆1,436 · Updated this week
- Minimalistic 4D-parallelism distributed training framework for education purpose ☆948 · Updated 2 weeks ago
- Minimalistic large language model 3D-parallelism training ☆1,715 · Updated this week
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆652 · Updated 2 months ago
- Everything about the SmolLM2 and SmolVLM family of models ☆2,049 · Updated this week
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ☆1,057 · Updated this week
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-… ☆3,328 · Updated last month
- [ICLR 2025] Automated Design of Agentic Systems ☆1,225 · Updated last month
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆857 · Updated last month
- A bibliography and survey of the papers surrounding o1 ☆1,182 · Updated 4 months ago
- Bringing BERT into modernity via both architecture changes and scaling ☆1,283 · Updated this week