xjdr-alt / entropix
Entropy Based Sampling and Parallel CoT Decoding
☆3,036 Updated last week
Related projects
Alternatives and complementary repositories for entropix
- Optimizing inference proxy for LLMs ☆1,563 Updated this week
- ☆935 Updated 2 weeks ago
- ☆2,746 Updated 2 months ago
- A framework for serving and evaluating LLM routers - save LLM costs without compromising quality! ☆3,256 Updated 3 months ago
- Distributed Training Over-The-Internet ☆688 Updated 2 months ago
- NanoGPT (124M) quality in 7.8 8xH100-minutes ☆1,033 Updated this week
- g1: Using Llama-3.1 70b on Groq to create o1-like reasoning chains ☆3,906 Updated last month
- 🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide… ☆1,096 Updated this week
- System 2 Reasoning Link Collection ☆693 Updated 3 weeks ago
- Efficient Triton Kernels for LLM Training ☆3,454 Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆1,824 Updated 2 weeks ago
- A benchmark to evaluate language models on questions I've previously asked them to solve. ☆916 Updated 2 weeks ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆1,634 Updated this week
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs. ☆4,227 Updated last week
- Automated Design of Agentic Systems ☆1,038 Updated this week
- Composable building blocks to build Llama Apps ☆4,594 Updated this week
- AdalFlow: The library to build & auto-optimize LLM applications. ☆2,074 Updated this week
- Implementation for MatMul-free LM. ☆2,920 Updated 2 weeks ago
- A native PyTorch Library for large model training ☆2,623 Updated this week
- Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling" ☆803 Updated 3 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,045 Updated this week
- Things you can do with the token embeddings of an LLM ☆1,376 Updated last week
- ReFT: Representation Finetuning for Language Models ☆1,159 Updated 2 weeks ago
- ☆1,271 Updated 2 weeks ago
- Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale. ☆2,489 Updated this week
- The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Join our Community: https://discord.com/servers/agora-999382051… ☆1,772 Updated this week
- The Open Cookbook for Top-Tier Code Large Language Model ☆1,182 Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a… ☆798 Updated 2 weeks ago
- A reading list on LLM based Synthetic Data Generation 🔥 ☆791 Updated 2 weeks ago