train entropix like a champ!
☆20 · Oct 10, 2024 · Updated last year
Alternatives and similar repositories for entropix-trainer
Users that are interested in entropix-trainer are comparing it to the libraries listed below.
- smol models are fun too ☆94 · Nov 9, 2024 · Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models" ☆13 · Jul 18, 2024 · Updated last year
- Entropy Based Sampling and Parallel CoT Decoding ☆17 · Oct 9, 2024 · Updated last year
- coloring terminal text with intensities (used for plotting probability, entropy with tokens) ☆12 · Oct 11, 2024 · Updated last year
- look how they massacred my boy ☆63 · Oct 16, 2024 · Updated last year
- Plotting (entropy, varentropy) for small LMs ☆99 · May 20, 2025 · Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆61 · May 12, 2026 · Updated last month
- This is sample code for Paho MQTT server with Python 2.7 ☆10 · Mar 29, 2016 · Updated 10 years ago
- trio async MQTT client that wraps paho-mqtt ☆12 · Feb 8, 2021 · Updated 5 years ago
- An AI character interaction system with emotional modeling and advanced memory management ☆17 · Oct 26, 2024 · Updated last year
- Modify Entropy Based Sampling to work with Mac Silicon via MLX ☆49 · Nov 6, 2024 · Updated last year
- PDF parser using pdfminer and pytesseract for OCR support ☆11 · Sep 19, 2019 · Updated 6 years ago
- A collection of CLI LLM tools that I built and use daily ☆15 · Aug 7, 2024 · Updated last year
- UM1 test programs and sample code ☆11 · Jul 25, 2022 · Updated 3 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism" ☆15 · Apr 30, 2025 · Updated last year
- ☆16 · Mar 13, 2025 · Updated last year
- Deepseek-CoT ☆10 · Oct 6, 2024 · Updated last year
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices ☆11 · Jan 12, 2021 · Updated 5 years ago
- Create string diagrams with LaTeX! ☆14 · Jan 3, 2025 · Updated last year
- 📰 Computing the information content of trained neural networks ☆24 · Oct 8, 2021 · Updated 4 years ago
- Say hi to anyone, for humans and agents. An Inbox Zero product ☆19 · Jul 8, 2025 · Updated 11 months ago
- ☆17 · Jul 4, 2025 · Updated 11 months ago
- Approximating the joint distribution of language models via MCTS ☆22 · Nov 3, 2024 · Updated last year
- ☆16 · Nov 5, 2024 · Updated last year
- ☆34 · May 26, 2025 · Updated last year
- Scratchpad/Chain-of-Thought Prompts ☆12 · Jun 6, 2022 · Updated 4 years ago
- The Compositionality article class. ☆14 · Mar 16, 2026 · Updated 3 months ago
- ☆19 · Sep 1, 2025 · Updated 9 months ago
- Combining SOAP and MUON ☆22 · Feb 11, 2025 · Updated last year
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer. ☆18 · Apr 23, 2023 · Updated 3 years ago
- ☆14 · Feb 25, 2019 · Updated 7 years ago
- ☆13 · Jul 19, 2025 · Updated 11 months ago
- mqtt for MycroftAI ☆17 · Mar 15, 2019 · Updated 7 years ago
- Exploring how optimizations for GEMMs work ☆36 · Feb 28, 2026 · Updated 3 months ago
- Structured outputs from DSPy and Jinja2 ☆27 · Jun 27, 2025 · Updated 11 months ago
- rust bindings for blender via extism ☆17 · Feb 3, 2026 · Updated 4 months ago
- Zig Vector Database! ☆15 · Jan 30, 2026 · Updated 4 months ago
- ☆15 · Mar 2, 2025 · Updated last year
- An MCP server for Raindrop.io (bookmarking service) ☆20 · Apr 10, 2025 · Updated last year