vukrosic / leo-optimizerLinks
Leo optimizer, variation of Muon that runs faster
☆30Updated 5 months ago
Alternatives and similar repositories for leo-optimizer
Users that are interested in leo-optimizer are comparing it to the libraries listed below
Sorting:
- ☆12Updated 8 months ago
- Unofficial implementation of Tiny Recursive Model (TRM), improvement to HRM from Sapient AI, by Alexia Jolicoeur-Martineau☆174Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)☆110Updated 11 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP☆34Updated 5 months ago
- open source alpha evolve☆69Updated 8 months ago
- Repository to create traveling waves integrate special information through time☆56Updated 6 months ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated 4 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 8 months ago
- My submission to the ARC-AGI-3 Developer Preview Agent Compitition.☆34Updated 2 weeks ago
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆151Updated this week
- Large multi-modal models (L3M) pre-training.☆230Updated 4 months ago
- Qwen3-0.6B megakernel: 527 tok/s decode on RTX 3090 (3.8x faster than PyTorch)☆70Updated this week
- ☆62Updated 7 months ago
- ☆147Updated last year
- Simple & Scalable Pretraining for Neural Architecture Research☆308Updated 2 months ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset☆31Updated 10 months ago
- Verification of Google DeepMind's AlphaEvolve 48-multiplication matrix algorithm, a breakthrough in matrix multiplication after 56 years.☆133Updated 7 months ago
- Code for Bolmo: Byteifying the Next Generation of Language Models☆117Updated last month
- Implementation of SOAR☆49Updated 4 months ago
- RAG Agent for the ARC AGI Challenge☆20Updated last year
- aesthetic tensor visualiser☆28Updated 9 months ago
- A Repo focusing on Engineering Physics Applications of MLX☆12Updated last year
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆112Updated 8 months ago
- ☆67Updated 10 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆128Updated 4 months ago
- ☆15Updated 7 months ago
- MLX Transformers is a library that provides model implementation in MLX. It uses a similar model interface as HuggingFace Transformers an…☆72Updated last year
- ☆59Updated 2 months ago
- ☆63Updated 7 months ago
- ☆41Updated 9 months ago