Use Muon optimizer instead of AdamW.
☆50Mar 2, 2026Updated 2 months ago
Alternatives and similar repositories for muon-optimizer-guide
Users that are interested in muon-optimizer-guide are comparing it to the libraries listed below.
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆15Jan 7, 2025Updated last year
- Educational WIP☆70Feb 16, 2026Updated 2 months ago
- ☆13May 4, 2023Updated 3 years ago
- Atari-style POMDPs☆28Apr 24, 2026Updated last week
- A simple, generic, and flexible keyframe animation library for Rust.☆30Mar 27, 2026Updated last month
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- opc-ua-pubsub-dotnet is a library which implements OPC UA PubSub encoding and decoding in a simplified way.☆29Feb 6, 2025Updated last year
- Simple demo showing how to use the Forge API by Nous Research☆17Nov 12, 2024Updated last year
- A Comparison of Increasing Cost Tree Search (ICTS) and Enhanced Partial Expansion A*☆19Jul 6, 2023Updated 2 years ago
- Multi-Agent LLM System for Digital Scam Protection☆15Dec 19, 2024Updated last year
- Realize closed-loop position control for mobile robots with sliding mode control in ROS.☆25May 16, 2025Updated 11 months ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Learnable MAPF. “Distributed Heuristic Multi-Agent Path Finding with Communication” (DHC) algorithm from ICRA 2021 is implemented and ben…☆24Nov 9, 2023Updated 2 years ago
- Prebuilt WASM binaries for tree-sitter's language parsers.☆16Oct 7, 2025Updated 6 months ago
- decontamination☆30Mar 4, 2026Updated 2 months ago
- ☆21Nov 7, 2019Updated 6 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Apr 12, 2026Updated 3 weeks ago
- ☆20Apr 13, 2026Updated 3 weeks ago
- ☆30Sep 3, 2024Updated last year
- ☆14Jan 22, 2025Updated last year
- ☆10Jun 13, 2022Updated 3 years ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆32Aug 21, 2025Updated 8 months ago
- A ratatui based vertical and horizontal slider.☆41Apr 12, 2026Updated 3 weeks ago
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Fuzzing solmate with medusa☆10Aug 14, 2023Updated 2 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 11 months ago
- Research on DeepSeek Sparse Attention☆40Oct 8, 2025Updated 6 months ago
- Lightweight Git for microcontroller☆16Aug 16, 2022Updated 3 years ago
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17Feb 25, 2026Updated 2 months ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆19Jun 3, 2025Updated 11 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- This is an umbrella repository that contains links and information about all the tools and algorithms related to the POGEMA Benchmark.☆38Dec 10, 2025Updated 4 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- A repo with data files, assets and code supporting and powering the Learning Path Index Project☆18May 13, 2025Updated 11 months ago
- Source code to execute signal injection attacks against CCD image sensors☆11Aug 26, 2021Updated 4 years ago