Use Muon optimizer instead of AdamW.
☆49Mar 2, 2026Updated last month
Alternatives and similar repositories for muon-optimizer-guide
Users that are interested in muon-optimizer-guide are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆14Jan 7, 2025Updated last year
- Train LLM on Hugging Face infra☆71Apr 2, 2026Updated last week
- Multimodal single-cell data API, for adding information for -omics like scRNA-seq as well as ATAC-seq, spatial transcriptomics, hyperspec…☆13Jul 12, 2024Updated last year
- Simple demo showing how to use the Forge API by Nous Research☆16Nov 12, 2024Updated last year
- Multi-Agent LLM System for Digital Scam Protection☆13Dec 19, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Prebuilt WASM binaries for tree-sitter's language parsers.☆16Oct 7, 2025Updated 6 months ago
- decontamination☆30Mar 4, 2026Updated last month
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- ☆10Sep 4, 2025Updated 7 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Updated this week
- ☆13Jan 22, 2025Updated last year
- ☆10Jun 13, 2022Updated 3 years ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- Fuzzing solmate with medusa☆10Aug 14, 2023Updated 2 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 10 months ago
- ☆14Mar 17, 2022Updated 4 years ago
- ☆10Jun 14, 2024Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Mar 27, 2024Updated 2 years ago
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)☆17Feb 25, 2026Updated last month
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆18Jun 3, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- A repo with data files, assets and code supporting and powering the Learning Path Index Project☆18May 13, 2025Updated 11 months ago
- Source code to execute signal injection attacks against CCD image sensors☆11Aug 26, 2021Updated 4 years ago
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 5 months ago
- ☆14Jul 5, 2024Updated last year
- Enjoy Hip hop beats to Relax or Study! 🎧 🎶☆17May 14, 2021Updated 4 years ago
- This repository contains resources, documentation and artifacts describing LLM agents☆15Jan 22, 2025Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Repository for the Hacktoberfest Meetup on 7th October 2019☆14Oct 20, 2020Updated 5 years ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆14Apr 25, 2024Updated last year
- ☆12Nov 3, 2024Updated last year
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated 2 years ago
- ☆11Jul 1, 2024Updated last year
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 6 months ago
- In this course navigates through the LLMOps pipeline, enabling you to preprocess training data for supervised fine-tuning and deploy cust…☆14Feb 13, 2024Updated 2 years ago