Use Muon optimizer instead of AdamW.
☆39Mar 2, 2026Updated this week
Alternatives and similar repositories for muon-optimizer-guide
Users that are interested in muon-optimizer-guide are comparing it to the libraries listed below
Sorting:
- Notes for CS294/194-196: Large Language Model Agents (Fall 2024, UC Berkeley), summarizing 12 lectures on LLM fundamentals, reasoning, pl…☆14Jan 7, 2025Updated last year
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- Train LLM on Hugging Face infra☆68Nov 13, 2025Updated 3 months ago
- Multi-Agent LLM System for Digital Scam Protection☆12Dec 19, 2024Updated last year
- ☆10Sep 4, 2025Updated 6 months ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition☆10Oct 31, 2022Updated 3 years ago
- Official code repository for the paper titled "Efficient Molecular Conformer Generation with SO(3) Averaged Flow-Matching and Reflow" (IC…☆13Jan 8, 2026Updated last month
- RAG Based LLM Chatbot Built using Open Source Stack (Llama 3.2 Model, BGE Embeddings, and Qdrant running locally within a Docker Containe…☆15Jan 9, 2025Updated last year
- Source code to execute signal injection attacks against CCD image sensors☆11Aug 26, 2021Updated 4 years ago
- DualBind is a 3D structure-based deep learning model with a dual-loss framework for accurate and fast protein-ligand binding affinity pre…☆14Oct 21, 2025Updated 4 months ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- IBM Quantum Challenge Fall 2023☆10May 23, 2023Updated 2 years ago
- An efficient and scalable attention module designed to reduce memory usage and improve inference speed in large language models. Designe…☆21Jun 25, 2025Updated 8 months ago
- decontamination☆26Dec 3, 2025Updated 3 months ago
- ☆11Nov 30, 2023Updated 2 years ago
- Colab notebooks exploring different Machine Learning topics.☆16Apr 2, 2022Updated 3 years ago
- SING: SDE Inference via Natural Gradients☆36Dec 9, 2025Updated 2 months ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- A ratatui based vertical and horizontal slider.☆37Updated this week
- Code for WACV24 work for multiview acoustic-visual detection☆13Mar 22, 2024Updated last year
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- Multimodal single-cell data API, for adding information for -omics like scRNA-seq as well as ATAC-seq, spatial transcriptomics, hyperspec…☆13Jul 12, 2024Updated last year
- iOS app for Meta smart glasses with OpenAI Realtime API voice assistant. SwiftUI + Bluetooth HFP + GPT realtime voice.☆50Jan 15, 2026Updated last month
- ☆10Jun 13, 2022Updated 3 years ago
- ☆12Aug 21, 2024Updated last year
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- ☆10Jun 14, 2024Updated last year
- A simple, generic, and flexible keyframe animation library for Rust.☆30Dec 30, 2025Updated 2 months ago
- Fuzzing solmate with medusa☆10Aug 14, 2023Updated 2 years ago
- DeepSimulator is a hybrid tool between DDS and DL techniques to simulate business processes☆11Mar 31, 2023Updated 2 years ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral]☆11Dec 26, 2024Updated last year
- Benchmarking Multi-Image Understanding in Vision and Language Models☆12Jul 29, 2024Updated last year
- pytorch code for sound event localization and classification☆13Aug 12, 2021Updated 4 years ago
- ☆15Jan 24, 2023Updated 3 years ago
- This repository is associated with the research paper titled ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large…☆15Jun 4, 2025Updated 9 months ago
- ☆14Mar 17, 2022Updated 3 years ago
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 4 months ago
- Langchain_CrewAI_Gemini - An Gemini AI powered AI Agent (Multi-Agent) Project.☆13Mar 24, 2024Updated last year
- Collaborative retina modelling across datasets and species.☆18Updated this week