A single-line modification to any (dualizer-based) optimizer that allows the optimizer to adapt to the scale of the gradients as they change during training
☆19Jan 11, 2025Updated last year
Alternatives and similar repositories for adaptive-muon
Users that are interested in adaptive-muon are comparing it to the libraries listed below
Sorting:
- Code for paper Almost-Orthogonal Layers for Efficient General-Purpose Lipschitz Networks☆12Aug 9, 2022Updated 3 years ago
- ☆24Mar 2, 2026Updated last week
- Github Repository for the HOI4 ULTRA Project.☆11Updated this week
- A collection of niche / personally useful PyTorch optimizers with modified code.☆27Oct 25, 2025Updated 4 months ago
- [NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling☆40Dec 2, 2023Updated 2 years ago
- ☆44Nov 1, 2025Updated 4 months ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- ☆50Aug 21, 2025Updated 6 months ago
- ☆55Feb 24, 2026Updated last week
- An artificial matrix generator in C☆12Feb 16, 2023Updated 3 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~☆11Mar 4, 2018Updated 8 years ago
- ☆14Apr 14, 2025Updated 10 months ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset.☆16Sep 8, 2022Updated 3 years ago
- ☆22Updated this week
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- Pre-train BERT from scratch, with HuggingFace. Accompanies the blog post: sidsite.com/posts/bert-from-scratch☆43May 20, 2025Updated 9 months ago
- ☆11Apr 3, 2023Updated 2 years ago
- A compressed SDL_Surface format using the LZ4 compression library.☆14Sep 28, 2022Updated 3 years ago
- CLI utilty to work out proper constants for vpternlogic instruction☆13Jan 22, 2023Updated 3 years ago
- A stream to RTL compiler based on MLIR and CIRCT☆16Nov 15, 2022Updated 3 years ago
- Stereo lithography file support for Rust.☆12Jul 29, 2023Updated 2 years ago
- 4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)☆13Feb 13, 2025Updated last year
- A library for training crosscoders☆16May 28, 2025Updated 9 months ago
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 5 years ago
- CoMeT is a new low-cost RowHammer mitigation that uses Count-Min Sketch-based aggressor row tracking, as described in our HPCA'24 paper h…☆11Jan 23, 2026Updated last month
- ICLR 26 Texture Vector-Quantization and Reconstruction Aware Prediction for Generative Super-Resolution☆16Feb 2, 2026Updated last month
- A survey of manufacturer-provided DRAM operating parameters and timings as specified by DRAM chip datasheets from between 1970 and 2021. …☆11May 4, 2022Updated 3 years ago
- EdX course from MIT on machine learning 6.86x☆11Dec 16, 2020Updated 5 years ago
- ☆11Aug 4, 2022Updated 3 years ago
- Provides current Voreen Sources (with modifications) by Uni Münster to build voreen for PC, server or lrz cluster, including workspaces a…☆12Mar 2, 2024Updated 2 years ago
- Testing Ibex build using Yosys and open source toolchains.☆11Oct 2, 2021Updated 4 years ago
- This simulator models multi core systems, intended primarily for studies on main memory management techniques. It models a trace-based ou…☆12Jan 18, 2016Updated 10 years ago
- ☆10Oct 27, 2023Updated 2 years ago
- A merged read deduplication tool capable to perform merged read deduplication on single end data.☆12Sep 4, 2024Updated last year
- ☆12Jul 9, 2021Updated 4 years ago
- Musings in GEMM (General Matrix Multiplication)☆14Dec 14, 2025Updated 2 months ago
- ☆13Jan 10, 2026Updated 2 months ago
- Solution of the telegram ML competition 2023☆14May 26, 2024Updated last year