MathIsAll/ZO-AdaMU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MathIsAll/ZO-AdaMU)

MathIsAll / ZO-AdaMU

This project is a implementation in PyTorch for ZO-AdaMU optimization: Adapting Perturbation with the Momentum and Uncertainty in Zeroth-order Optimization.

☆15

Alternatives and similar repositories for ZO-AdaMU

Users that are interested in ZO-AdaMU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

f-dangel / sirfshampoo
View on GitHub
[ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)
☆15Nov 4, 2024Updated last year
UbiquitousLearning / Backpropagation_Free_Training_Survey
View on GitHub
☆26Feb 22, 2024Updated 2 years ago
inikishev / beat-manipulator
View on GitHub
beat swapping powered by AI
☆16Jul 7, 2024Updated 2 years ago
andytu28 / VQT
View on GitHub
☆22Mar 3, 2023Updated 3 years ago
kvfrans / notes
View on GitHub
☆15Oct 26, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AtlasAnalyticsLab / AdaFisher
View on GitHub
[ICLR 2025] AdaFisher: Adaptive Second Order Optimization via Fisher Information
☆52Feb 7, 2025Updated last year
kaloureyes3 / v4-clients
View on GitHub
☆10Apr 5, 2024Updated 2 years ago
lecoan / pytorch-RLE
View on GitHub
A implement of run-length encoding for Pytorch tensor using CUDA
☆14Apr 7, 2021Updated 5 years ago
yifanycc / AdaZeta
View on GitHub
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆13Dec 15, 2024Updated last year
OPTML-Group / DeepZero
View on GitHub
[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…
☆72Oct 9, 2024Updated last year
OPTAMI / OPTAMI
View on GitHub
This package is dedicated to high-order optimization methods. All the methods can be used similarly to standard PyTorch optimizers.
☆30Jun 17, 2025Updated last year
amazon-science / mezo_svrg
View on GitHub
Code the ICML 2024 paper: "Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models"
☆12Jun 25, 2024Updated 2 years ago
timlautk / polargrad
View on GitHub
PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective
☆18Oct 1, 2025Updated 9 months ago
zephyrtronium / bwst
View on GitHub
Burrows-Wheeler-Scott transform
☆14Jun 7, 2013Updated 13 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
2187Nick / ADAS
View on GitHub
Automated Design of Agentic Systems
☆10Sep 7, 2024Updated last year
eda-lab / AES-based-on-FPGA
View on GitHub
AES-based-on-FPGA developed by verilog.
☆23Apr 23, 2020Updated 6 years ago
erfanzar / ejkernel
View on GitHub
easydel jax kernels writen in triton for gpus and pallas for tpus
☆28Jul 11, 2026Updated last week
choidami / inductive-oocr
View on GitHub
☆16Mar 22, 2025Updated last year
Clybius / Personalized-Optimizers
View on GitHub
A collection of niche / personally useful PyTorch optimizers with modified code.
☆28Apr 14, 2026Updated 3 months ago
NyanCatTW1 / RedMetaClassAnalyzer
View on GitHub
Does all kind of cool stuff to make analyzing meta classes easier. Now featuring WRedLogger.py, the previous backend of NetDbg
☆10Jun 7, 2023Updated 3 years ago
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
ml-gde / jflux
View on GitHub
JAX Implementation of Black Forest Labs' Flux.1 family of models
☆40Jun 18, 2026Updated last month
riverstone496 / awesome-second-order-optimization
View on GitHub
☆32May 17, 2026Updated 2 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lsxliron / SemiSupervisedKMeans
View on GitHub
☆11Dec 8, 2016Updated 9 years ago
optsuite / LOZO
View on GitHub
☆20Dec 5, 2024Updated last year
SmerkyG / GoldFinch-paper
View on GitHub
GoldFinch and other hybrid transformer components
☆16Dec 9, 2025Updated 7 months ago
amodaresi / MemLLM
View on GitHub
☆13Aug 13, 2024Updated last year
AgoraOpus / brownian-motion
View on GitHub
An implementation of a Brownian motion using ClojureScript with re-frame and Highcharts
☆11Feb 8, 2019Updated 7 years ago
dynamic-superb / multimodal-llama
View on GitHub
The official implementation of ImageBind-LLM and Whisper-LLM from the paper "Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Compre…
☆21Oct 30, 2023Updated 2 years ago
kywch / brax-trainer
View on GitHub
Brax + Pufferlib + CARBS for gpu-accelerated robotics RL
☆12Jun 12, 2025Updated last year
Snektron / exaregex
View on GitHub
Zig regex experiment
☆13Nov 6, 2025Updated 8 months ago
global-computing-consortium / HiFloat4
View on GitHub
☆17Apr 20, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
RyunMi / NCG-Optimizer
View on GitHub
PyTorch optimizer based on nonlinear conjugate gradient method
☆31Apr 25, 2025Updated last year
thomasahle / kanmlps
View on GitHub
KANs and MLPs
☆12Jun 7, 2024Updated 2 years ago
zimingyy / SubZero
View on GitHub
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces (ICCV 2025)
☆20Nov 22, 2024Updated last year
WangXuan95 / LLMA
View on GitHub
LLMA = LLM + Arithmetic coder, which use LLM to do insane text data compression. LLMA=大模型+算术编码，它能使用LLM对文本数据进行暴力的压缩，达到极高的压缩率。
☆23Nov 24, 2024Updated last year
sekstini / gpupoor
View on GitHub
☆18Dec 2, 2024Updated last year
lukasc-ch / ExtendedBitPlaneCompression
View on GitHub
Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk…
☆18Oct 6, 2019Updated 6 years ago
rachmaninoffcode / AI_Classical_Music_Composer
View on GitHub
Building the Bi-LSTM & the CNN-GAN models to compose Classical Music in different eras
☆12Aug 2, 2021Updated 4 years ago