IST-DASLab/MicroAdam

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/IST-DASLab/MicroAdam)

IST-DASLab / MicroAdam

This repository contains code for the MicroAdam paper.

☆21

Alternatives and similar repositories for MicroAdam

Users that are interested in MicroAdam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

stephenqz / OATS
View on GitHub
Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition
☆20Apr 16, 2025Updated last year
IST-DASLab / HALO
View on GitHub
HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…
☆31Feb 17, 2025Updated last year
exo-explore / evML
View on GitHub
Resources regarding evML (edge verified machine learning)
☆24Jan 4, 2025Updated last year
sgl-project / whl
View on GitHub
SGLang Kernel Wheel Index
☆24Updated this week
DS3Lab / CocktailSGD
View on GitHub
☆27Aug 25, 2023Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ylsung / rsq
View on GitHub
Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"
☆23Mar 25, 2026Updated 3 months ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
Unicorn-Commander / Center-Deep
View on GitHub
Privacy-focused metasearch engine with one-click setup, beautiful admin panel, and AI integration. Fork of SearXNG.
☆20Sep 7, 2025Updated 10 months ago
song-wx / SIFT
View on GitHub
[ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely
☆24Jun 26, 2024Updated 2 years ago
IST-DASLab / SparseFinetuning
View on GitHub
Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry
☆43Jan 15, 2024Updated 2 years ago
yuxwind / CBS
View on GitHub
Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]
☆16Sep 20, 2022Updated 3 years ago
caijixueIT / CUDA_Learning_for_Freshman
View on GitHub
☆14Nov 3, 2025Updated 8 months ago
epfml / schedules-and-scaling
View on GitHub
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93Oct 30, 2024Updated last year
lzhangbv / acpsgd
View on GitHub
[ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning
☆10Apr 28, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gallen881 / Physics_Master
View on GitHub
Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!
☆16Aug 24, 2024Updated last year
jiwonsong-dev / SLEB
View on GitHub
[ICML 2024] Official Implementation of SLEB: Streamlining LLMs through Redundancy Verification and Elimination of Transformer Blocks
☆41Feb 4, 2025Updated last year
IST-DASLab / torch_cgx
View on GitHub
Pytorch distributed backend extension with compression support
☆17Mar 24, 2025Updated last year
sfox14 / block_minifloat
View on GitHub
Training with Block Minifloat number representation
☆18May 2, 2021Updated 5 years ago
ezelikman / justonebyte
View on GitHub
☆10Jun 19, 2023Updated 3 years ago
Pathos14489 / Pantella
View on GitHub
☆16Updated this week
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
IST-DASLab / Sparse-Marlin
View on GitHub
Boosting 4-bit inference kernels with 2:4 Sparsity
☆96Sep 4, 2024Updated last year
ZhengaoLi / DISP-LLM-Dimension-Independent-Structural-Pruning
View on GitHub
An implementation of the DISP-LLM method from the NeurIPS 2024 paper: Dimension-Independent Structural Pruning for Large Language Models.
☆24Aug 6, 2025Updated 11 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Model-Context-Interface / mci-uvx
View on GitHub
CLI tool designed to manage MCI (Model Context Interface) schemas and dynamically run MCP servers using defined MCI toolsets
☆16Nov 12, 2025Updated 8 months ago
biomedical-cybernetics / Relative-importance-and-activation-pruning
View on GitHub
☆60Jun 10, 2024Updated 2 years ago
zaydzuhri / softpick-attention
View on GitHub
Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"
☆91Sep 12, 2025Updated 10 months ago
chenzx921020 / MoEQuant
View on GitHub
☆17Apr 7, 2025Updated last year
zhuhanqing / Lightening-Transformer-AE
View on GitHub
Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…
☆11Mar 3, 2024Updated 2 years ago
tenstorrent / tt-buda-benchmarks
View on GitHub
Repository for AI model benchmarking on TT-Buda
☆16Feb 9, 2026Updated 5 months ago
g588928812 / qlora
View on GitHub
QLoRA: Efficient Finetuning of Quantized LLMs
☆11Jul 22, 2023Updated 2 years ago
IST-DASLab / spdy
View on GitHub
Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"
☆20May 3, 2023Updated 3 years ago
syjmelody / RankE
View on GitHub
Implementation of RankE: End-to-End Discrete Text-to-Image Post-Training via Rank-Consistent Alignment
☆20May 27, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
pudo-attic / addressformatting
View on GitHub
International Address formatter which considers the standard formatting rules of the country
☆14Nov 21, 2024Updated last year
VITA-Group / R-Sparse
View on GitHub
[ICLR'25] R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
☆21Apr 28, 2025Updated last year
mwbini / ether
View on GitHub
[ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"
☆16May 31, 2024Updated 2 years ago
nisten / grokadamw
View on GitHub
new optimizer
☆20Aug 4, 2024Updated last year
kabachuha / nanoGPKANT
View on GitHub
Testing KAN-based text generation GPT models
☆19May 6, 2024Updated 2 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
RUCAIBox / QuantizedEmpirical
View on GitHub
☆15Sep 24, 2023Updated 2 years ago