kyleliang919/Online-Subspace-Descent

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kyleliang919/Online-Subspace-Descent)

kyleliang919 / Online-Subspace-Descent

[NeurIPS 2024] Low rank memory efficient optimizer without SVD

☆33

Alternatives and similar repositories for Online-Subspace-Descent

Users that are interested in Online-Subspace-Descent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kyleliang919 / Super_Muon
View on GitHub
☆68Mar 21, 2025Updated last year
jiangycTarheel / SQ-Transformer
View on GitHub
☆10Feb 12, 2024Updated 2 years ago
xypan0 / G-DIG
View on GitHub
☆12Jun 30, 2024Updated 2 years ago
drarijitdas / Natural-GaLore
View on GitHub
An extention to the GaLore paper, to perform Natural Gradient Descent in low rank subspace
☆19Oct 21, 2024Updated last year
zqOuO / GWT
View on GitHub
☆13May 4, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
whyNLP / tinyllama
View on GitHub
A side project that follows all the acceleration tricks in tinyllama, with the minimal modification to the huggingface transformers code.
☆13Sep 2, 2024Updated last year
eryk-mazus / sigh
View on GitHub
Seamless Voice Interactions with LLMs
☆12Oct 28, 2023Updated 2 years ago
Cranial-XIX / DualSMC
View on GitHub
☆12Jul 15, 2020Updated 6 years ago
ZO-Bench / ZO-LLM
View on GitHub
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆128Jul 6, 2025Updated last year
lqiang67 / generative-models-on-toys
View on GitHub
generative models on toys
☆12Sep 10, 2024Updated last year
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
ttf248 / game_helper
View on GitHub
玩游戏是为了找乐子，策划设计的无聊任务，靠边站
☆10Jan 22, 2026Updated 6 months ago
osehmathias / lisa
View on GitHub
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆38Apr 4, 2024Updated 2 years ago
argolab / dyna-R
View on GitHub
Dyna built on R-exprs (First Prototype)
☆17Mar 7, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jiaweizzhao / GaLore
View on GitHub
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,699Oct 28, 2024Updated last year
lunary-ai / llm-benchmarks
View on GitHub
LLM benchmarks
☆13Feb 22, 2024Updated 2 years ago
facebookresearch / GCD
View on GitHub
Computing the greatest common divisor with transformers, source code for the paper https//arxiv.org/abs/2308.15594
☆14Aug 11, 2025Updated 11 months ago
timmytonga / sn-sm
View on GitHub
Subset-Norm and Subset-Momentum. This repo is built on top of https://github.com/jiaweizzhao/GaLore.
☆19Jul 9, 2025Updated last year
microsoft / TransformerCompression
View on GitHub
For releasing code related to compression methods for transformers, accompanying our publications
☆462Jan 16, 2025Updated last year
purbeshmitra / MOTIF
View on GitHub
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
☆17Jul 6, 2025Updated last year
Cranial-XIX / longhorn
View on GitHub
Official PyTorch Implementation of the Longhorn Deep State Space Model
☆57Dec 4, 2024Updated last year
wizard1203 / FuseFL
View on GitHub
FuseFL: One-Shot Federated Learning through the Lens of Causality with Progressive Model Fusion (NeurIPS 2024 Spotlight)
☆15Mar 31, 2025Updated last year
criticalml-uw / TamperBench
View on GitHub
Toolkit to benchmark the tamper-resistance of LLMs.
☆28May 15, 2026Updated 2 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
deemeetree / infranodus
View on GitHub
A Node.Js / Neo4J tool that translates words and relations into network graphs and shows you how it all connects.
☆13Oct 24, 2019Updated 6 years ago
L3030 / FedCyBGD
View on GitHub
The implement of FedCyBGD
☆12Jul 19, 2024Updated 2 years ago
sahagobinda / SGP
View on GitHub
Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".
☆17Jun 28, 2023Updated 3 years ago
tianyi-lab / R2-T2
View on GitHub
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19Mar 10, 2025Updated last year
epfml / quasi-global-momentum
View on GitHub
☆11Dec 23, 2022Updated 3 years ago
uiuctml / Localize-and-Stitch
View on GitHub
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic
☆32Feb 18, 2026Updated 5 months ago
Intelligent-Computing-Lab-Panda / TesseraQ
View on GitHub
☆25Oct 31, 2024Updated last year
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
wenlai-lavine / jola
View on GitHub
Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning
☆28Jun 18, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ZJU-REAL / HBPO
View on GitHub
☆34Aug 11, 2025Updated 11 months ago
SplittyDev / flux1-cli
View on GitHub
Easy local FLUX.1 Inference
☆11Aug 29, 2024Updated last year
Antondfger / Does-It-Look-Sequential
View on GitHub
Code for ACM RecSys 2024 paper 'Does It Look Sequential? An Analysis of Datasets for Evaluation of Sequential Recommendations' and the ex…
☆28Mar 19, 2025Updated last year
THU-KEG / SafetyNeuron
View on GitHub
Data and code for the paper: Finding Safety Neurons in Large Language Models
☆30Jan 29, 2026Updated 6 months ago
themrzmaster / git-re-basin-pytorch
View on GitHub
Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch
☆78Feb 9, 2023Updated 3 years ago
lixilinx / psgd_torch
View on GitHub
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆199May 30, 2026Updated last month
Cranial-XIX / metric-residual-network
View on GitHub
Official PyTorch Implementation for Metric Residual Networks for Sample Efficient Goal-Conditioned Reinforcement Learning
☆20Jan 11, 2023Updated 3 years ago