garyfanhku/Galore-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/garyfanhku/Galore-pytorch)

garyfanhku / Galore-pytorch

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

☆22

Alternatives and similar repositories for Galore-pytorch

Users that are interested in Galore-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vlievin / gan-experiments-pytorch
View on GitHub
Experiments with GAN, WGAN, WGAN-GP, DC-GAN, cGAN, AC,GAN and pix2pix
☆10May 28, 2019Updated 7 years ago
facebookresearch / language-model-plasticity
View on GitHub
Official code for the paper Improving Language Plasticity via Pretraining with Active Forgetting, NeurIPS 2023
☆21Mar 12, 2026Updated 4 months ago
zwhe99 / RaSA
View on GitHub
[ICLR 2025] RaSA: Rank-Sharing Low-Rank Adaptation
☆10May 19, 2025Updated last year
ChasonShi / MELoRA
View on GitHub
code for ACL24 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning"
☆34Feb 19, 2025Updated last year
linhaowei1 / CLoG
View on GitHub
✌ CLoG: Benchmarking Continual Learning of Image Generation Models
☆20Jun 10, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
KAI-YUE / rog
View on GitHub
☆16Aug 29, 2023Updated 2 years ago
OsmanMalik / tr-als-sampled
View on GitHub
Code for our ICML 2021 paper titled "A Sampling-Based Method for Tensor Ring Decomposition"
☆11Mar 19, 2024Updated 2 years ago
Qznan / SpanKL
View on GitHub
Code for paper: A Neural Span-Based Continual Named Entity Recognition Model
☆18Dec 11, 2023Updated 2 years ago
georgetown-cset / ai-relevant-papers
View on GitHub
Replication materials for "Identifying the Development and Application of Artificial Intelligence in Scientific Text"
☆14Feb 18, 2020Updated 6 years ago
cityuhkai / SBoRA
View on GitHub
☆11Sep 9, 2024Updated last year
Kowsher / Propulsion
View on GitHub
☆19Nov 30, 2024Updated last year
wangjs9 / Aligned-dPM
View on GitHub
PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach
☆32Nov 6, 2023Updated 2 years ago
starrYYxuan / LeCo
View on GitHub
This the implementation of LeCo
☆33Jan 20, 2025Updated last year
jiaweizzhao / GaLore
View on GitHub
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,699Oct 28, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
With-Coding-Cat / LG_plant_disease_diagnosis_competition
View on GitHub
☆13Feb 9, 2022Updated 4 years ago
didizhu-judy / Model-Tailor
View on GitHub
[ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models
☆34Sep 1, 2024Updated last year
Gyeongmin47 / KoCHET-A-Korean-Cultural-Heritage-corpus-for-Entity-related-Tasks
View on GitHub
☆13Nov 30, 2022Updated 3 years ago
WangRongsheng / Statistical-learning-method-lihang
View on GitHub
《统计学习方法》，作者李航，本书全面系统地介绍了统计学习的主要内容
☆38Oct 3, 2019Updated 6 years ago
gilfernandes / complex_chain_playground
View on GitHub
Playground project acting as an example for a complex LangChain workflow
☆11Jun 20, 2023Updated 3 years ago
zeroxleo / HyperGT
View on GitHub
The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"
☆21Nov 23, 2024Updated last year
kh-kim / deeplearning_with_pytorch
View on GitHub
☆12Mar 8, 2020Updated 6 years ago
mkmenta / rag-chatgpt
View on GitHub
This is a simple lab I have implemented to test Knowledge Augmented or Retrieval Augmented Generation (RAG) with Large Language Models. I…
☆10Dec 10, 2023Updated 2 years ago
Meteor-han / ReaMVP
View on GitHub
☆16Aug 5, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ahmdtaha / distributed_sigmoid_loss
View on GitHub
Unofficial implementation for Sigmoid Loss for Language Image Pre-Training
☆11Sep 26, 2023Updated 2 years ago
Andrewzh112 / Awesome-LLM-based-MultiAgents
View on GitHub
☆28Oct 9, 2024Updated last year
wzhuang-xmu / LoSA
View on GitHub
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆25Mar 16, 2025Updated last year
wwh0411 / MCP-Flow
View on GitHub
[ACL 2026 Main] MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools.
☆24Apr 8, 2026Updated 3 months ago
Yiwei98 / ESC
View on GitHub
☆14Jul 17, 2025Updated last year
pixas / NoRM
View on GitHub
ICLR 2025
☆30May 21, 2025Updated last year
kylehkhsu / tripod
View on GitHub
☆12Apr 19, 2024Updated 2 years ago
GarrettJenkinson / condor_pytorch
View on GitHub
CONditionals for Ordinal Regression and classification in PyTorch
☆12Nov 5, 2022Updated 3 years ago
chang-github-00 / Predictive-Decoding
View on GitHub
Repo for Anonymous purpose, pls don't distribute
☆10Oct 2, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chaochun / nlu-asdiv-dataset
View on GitHub
☆52Jul 4, 2023Updated 3 years ago
TsinghuaC3I / SoRA
View on GitHub
[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models
☆87Mar 5, 2024Updated 2 years ago
Shwai-He / SparseAdapter
View on GitHub
Source code of EMNLP 2022 Findings paper "SparseAdapter: An Easy Approach for Improving the Parameter-Efficiency of Adapters"
☆23Feb 28, 2026Updated 4 months ago
Jussmith01 / ANI-Tools
View on GitHub
☆11Aug 29, 2022Updated 3 years ago
intervention-training / int
View on GitHub
☆16Feb 4, 2026Updated 5 months ago
buschman-lab / RotationalDynamics
View on GitHub
Code for creating recurrent neural network with rotational dynamics. Model is discussed in detail in "Rotational Dynamics Reduce Interfer…
☆17Jul 23, 2020Updated 5 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago