YuanheZ / LoRA-OneLinks

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently (ICML2025 Oral)

☆24

Alternatives and similar repositories for LoRA-One

Users that are interested in LoRA-One are comparing it to the libraries listed below

Sorting:

pixas / NoRM
ICLR 2025
☆29Updated 6 months ago
ambisinister / lossfreebalance
toy reproduction of Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts
☆25Updated last year
mzf666 / LORO-main
Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization'
☆13Updated 6 months ago
NUS-HPC-AI-Lab / DD-Ranking
Data distillation benchmark
☆71Updated 5 months ago
horseee / dKV-Cache
[NeurIPS'25] dKV-Cache: The Cache for Diffusion Language Models
☆119Updated 6 months ago
zju-vipa / training_free_model_merging
This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).
☆32Updated last year
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆160Updated 4 months ago
czg1225 / dParallel
dParallel: Learnable Parallel Decoding for dLLMs
☆42Updated last month
ThisisBillhe / ZipAR
[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…
☆53Updated 7 months ago
xie-lab-ml / Meissonic-Inference
Bag of Design Choices for Inference of High-Resolution Masked Generative Transformer
☆16Updated last year
yu-rp / Dimple
Dimple, the first Discrete Diffusion Multimodal Large Language Model
☆109Updated 4 months ago
pixeli99 / Prophet
Official implementation of "Diffusion Language Models Know the Answer Before Decoding"
☆39Updated 2 months ago
yu-rp / NeuralLineage
Code for CVPR 2024 Oral "Neural Lineage"
☆17Updated last year
czg1225 / VeriThinker
[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient
☆61Updated last month
StargazerX0 / ScaleKV
[NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression
☆51Updated 2 weeks ago
ML-GSAI / Scaling-Diffusion-Transformers-muP
[NeurIPS 2025] Official implementation for our paper "Scaling Diffusion Transformers Efficiently via μP".
☆92Updated 3 weeks ago
aim-uofa / LoRAPrune
☆61Updated 11 months ago
VectorSpaceLab / EditScore
EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling
☆164Updated 2 weeks ago
zkx06111 / ReDiffusion
☆17Updated 2 years ago
mrflogs / LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "
☆135Updated 7 months ago
byeongjun-park / Switch-DiT
[ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"
☆47Updated last year
lliai / Awesome-Efficient-Diffusion-Models
Paper survey of efficient computation for large scale models.
☆34Updated 11 months ago
NUS-HPC-AI-Lab / DATM
ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching
☆102Updated last year
horseee / learning-to-cache
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
☆116Updated last year
huanranchen / LLMLandscape
The loss landscape of Large Language Models resemble basin!
☆33Updated 4 months ago
Pepper-lll / LMforImageGeneration
Codebase for the paper-Elucidating the design space of language models for image generation
☆46Updated last year
LeapLabTHU / ImprovedNAT
A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"
☆46Updated last year
revelio-diffusion / revelio
☆24Updated 4 months ago
czg1225 / CoDe
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆107Updated last month
HelmholtzAI-FZJ / flex_gen
☆19Updated 10 months ago