[ICML2025 Oral] LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently
☆33 · Oct 22, 2025 · Updated 8 months ago
Alternatives and similar repositories for LoRA-One
Users who are interested in LoRA-One are comparing it to the libraries listed below.
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?" ☆150 · Apr 8, 2025 · Updated last year
- ☆11 · Dec 8, 2022 · Updated 3 years ago
- [TMLR 2024] Revisiting Random Weight Perturbation for Efficiently Improving Generalization ☆12 · Oct 18, 2024 · Updated last year
- Official implementation of ICLR 2025 'LORO: Parameter and Memory Efficient Pretraining via Low-rank Riemannian Optimization' ☆18 · Apr 24, 2025 · Updated last year
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization ☆12 · Nov 29, 2024 · Updated last year
- ☆22 · Jan 23, 2024 · Updated 2 years ago
- [arXiv 2024] FairVision: Equitable Deep Learning for Eye Disease Screening via Fair Identity Scaling ☆18 · Apr 15, 2026 · Updated 2 months ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations ☆22 · Mar 25, 2023 · Updated 3 years ago
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations ☆19 · Oct 18, 2025 · Updated 8 months ago
- Workable training script for ControlNet tile ☆35 · May 2, 2024 · Updated 2 years ago
- (MICCAI-2025) MedDiff-FT: Data-Efficient Diffusion Model Fine-tuning with Structural Guidance for Controllable Medical Image Synthesis ☆19 · Jul 11, 2025 · Updated 11 months ago
- HINT: High-quality INpainting Transformer with Enhanced Attention and Mask-aware Encoding ☆58 · Jan 14, 2025 · Updated last year
- ☆17 · Dec 11, 2022 · Updated 3 years ago
- [SIGGRAPH ASIA 2025] This is the official implementation of the SIGGRAPH ASIA 2025 : Hierarchical Neural Semantic Representation for 3D S… ☆19 · Dec 21, 2025 · Updated 6 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA. ☆14 · Nov 17, 2023 · Updated 2 years ago
- The first large scale formally verified reasoning dataset for Verilog ☆21 · May 16, 2025 · Updated last year
- Recursive Self-Aggregation evals on ARC-AGI ☆36 · Jan 26, 2026 · Updated 5 months ago
- ☆11 · Feb 26, 2024 · Updated 2 years ago
- ☆19 · Aug 23, 2025 · Updated 10 months ago
- ☆13 · Oct 14, 2024 · Updated last year
- LGEB: Benchmark of Language Generation Evaluation ☆16 · Oct 21, 2022 · Updated 3 years ago
- [CVPR 2024] Friendly Sharpness-Aware Minimization ☆36 · Oct 29, 2024 · Updated last year
- KV cache compression via sparse coding ☆18 · Oct 26, 2025 · Updated 8 months ago
- [ICLR ML4RS 2025] Official implementation for the paper "Tackling Few-Shot Segmentation in Remote Sensing via Inpainting Diffusion Model" ☆14 · Feb 2, 2026 · Updated 5 months ago
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products ☆19 · Jul 16, 2025 · Updated 11 months ago
- A flexible training-free diffusion-based method for generating tileable image sets, including self-tiling images, stochastic self-tiling … ☆22 · May 26, 2025 · Updated last year
- [COLING 2025 Industry] LoRA Soups ☆20 · Nov 29, 2024 · Updated last year
- [CVPR 2022 oral] Subspace Adversarial Training ☆28 · Apr 27, 2023 · Updated 3 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections" ☆16 · May 31, 2024 · Updated 2 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models" ☆35 · Mar 26, 2026 · Updated 3 months ago
- ☆13 · Apr 19, 2025 · Updated last year
- Code for Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects ☆12 · Mar 5, 2026 · Updated 3 months ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script ☆13 · Sep 16, 2020 · Updated 5 years ago
- [ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions ☆28 · Feb 11, 2025 · Updated last year
- ☆12 · Jan 16, 2025 · Updated last year
- ☆17 · Feb 21, 2025 · Updated last year
- CVPR2025-Multi-party Collaborative Attention Control for Image Customization ☆17 · May 14, 2025 · Updated last year
- Interaction Compass: Multi-Label Zero-Shot Learning of Human-Object Interactions via Spatial Relations @ ICCV21 ☆13 · Jul 15, 2022 · Updated 3 years ago