The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"
☆21 · Dec 10, 2024 · Updated last year
Alternatives and similar repositories for LowMemoryBP
Users that are interested in LowMemoryBP are comparing it to the libraries listed below.
- The official code repo and data hub of top_nsigma sampling strategy for LLMs. ☆26 · Feb 11, 2025 · Updated last year
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022) ☆19 · Nov 15, 2023 · Updated 2 years ago
- python file for lilab ☆16 · Sep 11, 2025 · Updated 9 months ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024) ☆70 · Mar 27, 2025 · Updated last year
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization ☆40 · Sep 24, 2024 · Updated last year
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings) ☆15 · May 14, 2023 · Updated 3 years ago
- ☆17 · Jul 23, 2024 · Updated last year
- ☆19 · Oct 14, 2024 · Updated last year
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti… ☆65 · Apr 15, 2024 · Updated 2 years ago
- My solution code to parallel architecture and programming Spring 2016 ☆12 · Aug 15, 2016 · Updated 9 years ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs ☆44 · Jun 28, 2024 · Updated 2 years ago
- Separating Anything from Image in Context ☆12 · May 29, 2024 · Updated 2 years ago
- Official repo for vidar and vidarc: video foundation model for robotics. ☆42 · Dec 22, 2025 · Updated 6 months ago
- The official repo of continuous speculative decoding ☆35 · Mar 28, 2025 · Updated last year
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications ☆52 · Oct 30, 2025 · Updated 8 months ago
- First Latency-Aware Competitive LLM Agent Benchmark ☆29 · Jun 3, 2025 · Updated last year
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation ☆48 · Apr 21, 2026 · Updated 2 months ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers" ☆21 · Jun 16, 2023 · Updated 3 years ago
- [ICML 2024] Code for the paper "Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases" ☆38 · Jul 12, 2024 · Updated last year
- Code to reproduce experiments in Markovian Flow Matching: Accelerating MCMC with Continuous Normalizing Flows ☆14 · May 23, 2024 · Updated 2 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model ☆15 · Jul 31, 2025 · Updated 11 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding ☆88 · Nov 16, 2025 · Updated 7 months ago
- Implementation of CamTrol: Training-free Camera Control for Video Generation ☆34 · Oct 2, 2025 · Updated 9 months ago
- ☆10 · Feb 12, 2024 · Updated 2 years ago
- ☆153 · Nov 17, 2025 · Updated 7 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective ☆41 · Oct 17, 2023 · Updated 2 years ago
- Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models ☆49 · Nov 5, 2024 · Updated last year
- 何语言 (Metaverse Edition): a next-generation cyber-metaverse metaprogramming language, implemented with C++ template metaprogramming ☆15 · Nov 2, 2023 · Updated 2 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models ☆17 · Jul 17, 2024 · Updated last year
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better ☆16 · Feb 15, 2025 · Updated last year
- [ICLR 2024 Oral] Improving Convergence and Generalization Using Parameter Symmetries ☆31 · May 29, 2024 · Updated 2 years ago
- Experimental deep learning framework written in Rust ☆15 · Nov 2, 2022 · Updated 3 years ago
- ☆71 · Jul 11, 2024 · Updated last year
- 2D Gaussian Splatting implemented in Triton ☆12 · Mar 19, 2024 · Updated 2 years ago
- [TNNLS 2022] Official pytorch implementation of "Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions" ☆11 · Apr 19, 2022 · Updated 4 years ago
- ☆14 · Sep 22, 2025 · Updated 9 months ago
- ☆13 · Jun 23, 2022 · Updated 4 years ago
- ☆10 · Apr 6, 2026 · Updated 2 months ago
- Shared repository for Sast Tutor, from the training department of the Student Association for Science and Technology, Department of Electronic Engineering, Tsinghua University ☆16 · Apr 27, 2022 · Updated 4 years ago