The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"
☆21Dec 10, 2024Updated last year
Alternatives and similar repositories for LowMemoryBP
Users that are interested in LowMemoryBP are comparing it to the libraries listed below.
- Restore Globally, Refine Locally: A Mask-Guided Scheme to Accelerate Super-Resolution Networks☆35Mar 25, 2024Updated 2 years ago
- A CPU 3D Reconstruction pipeline using COLMAP and OpenMVS☆20Nov 5, 2025Updated 7 months ago
- Repository for journal "Probabilistic-based Feature Learning of Light Fields for Compressive Imaging and Denoising"☆13Jan 16, 2024Updated 2 years ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning (INTERSPEECH 2022)☆19Nov 15, 2023Updated 2 years ago
- Official code for "Vision Transformers with Self-Distilled Registers" (NeurIPS 2025 Spotlight)☆35Dec 6, 2025Updated 6 months ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆40Sep 24, 2024Updated last year
- "Why do I feel offended?" - Korean Dataset for Offensive Language Identification (EACL2023 Findings)☆15May 14, 2023Updated 3 years ago
- My solution code to parallel architecture and programming Spring 2016☆12Aug 15, 2016Updated 9 years ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Jun 1, 2026Updated last week
- Separating Anything from Image in Context☆12May 29, 2024Updated 2 years ago
- Implementation of the paper - Fast Training of Convolutional Networks through FFTs (CUDA for parallelization)☆10May 8, 2020Updated 6 years ago
- A space dedicated for our universe.☆17Feb 10, 2024Updated 2 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 5 months ago
- The official repo of continuous speculative decoding☆34Mar 28, 2025Updated last year
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆104May 8, 2026Updated last month
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications☆52Oct 30, 2025Updated 7 months ago
- First Latency-Aware Competitive LLM Agent Benchmark☆29Jun 3, 2025Updated last year
- sast2022-pytorch-training☆11Jul 21, 2022Updated 3 years ago
- Code for CVPR 2023 paper "SViTT: Temporal Learning of Sparse Video-Text Transformers"☆21Jun 16, 2023Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- [CVPR 2024] MFP: Making Full Use of Probability Maps for Interactive Image Segmentation☆17Jul 8, 2024Updated last year
- Video stabilization using IMU motion data from internal or external logs☆21Feb 4, 2022Updated 4 years ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆86Nov 16, 2025Updated 6 months ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 5 months ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Jul 9, 2020Updated 5 years ago
- Implementation of CamTrol: Training-free Camera Control for Video Generation☆34Oct 2, 2025Updated 8 months ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models☆49Nov 5, 2024Updated last year
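- 何语言 (He Language, Metaverse Edition): a next-generation cyber-metaverse metaprogramming language, implemented with C++ template metaprogramming☆15Nov 2, 2023Updated 2 years ago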
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆71Dec 17, 2025Updated 5 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year