yyyyychen / LowMemoryBP
The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"
☆19Updated 3 months ago
Alternatives and similar repositories for LowMemoryBP:
Users that are interested in LowMemoryBP are comparing it to the libraries listed below
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 8 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆29Updated 5 months ago
- ☆52Updated last year
- i-mae Pytorch Repo☆21Updated 11 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- Triton implement of bi-directional (non-causal) linear attention☆43Updated last month
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆28Updated 2 months ago
- A torch-based implementation of K-Means and K-Means++☆17Updated 4 years ago
- ☆16Updated last year
- ☆15Updated last year
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated last year
- ☆15Updated 3 months ago
- Collect papers about Mamba (a selective state space model).☆14Updated 7 months ago
- ☆38Updated last year
- GIFT: Generative Interpretable Fine-Tuning☆20Updated 5 months ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆21Updated 7 months ago
- ☆22Updated 5 months ago
- The official repo of continuous speculative decoding☆24Updated 3 months ago
- Mixture of Attention Heads☆41Updated 2 years ago
- LMM solved catastrophic forgetting, AAAI2025☆39Updated 4 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated 10 months ago
- ☆19Updated last year
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 11 months ago
- ImaginaryNet: Learning Object Detectors without Real Images and Annotations☆26Updated 2 years ago
- [ICCV 2023] CLR: Channel-wise Lightweight Reprogramming for Continual Learning☆29Updated 9 months ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)☆29Updated 6 months ago
- Adapting LLaMA Decoder to Vision Transformer☆26Updated 9 months ago
- [EMNLP 2024] Official code for "Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models"☆15Updated 4 months ago