yyyyychen / LowMemoryBP
The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"
☆19Updated 2 months ago
Alternatives and similar repositories for LowMemoryBP:
Users that are interested in LowMemoryBP are comparing it to the libraries listed below
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆34Updated 7 months ago
- [ECCV 2024] This is the official implementation of "Stitched ViTs are Flexible Vision Backbones".☆27Updated last year
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆29Updated 4 months ago
- Triton implement of bi-directional (non-causal) linear attention☆42Updated last week
- ☆12Updated 2 months ago
- ☆52Updated last year
- The official repo of continuous speculative decoding☆24Updated 2 months ago
- LLaVA combines with Magvit Image tokenizer, training MLLM without an Vision Encoder. Unifying image understanding and generation.☆35Updated 7 months ago
- GIFT: Generative Interpretable Fine-Tuning☆20Updated 4 months ago
- A torch-based implementation of K-Means and K-Means++☆17Updated 4 years ago
- ☆37Updated 3 months ago
- ☆17Updated last month
- Inference-only implementation of "One-Step Diffusion Distillation through Score Implicit Matching" [NIPS 2024]☆77Updated 2 months ago
- LMM which strictly superset LLM embedded☆37Updated 3 months ago
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆16Updated 2 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆26Updated last month
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆20Updated 6 months ago
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆37Updated 5 months ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆17Updated 10 months ago
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"☆15Updated 3 months ago
- Officail Repo of γ -MOD: Mixture-of-Depth Adaptation for Multimodal Large Language Models☆29Updated this week
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆34Updated 7 months ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆27Updated 11 months ago
- A curated list of papers and resources for text-to-image evaluation.☆27Updated last year
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference☆27Updated 11 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated 10 months ago
- minisora-DiT, a DiT reproduction based on XTuner from the open source community MiniSora☆40Updated 10 months ago
- Moved to https://github.com/NUS-HPC-AI-Lab/InfoBatch☆6Updated last year
- [NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effect…☆35Updated 7 months ago
- ☆37Updated last year