ziplab / efficient-stable-diffusion
☆16Updated last year
Related projects: ⓘ
- Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆56Updated 2 months ago
- PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆27Updated 2 weeks ago
- ☆17Updated last year
- ☆20Updated 9 months ago
- ☆41Updated this week
- Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"☆91Updated 6 months ago
- Memory Efficient Training Framework for Large Video Generation Model☆20Updated 4 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆23Updated 3 months ago
- DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception☆103Updated 3 weeks ago
- ☆74Updated last week
- (ICLR 2024, CVPR 2024) SparseFormer☆62Updated 5 months ago
- TerDiT: Ternary Diffusion Models with Transformers☆57Updated 3 months ago
- ☆52Updated last year
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆18Updated last week
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024)☆21Updated 3 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- OpenMMLab Detection Toolbox and Benchmark for V3Det☆15Updated 5 months ago
- Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆38Updated last month
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆84Updated last week
- This is a repo to track the latest autoregressive visual generation papers.☆20Updated last week
- ☆38Updated 9 months ago
- fixed official code for paper "A Closer Look at Parameter-Efficient Tuning in Diffusion Models".☆39Updated last year
- ☆20Updated last year
- ☆19Updated last year
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆76Updated 5 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 4 months ago
- A curated list of papers and resources for text-to-image evaluation.☆26Updated last year
- Paper survey of efficient computation for large scale models.☆24Updated 4 months ago
- Adapting LLaMA Decoder to Vision Transformer☆25Updated 3 months ago
- Video Diffusion State Space Models☆19Updated 5 months ago