haoningwu3639 / SimpleSDM-Video
A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆16Updated 11 months ago
Alternatives and similar repositories for SimpleSDM-Video:
Users that are interested in SimpleSDM-Video are comparing it to the libraries listed below
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers.☆22Updated 3 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-3 based on diffusers for DIY and finetuning.☆16Updated this week
- ☆16Updated last month
- ☆19Updated 4 months ago
- A Simple Plugin for Transforming Images to Arbitrary Scales☆18Updated last year
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆60Updated 5 months ago
- CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient☆75Updated last month
- VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection☆45Updated this week
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.☆14Updated 4 months ago
- Official code for CVPR 2024 paper, "Audio-Visual Segmentation via Unlabeled Frame Exploitation""☆11Updated 6 months ago
- XQ-GAN🚀: An Open-source Image Tokenization Framework for Autoregressive Generation☆179Updated last month
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆95Updated 3 weeks ago
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆31Updated 10 months ago
- Official PyTorch code of "Grounded Question-Answering in Long Egocentric Videos", accepted by CVPR 2024.☆56Updated 4 months ago
- ☆37Updated 3 months ago
- [NeurIPS 2024] Efficient Multi-modal Models via Stage-wise Visual Context Compression☆50Updated 5 months ago
- ☆117Updated 6 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆39Updated 2 weeks ago
- ☆14Updated this week
- ☆57Updated last year
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆73Updated 6 months ago
- ☆16Updated last year
- Official Implementation of VideoDPO☆37Updated last week
- This is the official implementation for ControlVAR.☆91Updated last month
- Official implementation of TagAlign☆34Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation☆40Updated last month
- ☆44Updated last year
- ☆19Updated 11 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 3 months ago