haoningwu3639 / SimpleSDM-Video
A simple and flexible PyTorch implementation of Video StableDiffusion (ZeroScope_v2) based on diffusers.
☆15Updated 7 months ago
Related projects: ⓘ
- A simple and flexible PyTorch implementation of StableDiffusion based on diffusers.☆22Updated 9 months ago
- ☆16Updated this week
- Code Release of F-LMM: Grounding Frozen Large Multimodal Models☆35Updated last month
- [ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization☆53Updated 10 months ago
- A simple and flexible PyTorch implementation of StableDiffusion-XL based on diffusers.☆12Updated 2 weeks ago
- Visual self-questioning for large vision-language assistant.☆22Updated 3 weeks ago
- Official code for CVPR 2024 paper: Discriminative Probing and Tuning for Text-to-Image Generation☆23Updated 2 weeks ago
- [CVPR '23] Unite and Conquer: Plug & Play Multi-Modal Synthesis using Diffusion Models☆32Updated 5 months ago
- The collection of awesome papers on alignment of diffusion model.☆21Updated last week
- ☆19Updated last year
- ☆104Updated 3 months ago
- ☆35Updated 3 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆68Updated 10 months ago
- A Simple Plugin for Transforming Images to Arbitrary Scales☆18Updated last year
- ☆49Updated 11 months ago
- ☆26Updated last week
- ☆55Updated 11 months ago
- Video Diffusion State Space Models☆19Updated 5 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆75Updated 2 months ago
- [ECCV2024] ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference☆37Updated last month
- A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting [ECCV 2024]☆50Updated 7 months ago
- DiverGen (CVPR 2024) & BSGAL (ICML 2024)☆33Updated 3 weeks ago
- Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆38Updated last month
- The benchmark for "Video Object Segmentation in Panoptic Wild Scenes".☆11Updated 11 months ago
- [ICML2024]The official implementation of SemiRES in PyTorch.☆18Updated 3 months ago
- ☆52Updated last year
- [CVPR-2023] The official dataset of Advancing Visual Grounding with Scene Knowledge: Benchmark and Method.☆28Updated last year
- PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"☆23Updated 6 months ago
- A PyTorch implementation of the paper "Revisiting Non-Autoregressive Transformers for Efficient Image Synthesis"☆26Updated 3 months ago
- ☆16Updated last year