feizc / Vespa
Video Diffusion State Space Models
☆19Updated 10 months ago
Alternatives and similar repositories for Vespa:
Users that are interested in Vespa are comparing it to the libraries listed below
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆46Updated 4 months ago
- Video Diffusion Transformers are In-Context Learners☆17Updated last month
- Official PyTorch Implementation for Shape-Guided Diffusion with Inside-Outside Attention, WACV 2024☆37Updated last year
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆44Updated 2 months ago
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆35Updated 2 months ago
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- ☆20Updated 7 months ago
- ☆40Updated last year
- MCPL: MULTI-CONCEPT PROMPT LEARNING☆20Updated 8 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆28Updated 3 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆26Updated 9 months ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆21Updated 4 months ago
- Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing☆22Updated 2 months ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆13Updated last year
- ☆16Updated last year
- Compositional Inversion for Stable Diffusion Models (AAAI 2024)☆35Updated 10 months ago
- Codebase for the paper-Elucidating the design space of language models for image generation☆45Updated 2 months ago
- [CVPR 2024] Official PyTorch implementation of "ECLIPSE: Revisiting the Text-to-Image Prior for Efficient Image Generation"☆62Updated 9 months ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Updated last year
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆29Updated 2 months ago
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆32Updated 3 weeks ago
- ☆25Updated 5 months ago
- The official code of "Concept-centric Personalization with Large-scale Diffusion Priors".☆17Updated last year
- Vico: Compositional Video Generation as Flow Equalization☆57Updated 3 months ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22Updated 9 months ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion☆35Updated 6 months ago
- ☆24Updated 9 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 7 months ago