XuweiyiChen / Pix2Gif
☆10Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for Pix2Gif
- 扩散模型算法基础文档、训练、实验、部署等仓库☆32Updated 5 months ago
- ACM MM'23 (oral), SUR-adapter for pre-trained diffusion models can acquire the powerful semantic understanding and reasoning capabilities…☆117Updated 7 months ago
- GlyphDraw2: Automatic Generation of Complex Glyph Posters with Diffusion Models and Large Language Models☆46Updated 4 months ago
- [ECCV 2024] Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models☆60Updated 3 weeks ago
- MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (无需额外训练为任意扩散模型支持多语言能力)☆127Updated 5 months ago
- A list for Text-to-Video, Image-to-Video works☆187Updated last month
- ☆176Updated 4 months ago
- Official code of SmartEdit [CVPR-2024 Highlight]☆258Updated 5 months ago
- The official implementation of "Relay Diffusion: Unifying diffusion process across resolutions for image synthesis" [ICLR 2024 Spotlight]☆272Updated 6 months ago
- 📖 This is a repository for organizing papers, codes and other resources related to unified multimodal models.☆217Updated 2 weeks ago
- 🔥🔥First-ever hour scale video understanding models☆169Updated 3 weeks ago
- An initiative to replicate Sora☆99Updated 7 months ago
- https://www.shoufachen.com/Awesome-Diffusion-Transformers/☆120Updated 8 months ago
- ☆35Updated 5 months ago
- A Training-free Iterative Framework for Long Story Visualization☆62Updated this week
- Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions (NeurIPS 2024)☆145Updated 3 months ago
- ☆93Updated 4 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".☆124Updated 10 months ago
- Scaling Diffusion Transformers with Mixture of Experts☆207Updated 2 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆107Updated last week
- [NeurIPS 2024] VideoTetris: Towards Compositional Text-To-Video Generation☆206Updated 2 weeks ago
- [NeurIPS 2023] Customize spatial layouts for conditional image synthesis models, e.g., ControlNet, using GPT☆132Updated 6 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆81Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆98Updated 2 weeks ago
- ☆77Updated 6 months ago
- The official implementation of Latte: Latent Diffusion Transformer for Video Generation.☆32Updated 9 months ago
- ClassDiffusion: Official impl. of Paper "ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance"☆33Updated 4 months ago
- [CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"☆233Updated 7 months ago
- [CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models☆207Updated last month