ali-vilab / matrix
☆17Updated last week
Alternatives and similar repositories for matrix:
Users who are interested in matrix are comparing it to the repositories listed below
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆52Updated 7 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆62Updated last week
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 4 months ago
- Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.☆70Updated this week
- ☆40Updated 8 months ago
- The collection of awesome papers on alignment of diffusion models.☆149Updated 3 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆88Updated 2 weeks ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆85Updated last month
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]☆16Updated 6 months ago
- LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models☆52Updated 7 months ago
- The official code of "Weak-to-Strong Diffusion with Reflection".☆38Updated last month
- [NeurIPS 2024] VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models☆137Updated 6 months ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 8 months ago
- Streaming Video Diffusion: Online Video Editing with Diffusion Models☆17Updated 9 months ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆72Updated this week
- [NeurIPS 2024] The official implementation of the research paper "FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆41Updated last month
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆115Updated 4 months ago
- [CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project/☆129Updated last week
- The code of our work "Golden Noise for Diffusion Models: A Learning Framework".☆145Updated last month
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆86Updated 2 months ago
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆47Updated 5 months ago
- [ICLR 2025] Official code implementation of DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation☆95Updated last month
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆56Updated 9 months ago
- Code release for our NeurIPS 2024 Spotlight paper "GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing"☆113Updated 5 months ago
- [CVPR2024] The official implementation of paper Relation Rectification in Diffusion Model☆47Updated 6 months ago
- [CVPR 2024] Dynamic Prompt Optimizing for Text-to-Image Generation☆68Updated 8 months ago
- Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".☆138Updated last year
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation☆61Updated 5 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆100Updated last year
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆63Updated last month