OpenGVLab / PhyGenBench
Code and data for the paper "Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation"
☆71 Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for PhyGenBench
- Empowering Unified MLLM with Multi-granular Visual Generation ☆106 Updated last month
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24). ☆31 Updated 6 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression" ☆76 Updated last month
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation ☆47 Updated 2 months ago
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆50 Updated this week
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook ☆42 Updated 4 months ago
- This is a repo to track the latest autoregressive visual generation papers. ☆47 Updated this week
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆55 Updated last month
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis ☆84 Updated 4 months ago
- Implements VAR+CLIP for image generation ☆78 Updated 3 months ago
- The collection of awesome papers on alignment of diffusion models. ☆47 Updated 2 weeks ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation ☆77 Updated 7 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens ☆55 Updated this week
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆93 Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆91 Updated 2 weeks ago
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models ☆50 Updated 5 months ago
- This is the official implementation for ControlVAR. ☆52 Updated last month
- Denoising Diffusion Step-aware Models (ICLR2024) ☆52 Updated 9 months ago
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆161 Updated last month
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization" ☆75 Updated 7 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions" ☆36 Updated last month
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding ☆23 Updated 2 weeks ago
- RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with t… ☆105 Updated 4 months ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google ☆33 Updated 3 months ago
- 🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)" ☆155 Updated 7 months ago
- [ICCV 2023] Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models ☆78 Updated last year