OpenGVLab / PhyGenBench
The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation
☆68Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for PhyGenBench
- Empowering Unified MLLM with Multi-granular Visual Generation☆104Updated 3 weeks ago
- The paper collections for the autoregressive models in vision.☆95Updated last week
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]☆48Updated last week
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆40Updated 4 months ago
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆76Updated last month
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆30Updated 6 months ago
- 🔥ImageFolder: Autoregressive Image Generation with Folded Tokens☆53Updated 3 weeks ago
- T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation☆46Updated 2 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆42Updated last month
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆54Updated last month
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training☆160Updated 3 weeks ago
- ☆30Updated 2 weeks ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆88Updated last month
- The collection of awesome papers on alignment of diffusion models.☆45Updated last week
- This is the official implementation for ControlVAR.☆52Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers☆79Updated last week
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆77Updated 7 months ago
- Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for…☆50Updated this week
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 3 months ago
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆35Updated 3 weeks ago
- [ICML 2024] On Discrete Prompt Optimization for Diffusion Models - Google☆31Updated 2 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23.☆72Updated last year
- Implements VAR+CLIP for image generation☆78Updated 3 months ago
- ☆189Updated 3 months ago
- Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"☆30Updated last month
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆50Updated 5 months ago
- [NeurIPS 2024] RealCompo: Balancing Realism and Compositionality Improves Text-to-Image Diffusion Models☆106Updated last month
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆78Updated 9 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆22Updated last week