google-deepmind / physics-IQ-benchmark
Benchmarking physical understanding in generative video models
☆38Updated this week
Alternatives and similar repositories for physics-IQ-benchmark:
Users that are interested in physics-IQ-benchmark are comparing it to the libraries listed below
- ☆101Updated 2 weeks ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆70Updated 3 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.☆72Updated 5 months ago
- ArXiv paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆60Updated 3 months ago
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.☆87Updated 7 months ago
- Official code for paper: Text-to-Image Rectified Flow as Plug-and-Play Priors☆94Updated 2 months ago
- This is a PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Framework for Cross-Modality Evolu…☆125Updated 2 weeks ago
- The official PyTorch implementation of Consistent3D (CVPR 2024)☆77Updated 10 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆46Updated 6 months ago
- [ECCV2024] Vista3D: Unravel the 3D Darkside of a Single Image☆55Updated 4 months ago
- Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]☆72Updated 8 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆62Updated 3 months ago
- Code for ICLR 2024 paper "Motion Guidance: Diffusion-Based Image Editing with Differentiable Motion Estimators"☆95Updated 11 months ago
- Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.☆44Updated last year
- Semantic Score Distillation Sampling for Compositional Text-to-3D Generation☆38Updated 3 months ago
- [CVPR 2024] SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds☆32Updated 7 months ago
- ☆44Updated 3 weeks ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆90Updated 2 months ago
- Trans4D: Realistic Geometry-Aware Transition for Compositional Text-to-4D Synthesis☆52Updated 3 months ago
- Learning Naturally Aggregated Appearance for Efficient 3D Editing☆34Updated last year
- [CVPR 2024] "Taming Mode Collapse in Score Distillation for Text-to-3D Generation" by Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Srey…☆48Updated 11 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation"☆65Updated 4 months ago
- Code for FreeTraj, a tuning-free method for trajectory-controllable video generation☆95Updated 5 months ago
- ObjCtrl-2.5D☆41Updated last month
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆34Updated last month
- Generative World Explorer☆123Updated last month
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆20Updated 3 months ago
- [CVPR 2024 Highlight] ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models☆155Updated 5 months ago