minnie-lin / Awesome-Physics-Cognition-based-Video-Generation
☆26Updated this week
Alternatives and similar repositories for Awesome-Physics-Cognition-based-Video-Generation:
Users that are interested in Awesome-Physics-Cognition-based-Video-Generation are comparing it to the libraries listed below
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆88Updated 2 weeks ago
- Official code for MotionBench (CVPR 2025)☆31Updated 3 weeks ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆30Updated 10 months ago
- Code release for NeurIPS 2023 paper SlotDiffusion: Object-centric Learning with Diffusion Models☆84Updated last year
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆98Updated 4 months ago
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training☆39Updated 11 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆69Updated last month
- ☆122Updated 2 months ago
- ☆37Updated last week
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs"☆50Updated last month
- A collection of vision foundation models unifying understanding and generation.☆47Updated 2 months ago
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆62Updated last week
- Code for paper "Grounding Video Models to Actions through Goal Conditioned Exploration".☆44Updated 3 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025)☆24Updated last week
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆95Updated 5 months ago
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆71Updated 3 weeks ago
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control☆60Updated last month
- [CVPR 2025] Open implementation of "RandAR"☆69Updated last week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 3 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"☆22Updated 10 months ago
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight]☆43Updated 8 months ago
- ☆17Updated 5 months ago
- Aether: Geometric-Aware Unified World Modeling☆198Updated this week
- ☆47Updated last month
- ☆32Updated last week
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆77Updated 7 months ago
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated last month
- ☆55Updated last month
- Official repository for "iVideoGPT: Interactive VideoGPTs are Scalable World Models" (NeurIPS 2024), https://arxiv.org/abs/2405.15223☆121Updated 3 weeks ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆58Updated 6 months ago