pandayuanyu / NewtonGenLinks
[ICLR 2026] NewtonGen: Physics-Consistent and Controllable Text-to-Video Generation via Neural Newtonian Dynamics
☆121Updated 2 weeks ago
Alternatives and similar repositories for NewtonGen
Users that are interested in NewtonGen are comparing it to the libraries listed below
Sorting:
- Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation☆272Updated last month
- [ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation☆379Updated 2 weeks ago
- Are Video Models Ready as Zero-shot Reasoners?☆84Updated 2 months ago
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video☆271Updated 2 months ago
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"☆137Updated 3 months ago
- This is the repository that contains source code for the PhysGen3D.☆240Updated 4 months ago
- [NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆192Updated last month
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆262Updated 3 weeks ago
- Official repo for paper "EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning"☆127Updated 4 months ago
- ☆125Updated 11 months ago
- [Tutorial] Few-Step Distillation for Text-to-Image Generation: A Practical Guide☆340Updated last month
- Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning☆178Updated 3 months ago
- Official implementation for "Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model" (NeurIPS 2024)☆258Updated 9 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆414Updated 6 months ago
- This project is the official implementation of "UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Gener…☆206Updated 2 weeks ago
- Implementation of paper: Flux Already Knows – Activating Subject-Driven Image Generation without Training☆141Updated 5 months ago
- Video-Inpaint-Anything: This is the inference code for our paper CoCoCo: Improving Text-Guided Video Inpainting for Better Consistency, C…☆320Updated last year
- AnyTalker: Scaling Multi-person Talking Video Generation with Interactivity Refinement☆278Updated 2 months ago
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding☆539Updated 3 months ago
- Visualization of DiT self attention features☆235Updated last year
- [ECCV2024] DreamScene: 3D Gaussian-based Text-to-3D Scene Generation via Formation Pattern Sampling☆228Updated 2 months ago
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics☆180Updated last week
- LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation☆38Updated 11 months ago
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy☆819Updated last month
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆137Updated last year
- Pytorch Implementation of FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing (ICLR 2024)☆212Updated last year
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆90Updated 2 months ago
- [NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation☆210Updated 8 months ago
- [AAAI26] Next Patch Prediction☆132Updated last year
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆259Updated 4 months ago