GigaAI-research / Motion-R1Links
☆47Updated 5 months ago
Alternatives and similar repositories for Motion-R1
Users that are interested in Motion-R1 are comparing it to the libraries listed below
Sorting:
- Code for "Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation", Arxiv 2024☆96Updated 3 weeks ago
- [COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs☆51Updated 10 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆37Updated 4 months ago
- open-sourced video dataset with dynamic scenes and camera movements annotation☆78Updated 6 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆56Updated 3 months ago
- Seeing World Dynamics in a Nutshell☆110Updated 8 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆104Updated 8 months ago
- [ICLR 2025] Dataset and Code for Paper "Learning to Generate Diverse Pedestrian Movements from Web Videos with Noisy Labels"☆43Updated 4 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆83Updated 4 months ago
- SceneCompleter: Dense 3D Scene Completion for Generative Novel View Synthesis☆35Updated 5 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆97Updated 7 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 3 months ago
- ☆46Updated 7 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆195Updated last month
- A list of works on video generation towards world model☆210Updated last week
- [ICLR 2025] Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation☆40Updated 8 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆34Updated last week
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 7 months ago
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis☆98Updated 3 weeks ago
- [ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.☆32Updated last week
- Official implement for LaserHuman.☆35Updated 7 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated last year
- [Arxiv'25] MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization☆50Updated 2 months ago
- The official implementation of work "REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment".☆121Updated last year
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆88Updated last month
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆122Updated 3 months ago
- ☆78Updated 7 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆132Updated 7 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆158Updated last month
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆54Updated 9 months ago