☆119Jan 18, 2026Updated last month
Alternatives and similar repositories for FoundationMotion
Users that are interested in FoundationMotion are comparing it to the libraries listed below
Sorting:
- Project Page for GaussianFormer☆24May 30, 2024Updated last year
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated 10 months ago
- ☆13Mar 28, 2025Updated 11 months ago
- [Arxiv'25] SaLon3R: Structure-aware Long-term Generalizable 3D Reconstruction from Unposed Images☆46Oct 18, 2025Updated 4 months ago
- The official implementation of "Low-power, Continuous Remote Behavioral Localization with Event Cameras" (CVPR 2024)☆12Sep 25, 2024Updated last year
- [AAAI 2026] Official implementation of the paper ”SegDINO3D: 3D Instance Segmentation Empowered by Both Image-Level and Object-Level 2D F…☆33Jan 8, 2026Updated last month
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆72Jan 10, 2025Updated last year
- ☆66Feb 23, 2026Updated last week
- [CVPR 2024] This is official implementation of our CVPR 2024 paper "Building a Strong Pre-Training Baseline for Universal 3D Large-Scale …☆17Jun 11, 2024Updated last year
- GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving☆29Mar 21, 2025Updated 11 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation☆95Mar 1, 2025Updated last year
- OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer☆282Jan 14, 2026Updated last month
- ☆46Dec 31, 2025Updated 2 months ago
- Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving☆32Nov 20, 2025Updated 3 months ago
- [ACM MM 2025] EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler☆26Aug 7, 2025Updated 6 months ago
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆57Nov 20, 2025Updated 3 months ago
- ☆141Oct 15, 2025Updated 4 months ago
- [ICCV2025] II-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene Forecasting☆174Oct 21, 2025Updated 4 months ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆36Dec 2, 2025Updated 3 months ago
- [AAAI 26 Oral] Official implementation of "FreeGaussian: Annotation-free Control of Articulated Objects via 3D Gaussian Splats with Flow …☆40Dec 1, 2025Updated 3 months ago
- [AAAI 2025] DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation☆239Mar 26, 2025Updated 11 months ago
- ☆28Jan 27, 2025Updated last year
- ☆22May 8, 2023Updated 2 years ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…☆63Mar 25, 2025Updated 11 months ago
- Semantic Instance Fusion for 3D reconstruction of RGB-D indoor images in python☆21Sep 23, 2021Updated 4 years ago
- Official implementation of "LidarDM: Generative LiDAR Simulation in a Generated World" (ICRA 2025)☆181Aug 12, 2025Updated 6 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆61Jan 10, 2025Updated last year
- Official repository for paper "Can LVLMs Obtain a Driver’s License? A Benchmark Towards Reliable AGI for Autonomous Driving"☆30May 18, 2025Updated 9 months ago
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆32Jul 16, 2025Updated 7 months ago
- [CVPR'25] Official repository for "Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Eva…☆44Jan 7, 2026Updated last month
- ☆30Sep 4, 2024Updated last year
- Atom3d, atomising geometry, is a mesh processing toolbox specifically designed for 3D learning.☆136Jan 17, 2026Updated last month
- [NeurIPS'25 Spotlight] GeoSVR: Taming Sparse Voxels for Geometrically Accurate Surface Reconstruction☆168Jan 1, 2026Updated 2 months ago
- [ICCV 2025] Language Driven Occupancy Prediction☆35Dec 23, 2024Updated last year
- ☆26Jun 17, 2021Updated 4 years ago
- StableWorld: Towards Stable and Consistent Long Interactive Video Generation☆81Feb 3, 2026Updated last month
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆79Dec 27, 2025Updated 2 months ago
- Empowering Unified MLLM with Multi-granular Visual Generation☆130Jan 16, 2025Updated last year