DarlingHang / ChatCamLinks
This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.
☆19Updated last year
Alternatives and similar repositories for ChatCam
Users that are interested in ChatCam are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆196Updated last month
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆164Updated last week
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆83Updated 4 months ago
- Official code for "LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models" (CVPR 2025)☆148Updated 5 months ago
- Official implementation of Physics3D: Learning Physical Properties of 3D Gaussians via Video Diffusion☆223Updated last year
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆39Updated 2 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆160Updated last month
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆104Updated 8 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 7 months ago
- Code for "Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation", Arxiv 2024☆96Updated 3 weeks ago
- [arXiv 2025] Generative View Stitching☆88Updated 3 weeks ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆178Updated 5 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆132Updated 7 months ago
- [Nips 2025] EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation☆122Updated 3 months ago
- [3DV 2026] "SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass"☆213Updated last week
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆98Updated 3 weeks ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆406Updated 3 weeks ago
- A novel 4D reconstruction method that directly generates high-quality, animation-ready 4D mesh asset (.GLB file) from a single monocular …☆93Updated this week
- Self-reimplemented version of 4D-LRM.☆63Updated 5 months ago
- ☆244Updated last month
- [AAAI 2025] DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors☆220Updated last year
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆392Updated 2 weeks ago
- [SIGGRAPH Asia 2025] WorldExplorer: Towards Generating Fully Navigable 3D Scenes☆134Updated last month
- PhysX: Physical-Grounded 3D Asset Generation (NeurIPS 2025, Spotlight)☆316Updated last week
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"☆19Updated 7 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆195Updated 3 months ago
- Open-world 3D part segmentation of point clouds☆103Updated 4 months ago
- [CVPR 2025] ArtFormer: Controllable Generation of Diverse 3D Articulated Objects☆32Updated 4 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.☆83Updated last year
- [ICCV 2025 Highlights] Large-scale photo-realistic virtual worlds for embodied AI☆235Updated 3 weeks ago