fudan-zvg / spar
☆29 · Updated last month
Alternatives and similar repositories for spar:
Users who are interested in spar are comparing it to the repositories listed below.
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video ☆70 · Updated this week
- [ECCV 2024] EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion. ☆93 · Updated 11 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness". ☆20 · Updated last month
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow ☆23 · Updated 3 weeks ago
- Open-world 3D part segmentation of point clouds ☆75 · Updated last month
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction ☆31 · Updated 2 months ago
- ☆59 · Updated 2 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti… ☆78 · Updated 8 months ago
- Spatial-R1: The first MLLM trained using GRPO for spatial reasoning in videos ☆31 · Updated this week
- Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries" ☆75 · Updated 9 months ago
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation ☆44 · Updated 4 months ago
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆103 · Updated 5 months ago
- A collection of object-compositional modeling by implicit neural representation. ☆58 · Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language" ☆43 · Updated 3 months ago
- This is the project page of ShowRoom3D ☆25 · Updated last year
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 · Updated last year
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. ☆60 · Updated 7 months ago
- Sora Generates Videos with Stunning Geometrical Consistency ☆49 · Updated last year
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting ☆15 · Updated 2 months ago
- ☆10 · Updated 6 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆96 · Updated 2 weeks ago
- Open-source video dataset with dynamic scenes and camera-movement annotations ☆50 · Updated last week
- [NeurIPS 2023 Spotlight] Code for "Contrastive Lift: 3D Object Instance Segmentation by Slow-Fast Contrastive Fusion" ☆67 · Updated last year
- A list of works on video generation towards world model ☆53 · Updated this week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning ☆37 · Updated 4 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆84 · Updated last month
- [NeurIPS 2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion ☆35 · Updated 7 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding ☆89 · Updated 3 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper ☆51 · Updated last week
- ConDense backbone, weights, and evaluation code. ☆32 · Updated 10 months ago