[T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm
☆369 · Sep 30, 2025 · Updated 5 months ago
Alternatives and similar repositories for PonderV2
Users who are interested in PonderV2 are comparing it to the libraries listed below
- [ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI ☆643 · Jan 12, 2026 · Updated last month
- Pointcept: Perceive the world with sparse points, a codebase for point cloud perception research. Latest works: Concerto (NeurIPS'25), So… ☆2,831 · Feb 4, 2026 · Updated 3 weeks ago
- UniPAD: A Universal Pre-training Paradigm for Autonomous Driving (CVPR 2024) ☆203 · Jul 9, 2024 · Updated last year
- (CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR 2024) RegionPLC: Regional Point-Language Contrastive Learn… ☆297 · Jun 28, 2024 · Updated last year
- ☆20 · Feb 1, 2026 · Updated 3 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆172 · Jun 19, 2025 · Updated 8 months ago
- Open-source implementations on real robots ☆35 · Nov 25, 2024 · Updated last year
- [CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3) ☆1,683 · Oct 24, 2025 · Updated 4 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding" ☆278 · Mar 19, 2025 · Updated 11 months ago
- [ICCV 2023] Code for NeRF-Det: Learning Geometry-Aware Volumetric Representation for Multi-View 3D Object Detection ☆303 · Sep 14, 2023 · Updated 2 years ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆573 · Oct 26, 2025 · Updated 4 months ago
- [ESI highly cited] TLS point cloud registration benchmark consists of 115 scans collected from 11 different scenarios ☆72 · Oct 27, 2023 · Updated 2 years ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu… ☆311 · Jul 17, 2024 · Updated last year
- Code for 3D-LLM: Injecting the 3D World into Large Language Models ☆1,177 · Jun 6, 2024 · Updated last year
- 3D object discovery from casual object captures ☆37 · Jul 14, 2023 · Updated 2 years ago
- Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources ☆2,117 · Feb 3, 2026 · Updated 3 weeks ago
- [ICCV'23 Workshop] SAM3D: Segment Anything in 3D Scenes ☆1,306 · Apr 21, 2024 · Updated last year
- [ICLR 2024] FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators ☆302 · Nov 25, 2024 · Updated last year
- ☆52 · Feb 14, 2024 · Updated 2 years ago
- PatchAugNet: Patch feature augmentation-based heterogeneous point cloud place recognition in large-scale street scenes ☆49 · Apr 1, 2025 · Updated 10 months ago
- [IEEE TGRS 2024] A novel method for registration of MLS and stereo reconstructed point clouds ☆75 · Jun 27, 2024 · Updated last year
- [CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies ☆799 · Oct 27, 2023 · Updated 2 years ago
- [CVPR 2023 Award Candidate] OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation ☆516 · Sep 2, 2024 · Updated last year
- ☆582 · Jan 21, 2026 · Updated last month
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models ☆349 · Dec 1, 2025 · Updated 2 months ago
- [Information Fusion 2024] SparseDC: Depth Completion From Sparse and Non-uniform Inputs ☆130 · Aug 20, 2024 · Updated last year
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. ☆61 · Oct 1, 2024 · Updated last year
- ☆60 · Jun 13, 2024 · Updated last year
- Mask3D predicts accurate 3D semantic instances achieving state-of-the-art on ScanNet, ScanNet200, S3DIS and STPLS3D. ☆712 · Oct 29, 2023 · Updated 2 years ago
- [CVPR 2024 & NeurIPS 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI ☆652 · Jun 13, 2025 · Updated 8 months ago
- [ICML 2023] Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining ☆153 · Jul 21, 2024 · Updated last year
- Align 3D Point Cloud with Multi-modalities for Large Language Models ☆459 · Dec 9, 2023 · Updated 2 years ago
- SIGGRAPH Asia 2023: Code for "Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes" ☆247 · Dec 25, 2023 · Updated 2 years ago
- Official implementation of Continuous 3D Perception Model with Persistent State ☆1,340 · Aug 27, 2025 · Updated 6 months ago
- Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024 ☆126 · Dec 31, 2024 · Updated last year
- [3DV'25] 3D Reconstruction with Spatial Memory ☆1,119 · Feb 25, 2025 · Updated last year
- A shift-window based transformer for 3D sparse tasks ☆284 · Jun 25, 2023 · Updated 2 years ago
- [ECCV 2024 Best Paper Candidate & TPAMI 2025] PointLLM: Empowering Large Language Models to Understand Point Clouds ☆975 · Aug 14, 2025 · Updated 6 months ago
- [ICML 2024] LEO: An Embodied Generalist Agent in 3D World ☆476 · Apr 20, 2025 · Updated 10 months ago