pqh22 / ProxyTransformationLinks
[CVPR2025] ProxyTransformation : Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
☆48Updated 4 months ago
Alternatives and similar repositories for ProxyTransformation
Users that are interested in ProxyTransformation are comparing it to the libraries listed below
Sorting:
- Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs" (ICCV 2025)☆61Updated 6 months ago
- ☆48Updated 2 years ago
- ☆27Updated 7 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆61Updated 5 months ago
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆58Updated last week
- Unifying 2D and 3D Vision-Language Understanding☆119Updated 6 months ago
- Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion (ICCV 2025)☆76Updated 4 months ago
- [NeurIPS 2025] LabelAny3D: Label Any Object 3D in the Wild☆111Updated 2 weeks ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆123Updated 3 months ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆14Updated last year
- ConDense backbone, weights, and evaluation code.☆30Updated last year
- [CVPR 2023] Unsupervised Continual Semantic Adaptation through Neural Rendering☆41Updated 2 years ago
- [NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"☆92Updated last month
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆52Updated 2 months ago
- Project Page for GaussianFormer☆24Updated last year
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆53Updated 3 months ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆106Updated this week
- [CVPR 2025] RelationField: Relate Anything in Radiance Fields☆85Updated 10 months ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention☆28Updated 11 months ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆22Updated last year
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models☆30Updated last year
- [AAAI-26] GAGS: Granularity-Aware 3D Feature Distillation for Gaussian Splatting☆55Updated 2 months ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆21Updated 4 months ago
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆50Updated last month
- [ICLR'25] City-scale 3D Visual Grounding with Multi-modality LLMs☆62Updated 2 months ago
- Open-Vocabulary SAM3D: Understand Any 3D Scene☆37Updated 7 months ago
- [NeurIPS 2024] DiffSF: Diffusion Models for Scene Flow Estimation☆29Updated last year
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆117Updated 5 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆33Updated last month
- Code repository for "DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers"☆74Updated 2 months ago