pqh22 / ProxyTransformationLinks
[CVPR2025] ProxyTransformation : Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
☆45Updated last month
Alternatives and similar repositories for ProxyTransformation
Users that are interested in ProxyTransformation are comparing it to the libraries listed below
Sorting:
- Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs" (ICCV 2025)☆52Updated 2 months ago
- ☆24Updated 4 months ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆120Updated 2 weeks ago
- Unifying 2D and 3D Vision-Language Understanding☆108Updated 2 months ago
- ConDense backbone, weights, and evaluation code.☆31Updated last year
- EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory☆26Updated last week
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew …☆13Updated 11 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆43Updated 2 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆38Updated 3 months ago
- [CVPR 2023] Unsupervised Continual Semantic Adaptation through Neural Rendering☆40Updated last year
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆99Updated 2 months ago
- [WACV 2025] Official code of "SEED4D: A Synthetic Ego-Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark"☆17Updated last month
- Project Page for GaussianFormer☆24Updated last year
- ☆41Updated last year
- ☆48Updated last year
- ☆94Updated 9 months ago
- LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences☆29Updated last week
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆29Updated 2 weeks ago
- [ICLR'25] City-scale 3D Visual Grounding with Multi-modality LLMs☆57Updated 2 months ago
- [ICCV 2025] 3DGraphLLM is a model that uses a 3D scene graph and an LLM to perform 3D vision-language tasks.☆80Updated 2 months ago
- [CVPR 2025] RelationField: Relate Anything in Radiance Fields☆76Updated 6 months ago
- [WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field☆14Updated 11 months ago
- [ICCV2025] Extrapolated Urban View Synthesis Benchmark☆44Updated last week
- [arXiv'24] GAGS: Granularity-Aware 3D Feature Distillation for Gaussian Splatting☆45Updated 4 months ago
- [NeurIPS 2024] DiffSF: Diffusion Models for Scene Flow Estimation☆28Updated 9 months ago
- [ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction☆37Updated 2 months ago
- Official PyTorch implementation of the paper ‘CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Und…☆53Updated last year
- ☆91Updated 9 months ago
- ☆53Updated 4 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆273Updated last month