pqh22 / ProxyTransformation
[CVPR2025] ProxyTransformation: Preshaping Point Cloud Manifold With Proxy Attention For 3D Visual Grounding
☆27 Updated 2 months ago
Alternatives and similar repositories for ProxyTransformation
Users who are interested in ProxyTransformation are comparing it to the libraries listed below.
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding ☆53 Updated 9 months ago
- [ICLR'25] [3D-LLM] City-scale 3D Visual Grounding with Multi-modality LLMs ☆44 Updated last month
- [ECCV24] Navigation Instruction Generation with BEV Perception and Large Language Models ☆31 Updated 10 months ago
- Project Page for GaussianFormer ☆25 Updated 11 months ago
- Unifying 2D and 3D Vision-Language Understanding ☆82 Updated last month
- ☆46 Updated 4 months ago
- ☆84 Updated 4 months ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention ☆20 Updated 2 months ago
- Official PyTorch implementation of D^2-World as the second place and innovation award of CVPR 2024 Predictive World Model Challenge. ☆14 Updated last month
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners ☆45 Updated last month
- ☆19 Updated 3 months ago
- [NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation ☆32 Updated 3 months ago
- [WACV2025] Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field ☆12 Updated 6 months ago
- Implementation of the project: SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining ☆26 Updated last month
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding ☆27 Updated 3 months ago
- Code of 3DMIT: 3D MULTI-MODAL INSTRUCTION TUNING FOR SCENE UNDERSTANDING ☆30 Updated 9 months ago
- [CVPR 2025 Highlight] Towards Autonomous Micromobility through Scalable Urban Simulation ☆21 Updated 2 weeks ago
- ☆49 Updated 7 months ago
- [CVPR 2024] GeoAuxNet: Towards Universal 3D Representation Learning for Multi-sensor Point Clouds ☆14 Updated last year
- Paper: UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting ☆16 Updated 2 months ago
- ☆23 Updated 4 months ago
- [CVPR'25 Rating: 4/4/2 Final-Stage-Desk-Reject] GSOT3D: Towards Generic 3D Single Object Tracking in the Wild ☆25 Updated last week
- Driving Everywhere with Large Language Model Policy Adaptation ☆15 Updated 10 months ago
- ☆12 Updated last month
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking" ☆31 Updated 2 weeks ago
- Official code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection" ☆14 Updated 10 months ago
- [CVPR2024] Multiagent Multitraversal Multimodal Self-Driving: Open MARS Dataset ☆52 Updated 10 months ago
- ☆84 Updated 4 months ago
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding. ☆24 Updated 10 months ago
- ☆38 Updated 9 months ago