chrislin0621 / CleanPose
[ICCV2025] CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation
☆20Updated 4 months ago
Alternatives and similar repositories for CleanPose
Users who are interested in CleanPose are comparing it to the repositories listed below
- [CVPR 2025] 3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer☆87Updated 8 months ago
- A survey on Multimodal Fusion for Robot Vision☆35Updated 3 months ago
- ☆87Updated 8 months ago
- 😎 up-to-date & curated list of awesome 3D Visual Grounding papers, methods & resources.☆260Updated 3 weeks ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆282Updated last month
- MonoASRH: Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection☆13Updated 8 months ago
- ☆73Updated 10 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆208Updated 9 months ago
- Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes☆19Updated 5 months ago
- [CVPR 2025] DFormerv2: Geometry Self-Attention for RGBD Semantic Segmentation & [ICLR 2024] DFormer & [NeurIPS 2025] OmniSegmentor☆444Updated 2 months ago
- [CVPR 2025] Mr. DETR: Instructive Multi-Route Training for Detection Transformers☆163Updated 5 months ago
- ☆53Updated 2 months ago
- ☆32Updated last year
- [NeurIPS 2024] A Unified Framework for 3D Scene Understanding☆170Updated 7 months ago
- [CVPR 2025] OnlineAnySeg: Online Zero-Shot 3D Segmentation by Visual Foundation Model Guided 2D Mask Merging☆40Updated 8 months ago
- [CVPR 2025, All Strong Accept] TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding☆249Updated 7 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆66Updated last year
- ☆50Updated 9 months ago
- CVPR2025☆21Updated 5 months ago
- [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time☆613Updated 9 months ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception☆125Updated 10 months ago
- [CVPR24] Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation☆85Updated 3 weeks ago
- [ICLR 2025 Spotlight] Multimodality Helps Few-shot 3D Point Cloud Semantic Segmentation☆68Updated 9 months ago
- Repository for Vision-and-Language Navigation via Causal Learning (Accepted by CVPR 2024)☆98Updated 8 months ago
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆224Updated 7 months ago
- [ICRA 2026] Official implementation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆47Updated last week
- 3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)☆69Updated 6 months ago
- ☆27Updated last year
- [CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships☆145Updated last year
- ImOV3D: Learning Open Vocabulary Point Clouds 3D Object Detection from Only 2D Images (NeurIPS2024)☆88Updated 4 months ago