[CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding
☆18Dec 13, 2024Updated last year
Alternatives and similar repositories for MiKASA-3DVG
Users that are interested in MiKASA-3DVG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Chain_of_Thoughts_3D_Visual_Grounding☆21Apr 20, 2024Updated 2 years ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆67Dec 2, 2022Updated 3 years ago
- This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…☆27Oct 10, 2024Updated last year
- [ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds☆43Jul 6, 2022Updated 3 years ago
- [CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding☆134Oct 11, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- (ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…☆60Apr 18, 2024Updated 2 years ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆84Oct 10, 2024Updated last year
- ☆46Mar 27, 2023Updated 3 years ago
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆63Aug 3, 2024Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆17Jun 20, 2023Updated 2 years ago
- ☆13Jul 22, 2024Updated last year
- Official implementation for CIGN☆17Sep 11, 2023Updated 2 years ago
- This is a pytorch implementation of our AAAI paper for learned image transmission with HVAE☆12Mar 2, 2026Updated 3 months ago
- [NeurIPS‘24] Multi-Object 3D Grounding with Dynamic Modules and Language Informed Spatial Attention☆28Jun 15, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Oct 4, 2023Updated 2 years ago
- Lecture notes include almost everything in my notebook.☆13Sep 8, 2025Updated 9 months ago
- ☆12May 19, 2025Updated last year
- Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision☆11Jul 22, 2024Updated last year
- HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation☆15Sep 13, 2024Updated last year
- ☆12Dec 19, 2024Updated last year
- This is the source code of Part2Word: Learning Joint Embedding of Point Clouds and Text by Bidirectional Matching between Parts and Words☆16Mar 22, 2023Updated 3 years ago
- A Keras Implementation of Coordinate Attention follows https://github.com/Andrew-Qibin/CoordAttention☆13Sep 25, 2021Updated 4 years ago
- [CVPR 2025] Official implementation of the paper "Point-Cache: Test-time Dynamic and Hierarchical Cache for Robust and Generalizable Poin…☆19Mar 13, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ICLR 2025 Oral] MOS: Model Synergy for Test-Time Adaptation on LiDAR-Based 3D Object Detection☆17Jul 24, 2025Updated 10 months ago
- Official Repository of "Transcrib3D: 3D Referring Expression Resolution through Large Language Models" accepted at IROS 2024☆13Mar 30, 2026Updated 2 months ago
- [ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects☆97Mar 26, 2026Updated 2 months ago
- ☆26Jan 19, 2026Updated 4 months ago
- Official PyTorch Implementation of the Paper "Test-Time Adaptation of 3D Point Clouds via Denoising Diffusion Models"☆22Apr 24, 2025Updated last year
- ☆13Aug 27, 2021Updated 4 years ago
- Unofficial implementation of "Coordinate Attention for Efficient Mobile Network Design". CoordAttention tensorflow slim☆17Mar 10, 2021Updated 5 years ago
- DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation☆98Dec 31, 2024Updated last year
- [TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…☆28May 8, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆284Mar 19, 2025Updated last year
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆28Apr 18, 2025Updated last year
- ☆39Jul 19, 2024Updated last year
- ☆20Sep 27, 2024Updated last year
- Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"☆94Jun 9, 2023Updated 3 years ago
- Visual Grounding with Multi-modal Conditional Adaptation (ACMMM 2024 Oral)☆26Jun 11, 2025Updated last year
- [ECCV'24] PyTorch Implementation of "Free-Editor: Zero-shot Text-driven 3D Scene Editing"☆21Dec 5, 2024Updated last year