sega-hsj/MVT-3DVG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sega-hsj/MVT-3DVG)

sega-hsj / MVT-3DVG

[CVPR 2022] Multi-View Transformer for 3D Visual Grounding

☆81

Alternatives and similar repositories for MVT-3DVG

Users that are interested in MVT-3DVG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zlccccc / 3DVL_Codebase
View on GitHub
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
☆57Jan 29, 2023Updated 3 years ago
fjhzhixi / 3D-SPS
View on GitHub
☆64May 17, 2023Updated 3 years ago
zlccccc / 3DVG-Transformer
View on GitHub
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
☆43Jul 6, 2022Updated 4 years ago
heng-hw / SpaCap3D
View on GitHub
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
☆21Aug 31, 2022Updated 3 years ago
zyang-ur / SAT
View on GitHub
SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)
☆32Sep 29, 2021Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Ivan-Tang-3D / ViewRefer3D
View on GitHub
(ICCV2023) Official implementation of 'ViewRefer: Grasp the Multi-view Knowledge for 3D Visual Grounding with GPT and Prototype Guidance'…
☆60Apr 18, 2024Updated 2 years ago
nickgkan / butd_detr
View on GitHub
Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"
☆95Jun 9, 2023Updated 3 years ago
yanmin-wu / EDA
View on GitHub
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
☆135Oct 11, 2023Updated 2 years ago
3dlg-hcvc / multi3drefer
View on GitHub
[ICCV 2023] Multi3DRefer: Grounding Text Description to Multiple 3D Objects
☆98Mar 26, 2026Updated 3 months ago
cshizhe / vil3dref
View on GitHub
Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).
☆67Dec 2, 2022Updated 3 years ago
rohjunha / language-refer
View on GitHub
☆27Jan 3, 2024Updated 2 years ago
jianghaojun / Awesome-3D-Vision-and-Language
View on GitHub
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
☆101Feb 26, 2023Updated 3 years ago
CurryYuan / PhraseRefer
View on GitHub
[TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
☆17Jul 10, 2025Updated last year
PNXD / FFL-3DOG
View on GitHub
Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud
☆18Jun 23, 2022Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
eslambakr / CoT3D_VG
View on GitHub
Chain_of_Thoughts_3D_Visual_Grounding
☆21Apr 20, 2024Updated 2 years ago
CurryYuan / InstanceRefer
View on GitHub
[ICCV 2021] InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextua…
☆74Mar 22, 2025Updated last year
ATR-DBI / ScanQA
View on GitHub
☆161Aug 23, 2023Updated 2 years ago
PPjmchen / HAM
View on GitHub
☆17Jul 8, 2023Updated 3 years ago
daveredrum / ScanRefer
View on GitHub
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
☆303Feb 10, 2023Updated 3 years ago
eslambakr / LAR-Look-Around-and-Refer
View on GitHub
This is the official implementation for our paper;"LAR:Look Around and Refer".
☆30Dec 1, 2022Updated 3 years ago
SxJyJay / MORE
View on GitHub
[ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation
☆16Feb 2, 2023Updated 3 years ago
sosppxo / 3D-STMN
View on GitHub
[AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…
☆45Dec 20, 2023Updated 2 years ago
wenz116 / DRFT
View on GitHub
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Oct 24, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Leon1207 / 3DRefTR
View on GitHub
This is a PyTorch implementation of 3DRefTR proposed by our paper "A Unified Framework for 3D Point Cloud Visual Grounding"
☆26Aug 24, 2023Updated 2 years ago
daveredrum / Scan2Cap
View on GitHub
[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans
☆106Sep 6, 2022Updated 3 years ago
ikuinen / semantic_completion_network
View on GitHub
☆26Aug 4, 2020Updated 5 years ago
luo-junyu / TransRefer3D
View on GitHub
TransRefer3D: Entity-and-Relation Aware Transformer for Fine-Grained 3D Visual Grounding [ACM MM'21]
☆20Apr 23, 2022Updated 4 years ago
hanhung / TGNN
View on GitHub
☆26Mar 15, 2022Updated 4 years ago
CurryYuan / X-Trans2Cap
View on GitHub
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
☆36Aug 26, 2022Updated 3 years ago
LeapLabTHU / Pseudo-Q
View on GitHub
[CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding
☆153Jul 13, 2024Updated 2 years ago
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
daveredrum / ScanRefer_Browser
View on GitHub
☆11Feb 1, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SilongYong / SQA3D
View on GitHub
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
☆169Oct 13, 2023Updated 2 years ago
Haiyang-W / CAGroup3D
View on GitHub
[NeurIPS2022] This is the official code of "CAGroup3D: Class-Aware Grouping for 3D Object Detection on Point Clouds".
☆96May 31, 2023Updated 3 years ago
3dlg-hcvc / minsu3d
View on GitHub
MINSU3D: MinkowskiEngine-powered Scene Understanding in 3D
☆42Jun 24, 2024Updated 2 years ago
Open3DA / LL3DA
View on GitHub
[CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…
☆319Jul 17, 2024Updated 2 years ago
djiajunustc / TransVG
View on GitHub
☆198Feb 27, 2024Updated 2 years ago
szacho / pointcam
View on GitHub
Self-supervised adversarial masking for point clouds
☆11Jul 12, 2023Updated 3 years ago
qzp2018 / MCLN
View on GitHub
This is a PyTorch implementation of MCLN proposed by our paper "Multi-branch Collaborative Learning Network for 3D Visual Grounding"(ECCV…
☆27Oct 10, 2024Updated last year