daveredrum/Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

☆106

Alternatives and similar repositories for Scan2Cap

Users who are interested in Scan2Cap are comparing it to the repositories listed below.

CurryYuan / X-Trans2Cap
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
☆36 · Aug 26, 2022 · Updated 3 years ago
zlccccc / 3DVL_Codebase
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
☆57 · Jan 29, 2023 · Updated 3 years ago
daveredrum / D3Net
[ECCV2022] D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
☆44 · Aug 27, 2022 · Updated 3 years ago
CurryYuan / InstanceRefer
[ICCV 2021] InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextua…
☆74 · Mar 22, 2025 · Updated last year
daveredrum / ScanRefer
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
☆303 · Feb 10, 2023 · Updated 3 years ago
PNXD / FFL-3DOG
Free-form Description-guided 3D Visual Graph Networks for Object Grounding in Point Cloud
☆18 · Jun 23, 2022 · Updated 4 years ago
HaolinLiu97 / Refer-it-in-RGBD
Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021
☆42 · May 24, 2024 · Updated 2 years ago
referit3d / referit3d
Code accompanying our ECCV-2020 paper on 3D Neural Listeners.
☆141 · Jun 29, 2021 · Updated 5 years ago
zyang-ur / SAT
SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)
☆32 · Sep 29, 2021 · Updated 4 years ago
ATR-DBI / ScanQA
☆161 · Aug 23, 2023 · Updated 2 years ago
daveredrum / ScanRefer_Browser
☆11 · Feb 1, 2023 · Updated 3 years ago
SxJyJay / MORE
[ECCV 2022] MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes official implementation
☆16 · Feb 2, 2023 · Updated 3 years ago
heng-hw / SpaCap3D
[IJCAI 2022] Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds (official pytorch implementation)
☆21 · Aug 31, 2022 · Updated 3 years ago
chrdiller / mitsuba-visualize
Visualizes meshes, pointclouds and video flythroughs in publication quality
☆120 · Apr 19, 2020 · Updated 6 years ago
zlccccc / 3DVG-Transformer
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
☆43 · Jul 6, 2022 · Updated 4 years ago
nickgkan / butd_detr
Code for the ECCV22 paper "Bottom Up Top Down Detection Transformers for Language Grounding in Images and Point Clouds"
☆95 · Jun 9, 2023 · Updated 3 years ago
ZzZZCHS / WS-3DVG
[ICCV 2023] Distilling Coarse-to-fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding
☆14 · Oct 2, 2024 · Updated last year
chaoyivision / SGGpoint
[CVPR 2021] Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis (official pytorch implementation)
☆56 · Mar 23, 2022 · Updated 4 years ago
hanhung / TGNN
☆26 · Mar 15, 2022 · Updated 4 years ago
leolyj / 3D-VLP
This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).
☆29 · Jun 15, 2023 · Updated 3 years ago
fjhzhixi / 3D-SPS
☆64 · May 17, 2023 · Updated 3 years ago
3d-vista / 3D-VisTA
Official implementation of ICCV 2023 paper "3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment"
☆215 · Sep 7, 2023 · Updated 2 years ago
jianghaojun / Awesome-3D-Vision-and-Language
A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.
☆101 · Feb 26, 2023 · Updated 3 years ago
xheon / JointEmbedding
[ICCV'19] Joint Embedding of 3D Scan and CAD Objects
☆59 · Jan 15, 2022 · Updated 4 years ago
rohjunha / language-refer
☆27 · Jan 3, 2024 · Updated 2 years ago
tgxs002 / wikiscenes
Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision. ICCV 2021.
☆43 · Apr 30, 2024 · Updated 2 years ago
mako443 / Text2Pos-CVPR2022
Code, dataset and models for our CVPR 2022 publication "Text2Pos"
☆58 · Jun 17, 2022 · Updated 4 years ago
CurryYuan / PhraseRefer
[TNNLS] Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases
☆17 · Jul 10, 2025 · Updated last year
antao97 / SegGroup.annotator
Seg-Level Label Annotator
☆25 · Jul 24, 2022 · Updated 3 years ago
yanmin-wu / EDA
[CVPR 2023] EDA: Explicit Text-Decoupling and Dense Alignment for 3D Visual Grounding
☆134 · Oct 11, 2023 · Updated 2 years ago
ch3cook-fdu / Vote2Cap-DETR
[T-PAMI 2024] & [CVPR 2023] Vote2Cap-DETR; A set-to-set perspective towards 3D Dense Captioning; State-of-the-Art 3D Dense Captioning met…
☆104 · Aug 17, 2024 · Updated last year
WaldJohannaU / 3RScan
3RScan Toolkit
☆269 · May 26, 2022 · Updated 4 years ago
facebookresearch / ContrastiveSceneContexts
Code for CVPR 2021 oral paper "Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts"
☆238 · Jun 30, 2022 · Updated 4 years ago
SilongYong / SQA3D
[ICLR 2023] SQA3D for embodied scene understanding and reasoning
☆168 · Oct 13, 2023 · Updated 2 years ago
szzexpoi / rex
Official Repository for CVPR 2022 paper "REX: Reasoning-aware and Grounded Explanation"
☆22 · Nov 21, 2023 · Updated 2 years ago
NUAAXQ / MLCVNet
[CVPR 2020] MLCVNet: Multi-Level Context VoteNet for 3D Object Detection
☆123 · Nov 18, 2021 · Updated 4 years ago
Asterisci / Language-Assisted-3D
[AAAI 2023 Oral] Language-Assisted 3D Feature Learning for Semantic Scene Understanding
☆12 · Aug 1, 2023 · Updated 2 years ago
RozDavid / LanguageGroundedSemseg
Implementation for ECCV 2022 paper Language-Grounded Indoor 3D Semantic Segmentation in the Wild
☆124 · Nov 3, 2022 · Updated 3 years ago
skanti / Scan2CAD
[CVPR'19] Dataset and code used in the research project Scan2CAD: Learning CAD Model Alignment in RGB-D Scans
☆475 · Feb 25, 2025 · Updated last year