YiyiyiZhao / VIALMLinks

Survey and Benchmark of VIALM

☆9

Alternatives and similar repositories for VIALM

Users that are interested in VIALM are comparing it to the libraries listed below

Sorting:

jyFengGoGo / InstructDet
☆37Updated last year
CurryYuan / InstanceRefer
[ICCV 2021] InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextua…
☆75Updated 3 months ago
fjhzhixi / 3D-SPS
☆62Updated 2 years ago
yinjunbo / ProficientTeachers
☆17Updated 2 years ago
ayesha-ishaq / Open3DTrack
Code for Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
☆26Updated 4 months ago
sega-hsj / MVT-3DVG
[CVPR 2022] Multi-View Transformer for 3D Visual Grounding
☆76Updated 2 years ago
Space3D-Bench / Space3D-Bench
☆12Updated 3 months ago
Asterisci / Language-Assisted-3D
[AAAI 2023 Oral] Language-Assisted 3D Feature Learning for Semantic Scene Understanding
☆12Updated last year
nomiaro / OPA
☆10Updated last year
Whale-ice / DDS3D
This is the repository for DDS3D(ICRA2023)
☆17Updated last year
RunsenXu / MV-JAR
[CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training
☆47Updated 2 years ago
YunzeMan / DualCross
[IROS 2023] DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception
☆29Updated last year
sunnyHelen / RCVC-depth
The code for On Robust Cross-View Consistency in Outdoor Self-Supervised Monocular Depth Estimation
☆13Updated 2 years ago
xmed-lab / NuInstruct
☆63Updated 11 months ago
katieluo88 / DRIFT
☆15Updated last year
dzcgaara / OVO-Open-Vocabulary-Occupancy
☆80Updated 2 years ago
XiandaGuo / Drive-MLLM
☆45Updated last month
GradiusTwinbee / GLIS
officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"
☆14Updated last year
leolyj / 3D-VLP
This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).
☆29Updated 2 years ago
zlccccc / 3DVG-Transformer
[ICCV2021] 3DVG-Transformer: Relation Modeling for Visual Grounding on Point Clouds
☆42Updated 3 years ago
rolsheng / MM-VUFM4DS
【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios
☆51Updated last year
Fayeben / ADAS
A Simple Active-and-Adaptive Baseline for Cross-Domain 3D Semantic Segmentation
☆13Updated 2 years ago
gpt4vision / R1-SGG
☆20Updated 2 months ago
taco-group / NuScenes-SpatialQA
☆16Updated 3 months ago
hanhung / TGNN
☆24Updated 3 years ago
gpt4vision / OvSGTR
[ECCV 2024 Best Paper Candidate] Implementation of "Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Vi…
☆70Updated 2 months ago
Mr-Neko / JM3D
The offical implemention of JM3D.
☆30Updated 2 months ago
sosppxo / 3D-STMN
[AAAI 2024] The official implementation of the paper "3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Refer…
☆42Updated last year
Yangsenqiao / LiDAR-LLM
☆48Updated 6 months ago
Sranc3 / M-BEV
[AAAI24]This is the implementation for the paper M-BEV: Masked BEV Perception for Robust Autonomous Driving
☆38Updated last year