VincentDENGP/3D-LR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/VincentDENGP/3D-LR)

VincentDENGP / 3D-LR

Can 3D Vision-Language Models Truly Understand Natural Language?

☆20

Alternatives and similar repositories for 3D-LR

Users that are interested in 3D-LR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CVMI-Lab / clip-beyond-tail
View on GitHub
(NeurIPS 2024) What Makes CLIP More Robust to Long-Tailed Pre-Training Data? A Controlled Study for Transferable Insights
☆27Oct 28, 2024Updated last year
CVMI-Lab / FS3D
View on GitHub
(NeurlPS 2022) Prototypical VoteNet for Few-Shot 3D Point Cloud Object Detection
☆60Jan 3, 2023Updated 3 years ago
sled-group / COMFORT
View on GitHub
[ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…
☆22Oct 24, 2024Updated last year
CVMI-Lab / ResKD
View on GitHub
[NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".
☆31Nov 16, 2022Updated 3 years ago
XingruiWang / 3D-Aware-VQA
View on GitHub
Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions"
☆21Oct 17, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
AlvinWen428 / spatial-relation-benchmark
View on GitHub
☆15Oct 12, 2024Updated last year
CVMI-Lab / Hita
View on GitHub
(ICCV 2025) Holistic Tokenizer for Autoregressive Image Generation
☆34Oct 9, 2025Updated 9 months ago
vision-x-nyu / test-set-training
View on GitHub
☆15Nov 25, 2025Updated 7 months ago
CVMI-Lab / CoDet
View on GitHub
(NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection
☆123Apr 26, 2024Updated 2 years ago
CVMI-Lab / IST-Net
View on GitHub
(ICCV2023) IST-Net: Prior-free Category-level Pose Estimation with Implicit Space Transformation
☆120Dec 7, 2023Updated 2 years ago
MIV-XJTU / EvoPrompt
View on GitHub
PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).
☆13Apr 15, 2024Updated 2 years ago
uvavision / SyViC
View on GitHub
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13Sep 30, 2023Updated 2 years ago
CVMI-Lab / MarS3D
View on GitHub
(CVPR 2023) MarS3D: A Plug-and-Play Motion-Aware Model for Semantic Segmentation on Multi-Scan 3D Point Clouds
☆68Jul 31, 2023Updated 2 years ago
OrigamiSL / OTETrack
View on GitHub
Source code of the paper: Overlapped Trajectory-Enhanced Visual Tracking
☆11Sep 3, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
CVMI-Lab / SparseKD
View on GitHub
(NeurlPS 2022) Towards Efficient 3D Object Detection with Knowledge Distillation
☆126Jul 26, 2023Updated 2 years ago
findalexli / mllm-dpo
View on GitHub
[ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model
☆48Nov 10, 2024Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
MIV-XJTU / FLAME
View on GitHub
[CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"
☆33Jul 8, 2025Updated last year
Learning-and-Intelligent-Systems / lisdf
View on GitHub
A repository for a universal I/O spec for TAMP, along with scripts to convert from popular specs to our spec
☆14Jun 25, 2025Updated last year
shizhediao / DaVinci
View on GitHub
Source code for the paper "Prefix Language Models are Unified Modal Learners"
☆45Apr 30, 2023Updated 3 years ago
MIV-XJTU / DKT
View on GitHub
☆14Jun 19, 2023Updated 3 years ago
Haochen-Wang409 / ross3d
View on GitHub
[ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness
☆70Jul 22, 2025Updated 11 months ago
facebookresearch / open-eqa
View on GitHub
OpenEQA Embodied Question Answering in the Era of Foundation Models
☆366Sep 20, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
IT3DEgo / IT3DEgo
View on GitHub
CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"
☆19Jun 27, 2024Updated 2 years ago
ZzZZCHS / Chat-Scene
View on GitHub
[NeurIPS 2024 & TPAMI 2026] Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers
☆216Apr 12, 2026Updated 3 months ago
KarlesZheng / FERMT
View on GitHub
☆13Jul 15, 2024Updated 2 years ago
Aaronhuang-778 / Mixture-Compressor-MoE
View on GitHub
[ICLR 2025, IEEE TPAMI 2026] Mixture Compressor & MC#
☆75Feb 12, 2025Updated last year
MIV-XJTU / SPEED
View on GitHub
PyTorch implementation of paper "Sparse Parameterization for Epitomic Dataset Distillation" in NeurIPS 2023.
☆20Jun 28, 2024Updated 2 years ago
CVMI-Lab / DODA
View on GitHub
(ECCV 2022) DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation
☆52Oct 20, 2022Updated 3 years ago
CVMI-Lab / SyntheticData
View on GitHub
Is synthetic data from generative models ready for image recognition?
☆187Feb 16, 2023Updated 3 years ago
JiwanChung / vlis
View on GitHub
☆24Oct 9, 2023Updated 2 years ago
StanfordMIMI / villa
View on GitHub
[ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data
☆45Oct 15, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Should-AI-Lab / GRID
View on GitHub
The official implementation of 'GRID: Visual Layout Generation.'
☆21Dec 28, 2024Updated last year
Georgelingzj / up-to-date-Vision-Language-Models
View on GitHub
Up-to-date Vision Language Models collection. Mainly focus on computer vision
☆20Feb 9, 2023Updated 3 years ago
leolee99 / CLIP_ITM
View on GitHub
A simple pytorch implementation of baseline based-on CLIP for Image-text Matching.
☆19May 25, 2023Updated 3 years ago
DTennant / distill_visual_priors
View on GitHub
2nd place solution of ECCV 2020 workshop VIPriors Image Classification Challenge, https://arxiv.org/abs/2008.00261
☆13Aug 22, 2021Updated 4 years ago
Yingdong-Hu / PVM-Robotics
View on GitHub
The repository for a thorough empirical evaluation of pre-trained vision model performance across different downstream policy learning me…
☆24Aug 19, 2023Updated 2 years ago
iancovert / locality-alignment
View on GitHub
☆55Jan 17, 2025Updated last year
CVMI-Lab / Hybrid-Occ-SDF
View on GitHub
This is the officially implementation of ICCV 2023 paper " Learning A Room with the Occ-SDF Hybrid: Signed Distance Function Mingled with…
☆11Dec 7, 2023Updated 2 years ago