lslrh/DMA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lslrh/DMA)

lslrh / DMA

Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024

☆32

Alternatives and similar repositories for DMA

Users that are interested in DMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Wang-pengfei / GGSD
View on GitHub
Official PyTorch codes for "Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation", ECCV2024
☆31Jul 19, 2024Updated 2 years ago
ZhuWenjie98 / DDE
View on GitHub
(ECCV2026) Dual Distribution Estimation for Zero-shot Noisy Test-Time Adaptation with VLMs
☆15Jul 2, 2026Updated 2 weeks ago
eslambakr / CoT3D_VG
View on GitHub
Chain_of_Thoughts_3D_Visual_Grounding
☆21Apr 20, 2024Updated 2 years ago
lslrh / SyncNoise
View on GitHub
SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing
☆19Dec 28, 2024Updated last year
theEricMa / ScaleDreamer
View on GitHub
[ECCV2024] ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation
☆53Mar 28, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PolyU-VCLab / DepthMaster
View on GitHub
DepthMaster: Unified Monocular Depth Estimation for Perspective and Panoramic Images
☆25Jun 13, 2026Updated last month
wangzy22 / XMask3D
View on GitHub
[NeurIPS 2024] XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation
☆37Jan 20, 2025Updated last year
GradiusTwinbee / GLIS
View on GitHub
officical code for ECCV 2024 paper "Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection"
☆14Jul 4, 2024Updated 2 years ago
skyhehe123 / ScatterFormer
View on GitHub
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention (ECCV 2024)
☆80May 20, 2025Updated last year
ChrisDud0257 / SSL
View on GitHub
Official code for our Paper "SSL: A Self-similarity Loss for Improving Generative Image Super-resolution" in ACMMM 2024
☆51Jun 6, 2026Updated last month
gwenzhang / GGA
View on GitHub
[ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…
☆36Jul 26, 2024Updated last year
Eaphan / OLIVINE
View on GitHub
Fine-grained Image-to-LiDAR Contrastive Distillation with Visual Foundation Models (NeurIPS2024)
☆44Nov 22, 2024Updated last year
peoplelu / SAS
View on GitHub
[ICCV 2025] SAS: Segment Any 3D Scene with Integrated 2D Priors
☆38Jun 25, 2025Updated last year
iGuoYanjun / Memorize-When-Needed
View on GitHub
☆23Jun 29, 2026Updated 3 weeks ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
mt-cly / FPR
View on GitHub
FPR: False Positive Rectification for Weakly Supervised Semantic Segmentation (ICCV 2023)
☆24Sep 24, 2023Updated 2 years ago
chchnii / GaussianSR
View on GitHub
Project page of "GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors"
☆23Jul 1, 2024Updated 2 years ago
ZCMax / ScanReason
View on GitHub
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
☆85Oct 10, 2024Updated last year
mt-cly / SimCMF
View on GitHub
SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality
☆34Nov 25, 2024Updated last year
gwenzhang / Voxel-Mamba
View on GitHub
[NeurIPS24 Spotlight] Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection
☆163Sep 26, 2024Updated last year
PQ3D / PQ3D
View on GitHub
Official implementation of the paper "Unifying 3D Vision-Language Understanding via Promptable Queries"
☆85Aug 2, 2024Updated last year
OpenM3D / M3DBench
View on GitHub
[ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.
☆61Oct 1, 2024Updated last year
skyhehe123 / spconv
View on GitHub
☆12Jul 18, 2024Updated 2 years ago
lyhdet / OV-3DET
View on GitHub
☆99Mar 25, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
xtudbxk / FreCaS
View on GitHub
The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"
☆32Jul 7, 2025Updated last year
gwenzhang / BEVDilation
View on GitHub
[AAAI'26] BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection
☆42Dec 3, 2025Updated 7 months ago
sg-3d / sg3d
View on GitHub
☆55Oct 3, 2024Updated last year
leeruibin / hybrid-forcing
View on GitHub
☆32Apr 29, 2026Updated 2 months ago
Wang-pengfei / One2Scene
View on GitHub
[ICLR 2026] - One2Scene
☆48May 25, 2026Updated last month
Liangsanzhu / Photo3D
View on GitHub
Photo3D: Advancing Photorealistic 3D Generation through Structure‑Aligned Detail Enhancement
☆22Mar 18, 2026Updated 4 months ago
CVMI-Lab / PLA
View on GitHub
(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learn…
☆301Jun 28, 2024Updated 2 years ago
VinAIResearch / Open3DIS
View on GitHub
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance (CVPR 2024)
☆135Nov 12, 2024Updated last year
YunzeMan / Lexicon3D
View on GitHub
[NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
☆102Feb 2, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
dk-liang / UniSeg3D
View on GitHub
[NeurIPS 2024] A Unified Framework for 3D Scene Understanding
☆179Jul 7, 2025Updated last year
ayushjain1144 / odin
View on GitHub
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
☆177Feb 27, 2026Updated 4 months ago
YBZh / LAPT
View on GitHub
ECCV2024, LAPT: Label-driven Automated Prompt Tuning for OOD Detection with Vision-Language Models
☆18Aug 9, 2024Updated last year
Chat-3D / Chat-3D
View on GitHub
Code for "Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes"
☆57Mar 28, 2024Updated 2 years ago
hanxunyu / Inst3D-LMM
View on GitHub
[CVPR 2025 Highlight] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuning…
☆131Jan 30, 2026Updated 5 months ago
mt-cly / ViP3DEdit
View on GitHub
[AAAI26] ViP3DE: Fast Multi-view Consistent 3D Editing with Video Priors
☆22Mar 5, 2026Updated 4 months ago
csslc / Self-Transcendence
View on GitHub
[ECCV 2026] Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Trans…
☆36Jul 3, 2026Updated 2 weeks ago