LZ-CH/DSPNet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LZ-CH/DSPNet)

LZ-CH / DSPNet

The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

☆28

Alternatives and similar repositories for DSPNet

Users that are interested in DSPNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HCPLab-SYSU / TAVP
View on GitHub
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation (CVPR-26)
☆25May 19, 2026Updated 2 months ago
YangLiu9208 / VisionGRU
View on GitHub
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis
☆13Dec 26, 2024Updated last year
HCPLab-SYSU / DDP-WM
View on GitHub
DDP-WM: Disentangled Dynamics Prediction for Efficient World Models (ICML-26)
☆19Mar 4, 2026Updated 4 months ago
HCPLab-SYSU / DART
View on GitHub
DART: Differentiable Adaptive Region Tokenizer for Vision Foundation Models
☆21Oct 13, 2025Updated 9 months ago
GillianZhu / HORLN
View on GitHub
☆14Oct 23, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
zhoujiahuan1991 / ICML2025-GAPrompt
View on GitHub
Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025
☆17Dec 25, 2025Updated 6 months ago
YangLiu9208 / JSRDA
View on GitHub
[IEEE T-CSVT 2019] Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition
☆14Nov 26, 2019Updated 6 years ago
HCPLab-SYSU / 3DAffordSplat
View on GitHub
3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians (ACM MM 25)
☆79Jul 21, 2025Updated last year
pzhren / InfiniteWorld
View on GitHub
☆86Jun 16, 2026Updated last month
fereenwong / cdViews
View on GitHub
official code for "3D Question Answering via only 2D Vision-Language Models"
☆24Mar 4, 2026Updated 4 months ago
YangLiu9208 / SAKDN
View on GitHub
[IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition
☆29Jan 6, 2025Updated last year
VinceOuti / Open3DVQA
View on GitHub
☆31Nov 18, 2025Updated 8 months ago
beacon-3d / Beacon3D
View on GitHub
[CVPR 2025] Beacon3D: Object-centric Evaluation for 3D Grounding-QA
☆28Nov 25, 2025Updated 7 months ago
Oleki-xxh / FPB-FOMM
View on GitHub
[IEEE TIP 2024] Facial Prior Guided Micro-Expression Generation
☆13Nov 8, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
WissingChen / CMCRL
View on GitHub
The official implementation of “Cross-Modal Causal Representation Learning for Radiology Report Generation” （IEEE T-IP 2025）
☆68May 27, 2025Updated last year
leolyj / 3D-VLP
View on GitHub
This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).
☆29Jun 15, 2023Updated 3 years ago
rohanNkhaire / RL_SB3_carla
View on GitHub
Deep Reinforcement Learning in CARLA simulator
☆16Mar 10, 2024Updated 2 years ago
zihaosheng / ExploreVLA
View on GitHub
[ECCV 2026] ExploreVLA: Dense World Modeling and Exploration for End-to-End Autonomous Driving
☆18Jun 20, 2026Updated last month
songw-zju / PointLoRA
View on GitHub
The official implementation of "PointLoRA: Low-Rank Adaptation with Token Selection for Point Cloud Learning" (CVPR 2025)
☆29Oct 31, 2025Updated 8 months ago
edurnebernal / SST-Sal
View on GitHub
SST-Sal: A spherical spatio-temporal approach for saliency prediction in 360º videos
☆15Aug 31, 2023Updated 2 years ago
WeitaiKang / Intent3D
View on GitHub
[ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention
☆29Feb 21, 2025Updated last year
Richard-Zhang-AI / APDM
View on GitHub
☆17Feb 24, 2025Updated last year
yuanzhoulvpi2017 / yuanzhoulvpi2017
View on GitHub
personal info
☆11Mar 23, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gchhablani / embodied-splat-v1
View on GitHub
Code for the paper - EmbodiedSplat: Personalized Real-to-Sim-to-Real Navigation with Gaussian Splats from a Mobile Device (ICCV, 2025).
☆28Oct 21, 2025Updated 9 months ago
LZ-CH / GAIIC2023
View on GitHub
GAIIC赛道一：影像学 NLP — 医学影像诊断报告生成 [A100换你大棚甜瓜 Rank-12 方案]
☆68Jun 9, 2023Updated 3 years ago
HCPLab-SYSU / EXPRESS-Bench
View on GitHub
Embodied Question Answering (EQA) benchmark and method (ICCV 2025)
☆60Aug 12, 2025Updated 11 months ago
Necolizer / ISTA-Net
View on GitHub
[IROS 2023] Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition
☆21Jul 12, 2025Updated last year
yl3800 / LASO
View on GitHub
☆48Aug 8, 2024Updated last year
jiaming-zhou / Zero-WAM
View on GitHub
Zero-WAM, an in-context world model for zero-shot robotic task generalization
☆33Jul 8, 2026Updated 2 weeks ago
FannyChao / SalGAN360
View on GitHub
Saliency prediction on 360° image with SalGAN
☆16Jan 5, 2021Updated 5 years ago
wudongming97 / AffordanceNet
View on GitHub
[ICCV 2025] RAGNet: Large-scale Reasoning-based Affordance Segmentation Benchmark towards General Grasping
☆50Nov 21, 2025Updated 8 months ago
Jiang-HB / FUSER
View on GitHub
[CVPR'2026, Oral] FUSER: Feed-Forward MUltiview 3D Registration Transformer and SE(3)^N Diffusion Refinement
☆37Jun 4, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dfki-av / MiKASA-3DVG
View on GitHub
[CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding
☆18Dec 13, 2024Updated last year
YoujunZhao / OpenScan
View on GitHub
OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding
☆23Jul 6, 2026Updated 2 weeks ago
Daisy-1227 / FRI-Net
View on GitHub
[ECCV 2024] FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation
☆35May 14, 2025Updated last year
ajhamdi / vointcloud
View on GitHub
Voint Cloud: Multi-View Point Cloud Representation for 3D Understanding (ICLR 2023)
☆22May 2, 2023Updated 3 years ago
xmed-lab / TP-Mamba
View on GitHub
MICCAI 2024: Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images
☆28Apr 3, 2025Updated last year
Wi-sc / NVF
View on GitHub
Official implementation of neural vector fields
☆38Aug 27, 2023Updated 2 years ago
JLUtangchuan / Parts2Words
View on GitHub
This is the source code of Part2Word: Learning Joint Embedding of Point Clouds and Text by Bidirectional Matching between Parts and Words
☆16Mar 22, 2023Updated 3 years ago