The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering
☆25Apr 18, 2025Updated 10 months ago
Alternatives and similar repositories for DSPNet
Users that are interested in DSPNet are comparing it to the libraries listed below
Sorting:
- VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis☆13Dec 26, 2024Updated last year
- Official implementation of paper "GAPrompt: Geometry-Aware Point Cloud Prompt for 3D Vision Model", ICML 2025☆15Dec 25, 2025Updated 2 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆43Apr 27, 2025Updated 10 months ago
- Transferable Feature Representation for Visible-to-Infrared Cross-Dataset Human Action Recognition (Complexity 2018)☆13Dec 14, 2022Updated 3 years ago
- Deep Reinforcement Learning in CARLA simulator☆16Mar 10, 2024Updated last year
- [IEEE T-PAMI 2023] Cross-Modal Causal Relational Reasoning for Event-Level Visual Question Answering☆20Jul 6, 2023Updated 2 years ago
- [ICLR 2025] Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention☆29Feb 21, 2025Updated last year
- [IEEE T-IP 2021] Semantics-aware Adaptive Knowledge Distillation for Cross-modal Action Recognition☆29Jan 6, 2025Updated last year
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Dec 19, 2023Updated 2 years ago
- [ECCV 2024] FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation☆28May 14, 2025Updated 9 months ago
- This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).☆29Jun 15, 2023Updated 2 years ago
- [ICLR 2025 Oral] NeuralPlane: Structured 3D Reconstruction in Planar Primitives with Neural Fields☆57Jan 3, 2026Updated 2 months ago
- Official implementation of "g3D-LF: Generalizable 3D-Language Feature Fields for Embodied Tasks" (CVPR'25).☆45Jul 14, 2025Updated 7 months ago
- Embodied Question Answering (EQA) benchmark and method (ICCV 2025)☆47Aug 12, 2025Updated 6 months ago
- Official implementation of neural vector fields☆38Aug 27, 2023Updated 2 years ago
- ☆14Jul 11, 2024Updated last year
- ☆19Oct 27, 2025Updated 4 months ago
- Official Repository of "Transcrib3D: 3D Referring Expression Resolution through Large Language Models" accepted at IROS 2024☆12Mar 7, 2025Updated 11 months ago
- ☆14Nov 23, 2024Updated last year
- ☆20Oct 15, 2025Updated 4 months ago
- ☆17Nov 14, 2025Updated 3 months ago
- ☆14Nov 25, 2024Updated last year
- Official implementation of paper "Vision Graph Prompting via Semantic Low-Rank Decomposition", ICML 2025☆16Dec 25, 2025Updated 2 months ago
- [ECCV 2024 Oral] The official implementation of paper: COHO: Context-Sensitive City-Scale Hierarchical Urban Layout Generation☆11Aug 13, 2024Updated last year
- [ICLR 2022] Official implementation of "Unrolling PALM for Sparse Semi-Blind Source Separation"☆11Apr 9, 2022Updated 3 years ago
- ☆12Oct 10, 2024Updated last year
- ☆24Oct 31, 2025Updated 4 months ago
- A detail Implementation of handling long-term memory in Agentic AI☆36Oct 9, 2025Updated 4 months ago
- [CVPR 2025] GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector☆16Mar 19, 2025Updated 11 months ago
- [NeurIPS 2025] EOC-Bench, an innovative benchmark designed to systematically evaluate object-centric embodied cognition in dynamic egocen…☆22Jun 17, 2025Updated 8 months ago
- Reward Evolution with Large Language Models using Human Feedback☆18Nov 14, 2025Updated 3 months ago
- Official Repo of "CIBench: Evaluation of LLMs as Code Interpreter "☆14Jul 19, 2024Updated last year
- A Reinforcement Learning package for 6th semester project☆12Jun 26, 2018Updated 7 years ago
- Official implementation of the paper "Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model"☆44Mar 4, 2025Updated 11 months ago
- PlaneRecTR: Unified Query Learning for 3D Plane Recovery from a Single View☆47Sep 11, 2024Updated last year
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- ☆14Jul 23, 2024Updated last year
- ☆33May 29, 2025Updated 9 months ago
- Official Repository for 'Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences' (CVPR 2024)☆16Mar 29, 2024Updated last year