[AAAI2024] The official PyTorch implementation of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Understanding
☆13Dec 8, 2024Updated last year
Alternatives and similar repositories for 3DVLP
Users that are interested in 3DVLP are comparing it to the libraries listed below.
- ☆17Aug 5, 2024Updated last year
- [ESWA 2025] Official PyTorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"☆16Aug 9, 2021Updated 4 years ago
- [CVPR'24] MiKASA: Multi-Key-Anchor & Scene-Aware Transformer for 3D Visual Grounding☆18Dec 13, 2024Updated last year
- [NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping☆18Feb 28, 2026Updated 3 weeks ago
- Chain_of_Thoughts_3D_Visual_Grounding☆19Apr 20, 2024Updated last year
- [ICLR2023] Video Scene Graph Generation from Single-Frame Weak Supervision☆12Sep 17, 2023Updated 2 years ago
- GraspFast: Multi-stage Lightweight 6-DoF Grasp Pose Detection with RGB-D Image☆24Jun 20, 2025Updated 9 months ago
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 11 months ago
- ☆15Dec 25, 2025Updated 2 months ago
- MixMatch Domain Adaptation: Prize-winning solution for both tracks of VisDA 2019 challenge☆23Mar 24, 2023Updated 2 years ago
- The implementation of "A Simple Baseline for Weakly-Supervised Scene Graph Generation" for ICCV2021☆15Aug 17, 2021Updated 4 years ago
- [ICME 2024 Oral] DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding☆23Feb 26, 2025Updated last year
- Video Visual Relation Detection (VidVRD) tracklet generation. Also for the ACM MM Visual Relation Understanding Grand Challenge☆40Dec 5, 2022Updated 3 years ago
- [ICCVW2025] V-RoAst: A New Dataset for Visual Road Assessment☆11Dec 17, 2025Updated 3 months ago
- A toolkit for hybrid log parsing☆18Aug 23, 2023Updated 2 years ago
- An extensible anime subscription platform that makes following a series a one-click task!☆34Feb 12, 2026Updated last month
- ☆22Sep 13, 2021Updated 4 years ago
- Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors☆37Aug 7, 2025Updated 7 months ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- Code for recreating the HoS benchmark of VISOR☆23Jul 2, 2023Updated 2 years ago
- ☆16Jun 4, 2023Updated 2 years ago
- ☆36Feb 11, 2026Updated last month
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 6 months ago
- ☆20Mar 17, 2026Updated last week
- An Agent built on the InternLm chat 7B large-model base that can call MMYOLO tools to complete visual tasks on images☆11Oct 30, 2024Updated last year
- [ECCV'24] A novel weakly supervised framework for 3D object detection from 2D bounding boxes. It can easily extend to novel scenarios and…☆36Jul 26, 2024Updated last year
- Official implementation of BGNN(CVPR 2021)☆20Jul 12, 2021Updated 4 years ago
- Bird's Eye View Calibration Toolkit☆17Jun 21, 2025Updated 9 months ago
- ☆34Feb 12, 2026Updated last month
- My profile☆20May 14, 2024Updated last year
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- Rank 9, IJCAI-18 Alimama Search Ad Conversion Prediction, Season 1☆10Aug 22, 2018Updated 7 years ago
- InternDataEngine: Pioneering High-Fidelity Synthetic Data Generator for Robotic Manipulation☆74Updated this week
- ☆16Mar 26, 2025Updated 11 months ago
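- A WeChat AI content-creation agent that automatically handles information crawling, content organization, layout, and draft pushing. Covers Kaggle competitions, HuggingFace papers, and ProductHunt product news.☆16Aug 3, 2025Updated 7 months ago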
- ☆36Feb 8, 2026Updated last month
- One framework to evaluate any VLA model on any robot simulation benchmark.☆102Updated this week
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆17Apr 3, 2024Updated last year
- ☆18Mar 6, 2026Updated 2 weeks ago