☆18Apr 10, 2025Updated last year
Alternatives and similar repositories for NuScenes-SpatialQA
Users that are interested in NuScenes-SpatialQA are comparing it to the libraries listed below.
- [TIP2024] MWFormer: Multi-Weather Image Restoration Using Degradation-Aware Transformers☆80Dec 6, 2024Updated last year
- Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing☆59Dec 17, 2024Updated last year
- [EMNLP'25] A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.☆51Aug 21, 2025Updated 9 months ago
- [TMLR'25] AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public saf…☆55Nov 20, 2025Updated 6 months ago
- ☆17Nov 27, 2024Updated last year
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception☆63Feb 4, 2025Updated last year
- 🏆 [CVPRW 2024] COVER: A Comprehensive Video Quality Evaluator. 🥇 Winner solution for Video Quality Assessment Challenge at the 1st AIS…☆99Jul 18, 2024Updated last year
- ☆13Mar 28, 2025Updated last year
- PISCO: Precise Video Instance Insertion with Sparse Control☆62Feb 13, 2026Updated 4 months ago
- [ICML2025] Official Code for IAL (Multi-modal 3D Panoptic Segmentation Model)☆30Oct 2, 2025Updated 8 months ago
- a comprehensive and critical synthesis of the emerging role of GenAI across the full autonomous driving stack☆236Sep 20, 2025Updated 8 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆33Dec 22, 2025Updated 5 months ago
- ☆230May 26, 2026Updated 2 weeks ago
- [RAL 2023] NSLF-OL: Online Learning of Neural Surface Light Fields alongside Real-time Incremental 3D Reconstruction☆22May 5, 2023Updated 3 years ago
- [ECCV2024] Weakly Supervised 3D Object Detection via Multi-Level Visual Guidance☆23Jul 14, 2024Updated last year
- ☆12Apr 1, 2025Updated last year
- Our inference and training framework to run on the Cosmos Models☆195Updated this week
- https://github.com/PRBonn/kiss-icp☆11Dec 6, 2022Updated 3 years ago
- [TIV 2024] PyTorch implementation of FlowLens (https://arxiv.org/pdf/2211.11293)☆35Mar 25, 2024Updated 2 years ago
- A modern, responsive academic personal website.☆23Apr 5, 2025Updated last year
- [ICCVW2025] V-RoAst: Visual Road Assessment. Can VLM be a Road Safety Assessor Using the iRAP Standard?☆13Dec 17, 2025Updated 5 months ago
- OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.☆938May 13, 2025Updated last year
- [ICLR 2026] The official implementation of "Dichotomous Diffusion Policy Optimization"☆43May 2, 2026Updated last month
- ☆40Feb 11, 2026Updated 4 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Entropy-Driven GRPO with Guided Error Correction for Advantage Diversity☆22Aug 28, 2025Updated 9 months ago
- Builds an Agent on the InternLM chat 7B base model that can call MMYOLO tools to complete visual tasks on images☆11Oct 30, 2024Updated last year
- Code and Data for Real-time Human-Centric Segmentation for Complex Video Scenes☆17Feb 8, 2024Updated 2 years ago
- [3DV'21] CAMPARI: Camera-Aware Decomposed Generative Neural Radiance Fields☆30Feb 1, 2022Updated 4 years ago
- [CVPR2025] We present SleeperMark, a novel framework designed to embed resilient watermarks into T2I diffusion models☆39May 26, 2025Updated last year
- Bird's Eye View Calibration Toolkit☆19Jun 21, 2025Updated 11 months ago
- Reinforcing Action Policies by Prophesying☆41Nov 26, 2025Updated 6 months ago
- ☆64Apr 12, 2018Updated 8 years ago
- ☆11Apr 7, 2026Updated 2 months ago
- [ICCV 2025] GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding☆77Jun 26, 2025Updated 11 months ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆13Updated this week
- This repository contains all the code and data used in our article titled “Estimating international trade status of countries from global…☆10Jul 6, 2023Updated 2 years ago
- Differentiable Point Radiance Fields Rasteriser for Novel View Synthesis☆36Jun 4, 2023Updated 3 years ago
- UniRL is a Framework for Unified Multimodal Model Reinforcement Learning☆522Updated this week