SenseTime-FVG / UniMLVG
☆9Updated 2 months ago
Alternatives and similar repositories for UniMLVG:
Users who are interested in UniMLVG are comparing it to the repositories listed below
- ☆14Updated 3 months ago
- ☆46Updated 4 months ago
- ☆19Updated last week
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆16Updated last week
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆79Updated 3 months ago
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving☆23Updated 4 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆42Updated last month
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆37Updated 3 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆67Updated 5 months ago
- A collection of vision foundation models unifying understanding and generation.☆48Updated 3 months ago
- ☆10Updated 11 months ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆10Updated 9 months ago
- A dual-branch conditional diffusion model designed to enhance driving scene generation across multiple views and video sequences.☆16Updated last week
- [CVPR 2025] ReconDreamer☆121Updated 3 months ago
- Implementation of the paper: VG4D: Vision-Language Model Goes 4D Video Recognition (ICRA 2024)☆14Updated 11 months ago
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆19Updated this week
- [ICLR'25] Official Implementation of STAMP: Scalable Task And Model-agnostic Collaborative Perception☆24Updated last month
- ☆44Updated 2 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆65Updated 3 months ago
- ☆18Updated this week
- AutoTrust, a groundbreaking benchmark designed to assess the trustworthiness of DriveVLMs. This work aims to enhance public safety by ens…☆43Updated 3 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆113Updated last month
- Official Github Repo for GEM☆32Updated 3 months ago
- The official implementation of JM3D.☆29Updated last year
- ☆12Updated 2 weeks ago
- ☆28Updated 7 months ago
- [AAAI 2025] MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation☆19Updated last week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆34Updated this week
- BIP3D: Bridging 2D Images and 3D Perception for Embodied Intelligence☆38Updated last week
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆55Updated 5 months ago