OrangeSodahub / InfGenLinks
[ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.
☆39Updated 3 weeks ago
Alternatives and similar repositories for InfGen
Users that are interested in InfGen are comparing it to the libraries listed below
Sorting:
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆59Updated last week
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆61Updated 7 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆66Updated 10 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆74Updated 9 months ago
- [ICCV'25] Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness☆55Updated 2 months ago
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆59Updated last month
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆23Updated 5 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆189Updated 4 months ago
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆113Updated last month
- ☆82Updated this week
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆66Updated this week
- ☆17Updated 3 months ago
- [NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving…☆208Updated this week
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆36Updated this week
- ☆54Updated 5 months ago
- ☆16Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆348Updated 3 months ago
- (NeurIPS2025) DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆181Updated this week
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆129Updated 6 months ago
- ☆15Updated last year
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆72Updated 6 months ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆52Updated 3 months ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆52Updated last month
- The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)☆79Updated last month
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 9 months ago
- [ECCV 2024] Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression☆47Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆66Updated 6 months ago
- the official code of DriveMonkey☆32Updated 3 months ago
- 🦾 A Dual-System VLA with System2 Thinking☆110Updated last month
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)☆190Updated 2 months ago