OrangeSodahub / InfGenLinks
[ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.
☆30Updated 3 weeks ago
Alternatives and similar repositories for InfGen
Users that are interested in InfGen are comparing it to the libraries listed below
Sorting:
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆54Updated 3 weeks ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆157Updated 2 months ago
- DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆71Updated this week
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆59Updated 5 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆62Updated 8 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness".☆44Updated 3 weeks ago
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆20Updated 3 months ago
- OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding☆54Updated this week
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆93Updated 2 months ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆63Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆290Updated 3 weeks ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆70Updated 7 months ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆61Updated 2 months ago
- ☆13Updated last year
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆25Updated last month
- ☆24Updated 6 months ago
- ☆65Updated this week
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆126Updated 4 months ago
- ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving☆111Updated last month
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆138Updated last month
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 7 months ago
- Code for CVPR2025 paper: Generating Multimodal Driving Scenes via Next-Scene Prediction☆70Updated 4 months ago
- ☆49Updated 9 months ago
- A paper list for spatial reasoning☆121Updated last month
- 🦾 A Dual-System VLA with System2 Thinking☆71Updated this week
- Unified Vision-Language-Action Model☆128Updated 2 weeks ago
- Official Code for Epona: Autoregressive Diffusion World Model for Autonomous Driving (ICCV 2025)☆76Updated this week
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆69Updated last week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆39Updated 7 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆49Updated 3 weeks ago