OrangeSodahub / InfGenLinks
[ICCV 2025] Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation.
☆25Updated this week
Alternatives and similar repositories for InfGen
Users that are interested in InfGen are comparing it to the libraries listed below
Sorting:
- Official implementation of LangCoop: Collaborative Driving with Natural Language☆29Updated this week
- 🦾 A Dual-System VLA with System2 Thinking☆38Updated this week
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆22Updated last month
- [ICLR 2025] The offical implementation of "PSEC: Skill Expansion and Composition in Parameter Space", a new framework designed to facilit…☆28Updated 4 months ago
- ☆22Updated last week
- WorldVLA: Towards Autoregressive Action World Model☆94Updated this week
- [CVPR 2025] Noise-Consistent Siamese-Diffusion for Medical Image Synthesis and Segmentation☆39Updated this week
- [CVPR 2025 Highlight🔥] Official code repository for "Inst3D-LMM: Instance-Aware 3D Scene Understanding with Multi-modal Instruction Tuni…☆90Updated last month
- Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆65Updated this week
- Unified Vision-Language-Action Model☆61Updated this week
- [PVLDB 2025] TAB: Unified Benchmarking of Time Series Anomaly Detection Methods☆21Updated this week
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆133Updated last month
- Embodied Intelligence in Endovascular Robot Navigation -- 血管介入手术机器人具身导航☆14Updated last month
- The official repository of [CVPR2025] DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering☆17Updated 2 months ago
- ☆27Updated last month
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆43Updated this week
- 📖 This is a repository for organizing papers, codes, and other resources related to personalized video generation and editing.☆37Updated this week
- Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"☆29Updated 3 weeks ago
- [arXiv 2025] Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps☆38Updated last month
- ☆37Updated 2 weeks ago
- 🚀 Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models☆23Updated 2 weeks ago
- (ArXiv25) Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning☆37Updated 2 weeks ago
- Official implementation of T-PAMI25 paper "M²Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes"☆54Updated last week
- ☆48Updated 4 months ago
- ☆49Updated 8 months ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆14Updated this week
- Codes of Paper "Learning 2D Invariant Affordance Knowledge for 3D Affordance Grounding"☆18Updated 9 months ago
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆12Updated 8 months ago
- The official implementation of The paper "Exploring the Potential of Encoder-free Architectures in 3D LMMs"☆53Updated last month
- SpaceR: The first MLLM empowered by SG-RLVR for video spatial reasoning☆63Updated 2 weeks ago