MIV-XJTU / FSDriveLinks
Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
☆158Updated 2 weeks ago
Alternatives and similar repositories for FSDrive
Users that are interested in FSDrive are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation☆116Updated this week
- GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models☆367Updated last week
- [ICCV 2025] Official implementation of the paper “MagicDrive-V2: High-Resolution Long Video Generation for Autonomous Driving with Adapti…☆602Updated 3 weeks ago
- 🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.☆88Updated last month
- Official implementation of the paper “MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes”☆289Updated last year
- [CVPR'25] Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception☆16Updated 2 months ago
- [ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" i…☆155Updated 2 months ago
- ☆184Updated 3 months ago
- ☆112Updated last month
- VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model☆344Updated 3 months ago
- Official Repository of paper OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference☆145Updated 4 months ago
- [ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”☆1,080Updated 2 months ago
- Awesome Data-Driven Autonomous Driving Solutions. Also the official repository of our survey paper: Data-Centric Evolution in Autonomous …☆174Updated last year
- [ICCV 2025] MM-IFEngine: Towards Multimodal Instruction Following☆97Updated 2 months ago
- StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding☆134Updated 2 months ago
- [AAAI 2025] The code repository for "MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning" in PyTorch.☆65Updated 3 months ago
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆82Updated last month
- CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025)☆131Updated last month
- Teaching LMMs for Image Quality Scoring and Interpreting☆94Updated 3 months ago
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis☆173Updated 9 months ago
- [ACMMM 2025] "Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts" (Official Implementation)☆46Updated last week
- Official Pytorch Code of the Paper "UniVST: A Unified Framework for Training-free Localized Video Style Transfer"☆69Updated 3 weeks ago
- Official code for "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"☆174Updated last week
- High Density Intersection Dataset (HDI)☆234Updated last month
- This repository is the official implementation of "DTL: Disentangled Transfer Learning for Visual Recognition", which is accepted by AAAI…☆121Updated last week
- Official code for "Direct Retrieval-augmented Optimization: Synergizing Knowledge Selection and Language Models"☆245Updated 2 months ago
- Official Pytorch Code of the Paper "LightMotion: A Light and Tuning-free Method for Simulating Camera Motion in Video Generation"☆38Updated last month
- [DASFAA'25] LLM4GraphTopology: Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs☆63Updated last month
- jyf-drawing-board是一个背景透明的Web画板项目,使用HTML5 的<canvas>元素来实现绘图功能。☆22Updated 5 months ago
- 🌳 An educational modern C++ deep learning framework supporting CUDA☆43Updated last month