ynw2021 / FENDLinks
☆19Updated 7 months ago
Alternatives and similar repositories for FEND
Users that are interested in FEND are comparing it to the libraries listed below
Sorting:
- A naturalistic trajectory dataset with dense driving interactions and the toolbox for driving interaction extraction.☆138Updated 4 months ago
- ☆94Updated 5 months ago
- Implementation for "Challenger: Affordable Adversarial Driving Video Generation"☆133Updated 3 months ago
- This is the source code for the ECCV paper "MTFormer: Multi-Task Learning via Transformer and Cross-Task Reasoning"☆200Updated 3 years ago
- [EMNLP2025]Official implementation: Agent-style vision question answer in Autonomous Driving!☆133Updated 2 months ago
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆252Updated last week
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆226Updated this week
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆259Updated last month
- Mem4Nav: Boosting Vision-and-Language Navigation in Urban Environments with a Hierarchical Spatial-Cognition Long-Short Memory System☆101Updated 4 months ago
- [CoRL2024] Let Occ Flow: Self-Supervised 3D Occupancy Flow Prediction☆126Updated 2 months ago
- Official implementation for "HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human …☆368Updated last month
- This repository contains the source code for our paper: "PrefMMT: Modeling Human Preferences in Preference-based Reinforcement Learning w…☆50Updated 9 months ago
- [CVPR 2025 Highlight] Official Implementation of SURGEON: Memory-Adaptive Fully Test-Time Adaptation via Dynamic Activation Sparsity☆113Updated 6 months ago
- See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model☆42Updated this week
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆134Updated this week
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆450Updated this week
- This is the official implementation of UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving☆187Updated 3 months ago
- ☆67Updated 4 months ago
- Official Pytorch implementation for ICML 2025 paper "Large Continual Instruction Assistant"☆65Updated 4 months ago
- A survey and paper list of current Spatio-Temporal Foundation Models from the pipeline perspective with awesome resources (paper, code, s…☆152Updated 6 months ago
- DPO-Shift: Shifting the Distribution of Direct Preference Optimization☆60Updated 9 months ago
- ☆26Updated 3 years ago
- [NeurIPS 2025] NAUTILUS: A Large Multimodal Model for Underwater Scene Understanding☆341Updated last month
- (ICCV-2025 Official Code)) Improving Generalist Model with Domain-Specific Experts☆86Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆214Updated last month
- ☆188Updated 5 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆240Updated last week
- [VLDB 2025] SimRN: Trajectory Similarity Learning in Road Networks based on Distributed Deep Reinforcement Learning☆105Updated 7 months ago
- [MM 2024] Official code for VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness☆52Updated last year
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆88Updated last month