ydk122024 / AIDE
[ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception
☆44 · Updated last year
Alternatives and similar repositories for AIDE
Users who are interested in AIDE are comparing it to the repositories listed below.
- ☆43 · Updated 3 months ago
- The official Talk2Car dataset repo ☆87 · Updated 3 weeks ago
- ☆12 · Updated 5 months ago
- ☆14 · Updated 2 years ago
- A curated list of peer-reviewed papers on theoretical and practical aspects of drivers' attention used for paper "Attention for Vision-Ba… ☆127 · Updated 3 months ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition ☆65 · Updated 9 months ago
- Driver Attention Prediction in Accidental Scenarios ☆113 · Updated 9 months ago
- Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023) ☆11 · Updated 2 years ago
- [CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events ☆56 · Updated last year
- [ACM MM 2020] Uncertainty-based Traffic Accident Anticipation ☆77 · Updated 2 years ago
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking ☆30 · Updated last year
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer ☆37 · Updated last year
- [CVPR 2024] Action-slot: Visual Action-centric Representations for Atomic Activity Recognition in Traffic Scenes ☆21 · Updated 4 months ago
- Risky Object Localization (ROL) in a Driving Scene Dataset ☆14 · Updated last year
- This is the implementation code for the paper, "A Dynamic Spatial-temporal Attention Network for Early Anticipation of Traffic Accidents" ☆34 · Updated 2 years ago
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition" ☆52 · Updated last year
- BEAR: a new BEnchmark on video Action Recognition ☆44 · Updated last year
- This is the implementation code for the paper, "An Attention-guided Multistream Feature Fusion Network for Early Localization of Risky Tr… ☆22 · Updated last year
- This repo proves that a synthetic dataset combined with a real-world dataset can boost the performance of models for Pedestrian Intention Predicti… ☆11 · Updated 6 months ago
- This repo contains code and models for the implementation of ViT-DD, a semi-supervised method for detecting driver distractions. ☆30 · Updated 2 years ago
- [AAAI2025] Language Prompt for Autonomous Driving ☆149 · Updated 9 months ago
- [ICCV 2023] The official PyTorch code for Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation ☆88 · Updated 2 years ago
- Official Code for "EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View" ☆52 · Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆47 · Updated last year
- 3D-RetinaNet: a baseline model on the ROAD dataset ☆75 · Updated 3 years ago
- Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers" ☆58 · Updated 6 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv… ☆134 · Updated 2 years ago
- GPT as Psychologist? Preliminary Evaluations for GPT-4V on Visual Affective Computing ☆25 · Updated last year
- Frame Flexible Network (CVPR2023) ☆56 · Updated 2 years ago
- Code and models for the Action Recognition benchmark of Assembly101 ☆11 · Updated 2 years ago