woven-visionai / wts-dataset
☆34Updated 9 months ago
Alternatives and similar repositories for wts-dataset:
Users that are interested in wts-dataset are comparing it to the libraries listed below
- [CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of t…☆34Updated 2 months ago
- ☆15Updated last year
- ☆45Updated 9 months ago
- [CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"☆50Updated last month
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆36Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆48Updated 5 months ago
- [ECCV 2024] BUSCA: "Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking"☆36Updated 4 months ago
- This is the implementation code for the paper, "An Attention-guided Multistream Feature Fusion Network for Early Localization of Risky Tr…☆21Updated last year
- Benchmark and model for step-by-step reasoning in autonomous driving.☆46Updated last month
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated 7 months ago
- ☆35Updated 11 months ago
- Improving Mamaba performance on Video Understanding task☆39Updated 6 months ago
- ☆36Updated last year
- ☆37Updated last month
- 【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆50Updated 11 months ago
- Foundation Models for Video Understanding: A Survey☆119Updated 7 months ago
- Video Feature Enhancement with PyTorch☆28Updated 4 months ago
- ☆37Updated 10 months ago
- [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“☆70Updated 2 months ago
- ☆43Updated 11 months ago
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 5 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆86Updated 4 months ago
- [ICCV 2021] Deep Reinforced Accident Anticipation with Visual Explanation☆26Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆43Updated last year
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆53Updated 3 weeks ago
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- LongShortNet for Streaming Perception task.☆13Updated last year
- ☆24Updated 11 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year