woven-visionai / wts-datasetLinks
☆48Updated 5 months ago
Alternatives and similar repositories for wts-dataset
Users that are interested in wts-dataset are comparing it to the libraries listed below
Sorting:
- ☆54Updated last year
- [CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of t…☆49Updated 9 months ago
- ☆15Updated last year
- [CVPR 2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detec…☆61Updated 7 months ago
- AICITY2024 Track 2 - Code from AIO_ISC Team☆37Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆53Updated last year
- ☆10Updated last year
- [ICCV2023] AIDE: A Vision-Driven Multi-View, Multi-Modal, Multi-Tasking Dataset for Assistive Driving Perception☆46Updated last year
- [CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"☆62Updated 8 months ago
- ☆51Updated last year
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆37Updated 2 years ago
- Video Feature Enhancement with PyTorch☆32Updated last year
- Official implementation of "Delving into CLIP latent space for Video Anomaly Recognition", CVIU 2024☆89Updated 2 months ago
- Official implementation of the CVPR paper Open-TransMind: A New Baseline and Benchmark for 1st Foundation Model Challenge of Intelligent …☆28Updated 2 years ago
- Official Code of CVPR'23 Paper "VLPD: Context-Aware Pedestrian Detection via Vision-Language Semantic Self-Supervision"☆22Updated last year
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆55Updated 8 months ago
- [WACV 2025] Official code for our paper "Enhancing Novel Object Detection via Cooperative Foundational Models"☆84Updated last month
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆109Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated last year
- 【IEEE T-IV】A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios☆50Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆38Updated last year
- Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".☆44Updated last year
- 🚀【AAAI 2025】Cross-View Referring Multi-Object Tracking☆64Updated 5 months ago
- Code release for "Language-conditioned Detection Transformer"☆88Updated last year
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆34Updated last year
- Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23☆100Updated last year
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆79Updated 4 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated 2 years ago
- [NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context☆168Updated last year
- Taming Self-Training for Open-Vocabulary Object Detection, CVPR 2024☆20Updated last year