woven-visionai / wts-dataset
☆32Updated 8 months ago
Alternatives and similar repositories for wts-dataset:
Users that are interested in wts-dataset are comparing it to the libraries listed below
- [CVPRW 2024] TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning. Official code for the 3rd place solution of t…☆32Updated last month
- ☆43Updated 8 months ago
- [CVPR2024 Highlight] The official repo for paper "Abductive Ego-View Accident Video Understanding for Safe Driving Perception"☆47Updated this week
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆47Updated 4 months ago
- ☆16Updated last year
- [ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer☆36Updated last year
- [ECCV 2024] Beyond MOT: Semantic Multi-Object Tracking☆30Updated 6 months ago
- [ECCV 2024] BUSCA: "Lost and Found: Overcoming Detector Failures in Online Multi-Object Tracking"☆35Updated 3 months ago
- Codes for ICML 2023 Learning Dynamic Query Combinations for Transformer-based Object Detection and Segmentation☆37Updated last year
- Code release for "Language-conditioned Detection Transformer"☆88Updated 9 months ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆100Updated last year
- ☆35Updated 9 months ago
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆59Updated 2 years ago
- [ICCV'23] Cascade-DETR: Delving into High-Quality Universal Object Detection☆97Updated last year
- [ICCV 2023] Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment☆43Updated last year
- ☆57Updated 7 months ago
- Official PyTorch implementation of CODA-LM(https://arxiv.org/abs/2404.10595)☆86Updated 3 months ago
- ☆17Updated 2 years ago
- [CVPR2023] Referring Multi-Object Tracking☆132Updated 8 months ago
- ☆37Updated last year
- OVTrack: Open-Vocabulary Multiple Object Tracking [CVPR 2023]☆98Updated 5 months ago
- [CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution☆45Updated 3 weeks ago
- Benchmark and model for step-by-step reasoning in autonomous driving.☆35Updated 2 weeks ago
- ☆58Updated last year
- ☆24Updated 10 months ago
- [CVPR2024 Highlight] Official repository of the paper "The devil is in the fine-grained details: Evaluating open-vocabulary object detect…☆51Updated 6 months ago
- A Multi-Modal Large Language Model with Retrieval-augmented In-context Learning capacity designed for generalisable and explainable end-t…☆88Updated 5 months ago
- Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)☆56Updated 11 months ago
- Official repository for the NuScenes-MQA. This paper is accepted by LLVA-AD Workshop at WACV 2024.☆25Updated last year
- Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples☆22Updated 4 months ago