Shuyu-XJTU / SVTALinks
The official repo of "Towards Scalable Video Anomaly Retrieval: A Synthetic Video-Text Benchmark"
β17Updated this week
Alternatives and similar repositories for SVTA
Users that are interested in SVTA are comparing it to the libraries listed below
Sorting:
- SEED Datasetβ22Updated this week
- π¨Official Repo for Every Painting Awakened: A Training-free Framework for Painting-to-Animation Generationβ54Updated last month
- Offical repo for ECCV 2024: Depth-Aware Blind Image Decomposition for Real-World Weather Recoveryβ13Updated last year
- CVPR2025: Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioningβ32Updated 2 months ago
- ICLRβ24 Offical Implementation of Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularizationβ72Updated last year
- β¨A curated list of papers on the uncertainty in multi-modal large language model (MLLM).β45Updated 2 months ago
- β16Updated last year
- Official Implementation for ACM MM2024 paper "VrdONE: One-stage Video Visual Relation Detection".β10Updated 6 months ago
- Implementation of the paper Knowledge-Enhanced Dual-stream Zero-shot Composed Image Retrieval (CVPR 2024)β18Updated 7 months ago
- Official implementation for "Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter"β40Updated last year
- [AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.β41Updated 7 months ago
- [NIPS2023] This is an official implementation of paper "DAC-DETR: Divide the Attention Layers and Conquer".β55Updated 11 months ago
- β68Updated this week
- Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".β13Updated 3 months ago
- [CVPR'25] ππ EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering