dres-dev / DRESLinks
Distributed Retrieval Evaluation Server
☆16Updated last year
Alternatives and similar repositories for DRES
Users that are interested in DRES are comparing it to the libraries listed below
Sorting:
- Open-source release of the SOMHunter video retrieval tool☆24Updated 2 years ago
- Archive of Tasks and Results of the Video Browser Showdown☆13Updated 10 months ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆192Updated 2 years ago
- An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"☆181Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆84Updated last year
- ☆193Updated 10 months ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆196Updated 6 months ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆16Updated last year
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆203Updated 2 years ago
- ☆192Updated last year
- [CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks☆55Updated 2 years ago
- PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models☆261Updated 5 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆82Updated 5 months ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆53Updated last year
- Summary about Video-to-Text datasets. This repository is part of the review paper *Bridging Vision and Language from the Video-to-Text Pe…☆132Updated 2 years ago
- A curated list of video-text datasets in a variety of languages. These datasets can be used for video captioning (video description) or v…☆39Updated last year
- mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)☆228Updated 2 years ago
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆131Updated last year
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆302Updated last year
- [NeurIPS 2021] Moment-DETR code and QVHighlights dataset☆341Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆92Updated 10 months ago
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆188Updated 7 months ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models☆128Updated 3 months ago
- [AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding☆123Updated last year
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆379Updated 3 years ago
- Official pytorch repository for "QD-DETR : Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 …☆243Updated 5 months ago
- [ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"☆84Updated last year
- [NeurIPS 2023] Self-Chained Image-Language Model for Video Localization and Question Answering☆195Updated 2 years ago
- [ICCV 2021 Oral + TPAMI] Just Ask: Learning to Answer Questions from Millions of Narrated Videos☆126Updated 2 years ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆142Updated 3 weeks ago