dres-dev / DRESLinks
Distributed Retrieval Evaluation Server
☆15Updated 7 months ago
Alternatives and similar repositories for DRES
Users that are interested in DRES are comparing it to the libraries listed below
Sorting:
- Archive of Tasks and Results of the Video Browser Showdown☆12Updated 4 months ago
- Open-source release of the SOMHunter video retrieval tool☆21Updated 2 years ago
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆180Updated last year
- An official implementation for "X-CLIP: End-to-End Multi-grained Contrastive Learning for Video-Text Retrieval"☆165Updated last year
- [NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset☆280Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆81Updated 8 months ago
- [EMNLP2024 Demo], [ICASSP 2025] A user-friendly library for reproducible video moment retrieval and highlight detection. It also supports…☆162Updated last month
- mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)☆227Updated last year
- Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"☆89Updated 4 months ago
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆191Updated last year
- PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models☆257Updated last year
- ☆187Updated 4 months ago
- [CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".☆280Updated last year
- ☆86Updated last year
- [ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"☆183Updated 8 months ago
- ☆13Updated 2 years ago
- Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"☆30Updated 5 months ago
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆135Updated 11 months ago
- Densely Captioned Images (DCI) dataset repository.☆186Updated last year
- CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts☆151Updated last year
- Foundation Models for Video Understanding: A Survey☆126Updated last week
- Official repository for the MMFM challenge☆25Updated last year
- GRIT: Faster and Better Image-captioning Transformer (ECCV 2022)☆194Updated 2 years ago
- [TMM 2023] VideoXum: Cross-modal Visual and Textural Summarization of Videos☆45Updated last year
- [CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts☆325Updated last year
- EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties☆128Updated 8 months ago
- LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning☆140Updated 2 months ago
- ☆179Updated 9 months ago
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadin…☆226Updated 9 months ago
- ☆136Updated 9 months ago