dres-dev / DRESLinks
Distributed Retrieval Evaluation Server
☆15Updated 10 months ago
Alternatives and similar repositories for DRES
Users that are interested in DRES are comparing it to the libraries listed below
Sorting:
- Archive of Tasks and Results of the Video Browser Showdown☆13Updated 7 months ago
- Open-source release of the SOMHunter video retrieval tool☆23Updated 2 years ago
- ☆87Updated last year
- [ACM TOMM 2023] - Composed Image Retrieval using Contrastive Learning and Task-oriented CLIP-based Features☆186Updated 2 years ago
- Repository for "CoMix: Comprehensive Benchmark for Multi-Task Comic Understanding"☆13Updated 10 months ago
- [ICML'24 Oral] "MagicLens: Self-Supervised Image Retrieval with Open-Ended Instructions"☆199Updated 11 months ago
- Data release for the ImageInWords (IIW) paper.☆220Updated 10 months ago
- Official repository for the MMFM challenge☆25Updated last year
- CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts☆155Updated last year
- ☆185Updated 11 months ago
- PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models☆257Updated 2 months ago
- ☆190Updated 7 months ago
- An implementation of "CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model".☆145Updated 7 months ago
- [NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale☆198Updated last year
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆81Updated 11 months ago
- ☆41Updated 8 months ago
- Official PyTorch Implementation of "WordStylist: Styled Verbatim Handwritten Text Generation with Latent Diffusion Models" - ICDAR 2023☆82Updated last year
- Densely Captioned Images (DCI) dataset repository.☆191Updated last year
- The HierText dataset contains ~12k images from the Open Images dataset v6 with large amount of text entities. We provide word, line and p…☆298Updated 10 months ago
- Multimodal Semi-Supervised Learning for Text Recognition (SemiMTR)☆83Updated 2 years ago
- [ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion☆190Updated 2 months ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆76Updated 2 months ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Updated last year
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆137Updated last year
- Towards Video Text Visual Question Answering: Benchmark and Baseline☆38Updated last year
- [CVPR 24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadin…☆228Updated last year
- Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023☆28Updated 2 years ago
- GRiT: A Generative Region-to-text Transformer for Object Understanding (ECCV2024)☆335Updated last year
- Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"☆268Updated last year
- Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"☆145Updated last week