pxiangwu / FORB
"FORB: A Flat Object Retrieval Benchmark for Universal Image Embedding", NeurIPS 2023 Datasets and Benchmarks Track
☆12Updated last year
Alternatives and similar repositories for FORB
Users who are interested in FORB are comparing it to the libraries listed below
- Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)☆142Updated last month
- [AAAI 2023] The official implementation of "A Benchmark and Asymmetrical-Similarity Learning for Practical Image Copy Detection"☆22Updated last year
- Code for the Video Similarity Challenge.☆80Updated 2 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]☆57Updated 2 years ago
- [CVPR 2023] implementation of Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information.☆91Updated 2 years ago
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆28Updated 2 years ago
- Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"☆107Updated 2 years ago
- 2nd place solution to Google Universal Image Embedding Challenge!☆43Updated 3 years ago
- [CVPR 2022 - Demo Track] - Effective conditioned and composed image retrieval combining CLIP-based features☆84Updated last year
- PyTorch implementation of Omni-DETR for omni-supervised object detection: https://arxiv.org/abs/2203.16089☆69Updated 3 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Updated 3 years ago
- [CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Updated 3 years ago
- Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"☆61Updated 2 years ago
- Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models☆129Updated 3 months ago
- Exploiting unlabeled data with vision and language models for object detection, ECCV 2022☆94Updated 2 years ago
- [ICCV 2023] ALIP: Adaptive Language-Image Pre-training with Synthetic Caption☆104Updated 2 years ago
- [ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset☆83Updated 6 months ago
- ☆27Updated 4 years ago
- CLIP4IDC: CLIP for Image Difference Captioning (AACL 2022)☆36Updated 3 years ago
- OvarNet official implement of the paper "OvarNet: Towards Open-vocabulary Object Attribute Recognition"☆106Updated 2 years ago
- Turning to Video for Transcript Sorting☆49Updated 2 years ago
- A pytorch Implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆64Updated 2 years ago
- ☆38Updated 3 years ago
- Referring Image Segmentation Benchmarking with Segment Anything Model (SAM)☆38Updated 2 years ago
- Official repository of "Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach" (ACL 2024 Oral)☆34Updated 10 months ago
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- [CVPR 2022 Challenge Rank 1st] The official code for V2L: Leveraging Vision and Vision-language Models into Large-scale Product Retrieval…☆29Updated 3 years ago
- [CBMI 2024 Best Paper] Official repository of the paper "Is CLIP the main roadblock for fine-grained open-world perception?".☆32Updated 8 months ago
- ☆23Updated 3 years ago
- [ICLR 2023] PyTorch implementation of VLDet (https://arxiv.org/abs/2211.14843)☆190Updated last year