allenai / reclipView external linksLinks
☆86Apr 15, 2022Updated 3 years ago
Alternatives and similar repositories for reclip
Users that are interested in reclip are comparing it to the libraries listed below
Sorting:
- Official code for Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions (CVPR 2024)☆28Jun 21, 2024Updated last year
- [CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"☆128Mar 17, 2025Updated 10 months ago
- Official Codes for Fine-Grained Visual Prompting, NeurIPS 2023☆56Feb 1, 2024Updated 2 years ago
- Official Implementation for paper "Referring Transformer: A One-step Approach to Multi-task Visual Grounding" Neurips 2021☆68May 26, 2022Updated 3 years ago
- ☆65Oct 11, 2023Updated 2 years ago
- A lightweight codebase for referring expression comprehension and segmentation☆57May 21, 2022Updated 3 years ago
- Related papers about Referring Image Segmentation (RIS)☆16Dec 26, 2023Updated 2 years ago
- An official PyTorch implementation of the CRIS paper☆280Jun 9, 2024Updated last year
- [CVPR 2022] Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding☆153Jul 13, 2024Updated last year
- Data Release for VALUE Benchmark☆30Feb 16, 2022Updated 4 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- [CVPR 2022 Oral] TubeDETR: Spatio-Temporal Video Grounding with Transformers☆192Sep 24, 2023Updated 2 years ago
- SeqTR: A Simple yet Universal Network for Visual Grounding☆144Oct 30, 2024Updated last year
- Improving One-stage Visual Grounding by Recursive Sub-query Construction, ECCV 2020☆89Sep 30, 2021Updated 4 years ago
- [AAAI2023] Repo for the paper ''End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation''.☆23Apr 1, 2023Updated 2 years ago
- Improving Visual Grounding with Visual-Linguistic Verification and Iterative Reasoning, CVPR 2022☆96Dec 2, 2022Updated 3 years ago
- ORES: Open-vocabulary Responsible Visual Synthesis☆14Dec 12, 2023Updated 2 years ago
- Code for the paper "Data Attribution for Text-to-Image Models by Unlearning Synthesized Images."☆17May 23, 2025Updated 8 months ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [BMVC22] Official Implementation of ViCHA: "Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment"☆55Oct 20, 2022Updated 3 years ago
- [WACV 2024] Instruct Me More! Random Prompting for Visual In-Context Learning☆18May 7, 2025Updated 9 months ago
- Code release for "Segment, Select, Correct: A Framework for Weakly-Supervised Referring Segmentation"☆14Oct 23, 2023Updated 2 years ago
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding☆57Nov 28, 2022Updated 3 years ago
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Oct 13, 2022Updated 3 years ago
- Referring Expression Datasets API☆556Aug 27, 2024Updated last year
- ☆41Jun 3, 2022Updated 3 years ago
- ☆28Jul 22, 2024Updated last year
- ☆221Apr 13, 2023Updated 2 years ago
- Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners☆116Sep 15, 2022Updated 3 years ago
- ☆1,047Oct 3, 2022Updated 3 years ago
- Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).☆141Jul 20, 2022Updated 3 years ago
- [CVPR 2022] Official code for "RegionCLIP: Region-based Language-Image Pretraining"☆807Mar 20, 2024Updated last year
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- Referring Expression Parser☆27Feb 10, 2018Updated 8 years ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆155Apr 30, 2024Updated last year
- ☆31Mar 24, 2022Updated 3 years ago
- Implementation of Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic☆278Sep 17, 2022Updated 3 years ago
- [ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension☆15Sep 4, 2022Updated 3 years ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Sep 19, 2023Updated 2 years ago