ExplainableML/EgoCVR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ExplainableML/EgoCVR)

ExplainableML / EgoCVR

[ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval

☆41

Alternatives and similar repositories for EgoCVR

Users that are interested in EgoCVR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OmkarThawakar / composed-video-retrieval
View on GitHub
Composed Video Retrieval
☆62May 2, 2024Updated 2 years ago
lucas-ventura / CoVR
View on GitHub
Official PyTorch implementation of the paper "CoVR: Learning Composed Video Retrieval from Web Video Captions".
☆119Apr 21, 2026Updated 2 months ago
ExplainableML / Vision_by_Language
View on GitHub
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
☆89Jul 4, 2024Updated 2 years ago
icq-benchmark / icq-benchmark
View on GitHub
☆19Jul 28, 2025Updated 11 months ago
vl2g / CSTBIR
View on GitHub
Official Code for Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions
☆15Dec 27, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
May2333 / FDCA
View on GitHub
[ICLR 2025] This repo is the official implementation of our paper "Learning Fine-Grained Representations through Textual Token Disentangl…
☆23Jul 28, 2025Updated 11 months ago
iLearn-Lab / SIGIR24-DQU-CIR
View on GitHub
[SIGIR 2024] - Simple but Effective Raw-Data Level Multimodal Fusion for Composed Image Retrieval
☆44Jul 14, 2024Updated 2 years ago
jicheol93 / PLOT
View on GitHub
☆13Feb 13, 2025Updated last year
miccunifi / SEARLE
View on GitHub
[ICCV 2023] - Zero-shot Composed Image Retrieval with Textual Inversion
☆198Jul 31, 2025Updated 11 months ago
ExplainableML / ImageSelect
View on GitHub
Code for the paper "If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection"
☆27Jul 10, 2023Updated 3 years ago
Pter61 / context-i2w
View on GitHub
Context-I2W: Mapping Images to Context-dependent words for Accurate Zero-Shot Composed Image Retrieval [AAAI 2024 Oral]
☆54May 27, 2025Updated last year
Tanveer81 / RGNet
View on GitHub
This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos
☆20Mar 3, 2025Updated last year
iLearn-Lab / SIGIR24-FTI4CIR
View on GitHub
Codes of the Fine-grained Textual Inversion network for Zero-Shot Composed Image Retrieval
☆27Apr 9, 2026Updated 3 months ago
dhg-wei / MCL
View on GitHub
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
☆28Sep 27, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
miccunifi / CIRCO
View on GitHub
[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset
☆87Aug 6, 2025Updated 11 months ago
xuboshen / EgoNCEpp
View on GitHub
[ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?
☆13Apr 11, 2025Updated last year
Becomebright / GroundVQA
View on GitHub
Official PyTorch code of GroundVQA (CVPR'24)
☆63Sep 13, 2024Updated last year
facebookresearch / EgocentricUserAdaptation
View on GitHub
In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…
☆15May 18, 2026Updated 2 months ago
navervision / lincir
View on GitHub
Official Pytorch implementation of LinCIR: Language-only Training of Zero-shot Composed Image Retrieval (CVPR 2024)
☆148Jan 5, 2026Updated 6 months ago
iLearn-Lab / TOIS25-Awesome-Composed-Image-Retrieval
View on GitHub
Collection of Composed Image Retrieval (CIR) papers.
☆360Jun 8, 2026Updated last month
Jazzcharles / Egoinstructor
View on GitHub
Pytorch implementation for Egoinstructor at CVPR 2024
☆28Dec 1, 2024Updated last year
ninatu / howtocaption
View on GitHub
Official implementation of "HowToCaption: Prompting LLMs to Transform Video Annotations at Scale." ECCV 2024
☆58Aug 19, 2025Updated 11 months ago
HuiGuanLab / DL-DKD
View on GitHub
Source code of the paper Dual Learning with Dynamic Knowledge Distillation and Soft Alignment for Partially Relevant Video Retrieval
☆19May 13, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / stepdiff
View on GitHub
Data release for Step Differences in Instructional Video (CVPR24)
☆15Jun 19, 2024Updated 2 years ago
youngkyunJang / VDG
View on GitHub
Visual Delta Generator with Large Multi-modal Model for Semi-supervised Composed Image Retrieval - CVPR2024
☆21May 30, 2024Updated 2 years ago
HuiGuanLab / ms-sl
View on GitHub
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆57Nov 4, 2024Updated last year
facebookresearch / ego-env
View on GitHub
Human-centric environment representations from egocentric video
☆15Feb 5, 2026Updated 5 months ago
AdaCheng / EgoThink
View on GitHub
[CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan…
☆64Mar 25, 2025Updated last year
OpenGVLab / EgoExoLearn
View on GitHub
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆85Aug 26, 2025Updated 10 months ago
qirui-chen / MultiHop-EgoQA
View on GitHub
[AAAI 2025] Grounded Multi-Hop VideoQA in Long-Form Egocentric Videos
☆38May 27, 2025Updated last year
BolinLai / CSTS
View on GitHub
[ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".
☆16Feb 24, 2025Updated last year
hrtang22 / MUSE
View on GitHub
Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"
☆26Feb 2, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
alanaai / EVUD
View on GitHub
Egocentric Video Understanding Dataset (EVUD)
☆34Jul 4, 2024Updated 2 years ago
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
google-research / composed_image_retrieval
View on GitHub
☆197May 9, 2026Updated 2 months ago
facebookresearch / genecis
View on GitHub
Code and Models for "GeneCIS A Benchmark for General Conditional Image Similarity"
☆61Jun 12, 2023Updated 3 years ago
NNNNAI / Ego4d_NLQ_2022_1st_Place_Solution
View on GitHub
The 1st place solution of 2022 Ego4d Natural Language Queries.
☆32Sep 5, 2022Updated 3 years ago
Sid2697 / HOI-Ref
View on GitHub
Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"
☆30Apr 16, 2024Updated 2 years ago
musicman217 / Text-Proxy
View on GitHub
Text Proxy: Decomposing Retrieval from a 1-to-N Relationship into N 1-to-1 Relationships for Text-Video Retrieval -- AAAI2025
☆21May 8, 2026Updated 2 months ago