kevinliang888/IVR-QA-baselines

[ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers

☆20

Alternatives and similar repositories for IVR-QA-baselines

Users that are interested in IVR-QA-baselines are comparing it to the libraries listed below.

jyliu-98 / MoSketch
[ICCV 2025] This repo is the official implementation of "Multi-Object Sketch Animation by Scene Decomposition and Motion Planning"
☆28 · Jul 30, 2025 · Updated 11 months ago
patrick-0817 / T-MASS-dataleakage
☆10 · Nov 27, 2024 · Updated last year
HuiGuanLab / ms-sl
Source code of our MM'22 paper Partially Relevant Video Retrieval
☆56 · Nov 4, 2024 · Updated last year
HKUST-LongGroup / DyME
[ICLR 2026] Empowering Small VLMs to Think with Dynamic Memorization and Exploration
☆18 · Mar 18, 2026 · Updated 4 months ago
levymsn / ChatIR
Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆33 · Feb 5, 2025 · Updated last year
mwray / Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
☆53 · Oct 18, 2022 · Updated 3 years ago
uvavision / DrillDown
[NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries
☆12 · Apr 15, 2022 · Updated 4 years ago
zzhbrr / CMU15445-2022-notes
My notes for cmu15445 2022
☆14 · Feb 8, 2023 · Updated 3 years ago
StanfordVL / STGraph
Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"
☆23 · Mar 4, 2020 · Updated 6 years ago
interactive-cookbook / ara
Corpus and code for Aligned Recipe Actions (ARA) corpus, EMNLP 2021
☆10 · May 22, 2024 · Updated 2 years ago
haoyanbin918 / Attention-in-Attention
☆12 · Aug 5, 2022 · Updated 3 years ago
ztinpn / coloredPrintHelper
Split a PDF into color and black-and-white parts for easier printing
☆11 · Mar 9, 2025 · Updated last year
CSC2548 / image_caption_gan
☆10 · May 4, 2018 · Updated 8 years ago
DFKI-NLP / REval
[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction
☆13 · Apr 21, 2020 · Updated 6 years ago
ezeli / InSentiCap_model
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11 · Jul 18, 2022 · Updated 4 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11 · Jul 28, 2022 · Updated 4 years ago
h-munakata / Lighthouse-Wrapper-for-Audio-Moment-Retrieval
☆13 · Mar 23, 2026 · Updated 4 months ago
ruc-aimc-lab / TeachCLIP
[CVPR 2024] TeachCLIP for Text-to-Video Retrieval
☆42 · May 7, 2025 · Updated last year
FingerRec / OA-Transformer
[CVPR 2022] The code for our paper 《Object-aware Video-language Pre-training for Retrieval》
☆61 · May 25, 2022 · Updated 4 years ago
PKU-ICST-MIPL / MGAH_TMM2019
Source code of our TMM 2019 paper "Multi-pathway Generative Adversarial Hashing for Unsupervised Cross-modal Retrieval"
☆12 · Jun 17, 2019 · Updated 7 years ago
aimh-lab / visione
An AI-powered interactive video retrieval system
☆60 · Updated this week
TIGER-AI-Lab / VideoEval-Pro
VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]
☆15 · Jun 1, 2026 · Updated last month
CV-IP / VFD
This is the release code for CVPR2022 paper "Voice-Face Homogeneity Tells Deepfake".
☆15 · Mar 7, 2022 · Updated 4 years ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆25 · Oct 31, 2024 · Updated last year
XLearning-SCU / 2023-ICCV-COMMON
This repo contains the code and data of "Graph Matching with Bi-level Noisy Correspondence".
☆20 · Jul 28, 2023 · Updated 3 years ago
tomekkorbak / treehopper
A Tree-LSTM-based dependency tree sentiment labeler
☆15 · May 9, 2019 · Updated 7 years ago
bofang98 / UATVR
[ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval
☆13 · Nov 5, 2023 · Updated 2 years ago
DanielMengLiu / DeepLip
Deep-learning-based audio-visual lip biometrics
☆15 · May 9, 2023 · Updated 3 years ago
facebookresearch / Llip
Official PyTorch codebase for the Modeling Caption Diversity in Contrastive Vision-Language Pretraining paper.
☆19 · Mar 28, 2025 · Updated last year
emerisly / EDIS
Entity-Driven Image Search over Multimodal Web Content (EMNLP 2023)
☆26 · Dec 2, 2023 · Updated 2 years ago
ligeng0197 / Awesome-Thinking-With-Images
Latest open-source "Thinking with images" (O3/O4-mini) papers, covering training-free, SFT-based, and RL-enhanced methods for "fine-grain…
☆113 · Aug 21, 2025 · Updated 11 months ago
svdbase / SVD-download
This repo is used for downloading the videos for SVD dataset.
☆18 · Aug 16, 2020 · Updated 5 years ago
zexupan / reentry
☆18 · Nov 22, 2024 · Updated last year
lavendelion / vgg16_for_CIFAR10_with_pytorch
build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets
☆10 · Mar 31, 2019 · Updated 7 years ago
eth-lre / LLM_ICL
ACL24
☆11 · Jun 7, 2024 · Updated 2 years ago
piotr-bojanowski / action-ordering
Code for an ECCV2014 paper
☆12 · Feb 10, 2015 · Updated 11 years ago
knightyxp / DGL
[AAAI 2024] DGL: Dynamic Global-Local Prompt Tuning for Text-Video Retrieval.
☆49 · Oct 14, 2024 · Updated last year
vra / easybox
A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`
☆10 · May 28, 2021 · Updated 5 years ago
szdr / pytorch-ranknet
☆11 · Dec 23, 2018 · Updated 7 years ago