princetonvisualai/MQVR

☆26

Alternatives and similar repositories for MQVR

Users that are interested in MQVR are comparing it to the libraries listed below.

showlab / DemoVLP
[Arxiv2022] Revitalize Region Feature for Democratizing Video-Language Pre-training
☆22 · Mar 19, 2022 · Updated 4 years ago
hbchen121 / AICITY2022_Track2_SSM
🏆 The 1st Place Solution for AICity2022 Challenge Track2: Natural Language-Based Vehicle Retrieval.
☆12 · Jul 25, 2022 · Updated 3 years ago
mwray / Semantic-Video-Retrieval
Code and benchmarks for the Semantic Video Retrieval Task
☆53 · Oct 18, 2022 · Updated 3 years ago
TencentARC / MCQ
Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).
☆141 · Jul 20, 2022 · Updated 4 years ago
willard-yuan / video-text-retrieval-papers
☆15 · Sep 16, 2021 · Updated 4 years ago
YYJMJC / Compositional-Temporal-Grounding
☆31 · Mar 24, 2022 · Updated 4 years ago
starmemda / CAMoE
☆100 · Sep 27, 2021 · Updated 4 years ago
JustinYuu / MM_Pyramid
[ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing
☆15 · Aug 26, 2022 · Updated 3 years ago
idealwei / SuperPixelPool.pytorch
Superpixel average pooling and superpixel max pooling, PyTorch implementations
☆25 · Nov 14, 2018 · Updated 7 years ago
CryhanFang / CLIP2Video
☆260 · Dec 10, 2022 · Updated 3 years ago
transvcl / TransVCL
TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]
☆60 · Feb 25, 2023 · Updated 3 years ago
roeiherz / ORViT
Object-Region Video Transformers
☆24 · Mar 24, 2022 · Updated 4 years ago
TencentARC / common_trainer
Common template for PyTorch projects. Easy to extend and modify for new projects.
☆13 · Dec 13, 2022 · Updated 3 years ago
m-bain / CondensedMovies-chall
Condensed Movies Challenge 2021
☆22 · Sep 21, 2022 · Updated 3 years ago
MCC-WH / CSA
Official implementation of NeurIPS 2021 paper "Contextual Similarity Aggregation with Self-attention for Visual Re-ranking"
☆26 · Apr 19, 2022 · Updated 4 years ago
VALUE-Leaderboard / StarterCode
Starter Code for VALUE benchmark
☆79 · Aug 23, 2022 · Updated 3 years ago
mzhaoshuai / CenterCLIP
[SIGIR 2022] CenterCLIP: Token Clustering for Efficient Text-Video Retrieval.
☆135 · May 4, 2022 · Updated 4 years ago
MCG-NJU / MMN
[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
☆91 · Nov 16, 2022 · Updated 3 years ago
mengcaopku / LocVTP
[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39 · Jul 29, 2022 · Updated 3 years ago
xwen99 / temporal_context_aggregation
(WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning
☆29 · Aug 4, 2021 · Updated 4 years ago
mever-team / distill-and-select
Authors' official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…
☆71 · Apr 12, 2023 · Updated 3 years ago
ioanacroi / qb-norm
Cross Modal Retrieval with Querybank Normalisation
☆57 · Nov 21, 2023 · Updated 2 years ago
WikiChao / Ego-AV-Loc
[CVPR 2023] Egocentric Audio-Visual Object Localization
☆27 · Jan 6, 2024 · Updated 2 years ago
justforforfor / insclr
☆27 · Dec 3, 2021 · Updated 4 years ago
microsoft / LAVENDER
A Unified Framework for Video-Language Understanding
☆62 · Jun 17, 2023 · Updated 3 years ago
tsujuifu / pytorch_violet
A PyTorch implementation of VIOLET
☆138 · Dec 17, 2023 · Updated 2 years ago
farewellthree / STAN
Official PyTorch implementation of the paper "Revisiting Temporal Modeling for CLIP-based Image-to-Video Knowledge Transferring"
☆107 · Jan 28, 2024 · Updated 2 years ago
aysebilgegunduz / ShotBoundaryDetection
Detects shot boundaries in news video with K-Means, using the Bhattacharyya coefficient as the distance measure.
☆10 · Jun 1, 2017 · Updated 9 years ago
FactoDeepLearning / MultitaskVLFM
☆25 · Aug 1, 2023 · Updated 2 years ago
xwen99 / CCF-BDCI-VideoCopyDetection
8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…
☆30 · Nov 19, 2019 · Updated 6 years ago
kennymckormick / TransRank
[CVPR2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…
☆18 · Aug 1, 2022 · Updated 3 years ago
fhlt / shot_boundary_detection
shot_boundary_detection
☆10 · Nov 26, 2019 · Updated 6 years ago
virajprabhu / LANCE
LANCE: Stress-testing Visual Models by Generating Language-guided Counterfactual Images
☆31 · Nov 30, 2023 · Updated 2 years ago
usc-sail / mica-subtitle-aligned-movie-sounds
A dataset for Audio-Visual Sound Event Detection in Movies
☆26 · Jan 23, 2023 · Updated 3 years ago
Huntersxsx / RaNet
Source code of RaNet (EMNLP 2021)
☆30 · May 31, 2022 · Updated 4 years ago
CuthbertCai / Ask-Confirm
Ask&Confirm: Active Detail Enriching for Cross-Modal Retrieval with Partial Query (ICCV2021)
☆20 · Dec 4, 2021 · Updated 4 years ago
ArrowLuo / CLIP4Clip
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
☆1,029 · Apr 12, 2024 · Updated 2 years ago
GeWu-Lab / MWAFM
Multi-Scale Attention for Audio Question Answering
☆28 · Jul 19, 2023 · Updated 3 years ago
seonwoo-min / GVRT
[ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization
☆30 · Apr 7, 2023 · Updated 3 years ago