engindeniz/vitis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/engindeniz/vitis)

engindeniz / vitis

[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts

☆13

Alternatives and similar repositories for vitis

Users that are interested in vitis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

doc-doc / HQGA
View on GitHub
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering (AAAI'22, Oral)
☆35Sep 17, 2022Updated 3 years ago
Mia-YatingYu / STDD
View on GitHub
[AAAI'25]: Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP
☆23Aug 5, 2025Updated 11 months ago
wlin-at / MAXI
View on GitHub
MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)
☆31Sep 5, 2023Updated 2 years ago
NJUPT-MCC / DualVGR-VideoQA
View on GitHub
Implementation for the journal paper "DualVGR: A Dual-Visual Graph Reasoning Unit for Video Question Answering" (Jianyu et al., IEEE Tran…
☆18Jun 22, 2021Updated 5 years ago
ByZ0e / Glance-Focus
View on GitHub
This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)
☆31Jun 28, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
antoyang / FrozenBiLM
View on GitHub
[NeurIPS 2022] Zero-Shot Video Question Answering via Frozen Bidirectional Language Models
☆159Dec 9, 2024Updated last year
wds2014 / ALIGN
View on GitHub
Repo of NeurIPS23
☆17Oct 25, 2023Updated 2 years ago
jiazheng-xing / SloshNet
View on GitHub
[AAAI2023] Revisiting the Spatial and Temporal Modeling for Few-shot Action Recognition (SloshNet)
☆14Jan 10, 2024Updated 2 years ago
bladewaltz1 / PromptSwitch
View on GitHub
☆30Aug 14, 2023Updated 2 years ago
KPeng9510 / Trans4SOAR
View on GitHub
☆14Apr 1, 2023Updated 3 years ago
Shahzadnit / EZ-CLIP
View on GitHub
☆24May 11, 2025Updated last year
ArsenalCheng / Meta-Adapter
View on GitHub
[NeurIPS 2023] Meta-Adapter
☆48Nov 21, 2023Updated 2 years ago
alibaba-mmai-research / HyRSMPlusPlus
View on GitHub
Code for our paper "HyRSM++: Hybrid Relation Guided Temporal Set Matching for Few-shot Action Recognition".
☆15Jan 3, 2023Updated 3 years ago
intel / TVP
View on GitHub
☆15Aug 4, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jbertrand89 / matching_based_fsar
View on GitHub
☆12Mar 30, 2023Updated 3 years ago
bofang98 / UATVR
View on GitHub
[ICCV'23] UATVR: Uncertainty-Adaptive Text-Video Retrieval
☆13Nov 5, 2023Updated 2 years ago
starrycos / PAINet
View on GitHub
[ICCV'23] PAINet: Parallel Attention Interaction Network for Few-shot Skeleton-based Action Recognition
☆11Oct 14, 2023Updated 2 years ago
Sarinda251 / CDFSL-V
View on GitHub
Accepted at ICCV '23
☆16Oct 4, 2023Updated 2 years ago
minghangz / SPL
View on GitHub
Generating Structured Pseudo Labels for Noise-resistant Zero-shot Video Sentence Localization
☆16Jul 20, 2023Updated 3 years ago
AIM3-RUC / Youmakeup_Challenge2022
View on GitHub
☆17Jun 15, 2022Updated 4 years ago
XLiu443 / Tem-adapter
View on GitHub
[ICCV2023] Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer
☆37Oct 18, 2023Updated 2 years ago
R00Kie-Liu / TA2N
View on GitHub
TA2N: Two-Stage Action Alignment Network for Few-Shot Action Recognition
☆17Mar 26, 2024Updated 2 years ago
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
DeLightCMU / ElaborativeRehearsal
View on GitHub
This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)
☆37Apr 9, 2022Updated 4 years ago
Huntersxsx / MGPN
View on GitHub
source code of our MGPN in SIGIR 2022
☆18Jun 8, 2022Updated 4 years ago
htyao89 / Textual-based_Class-aware_prompt_tuning
View on GitHub
☆33Mar 7, 2024Updated 2 years ago
yanhengwang-heu / SpectralKAN
View on GitHub
☆15Jan 16, 2026Updated 6 months ago
doc-doc / NExT-GQA
View on GitHub
Can I Trust Your Answer? Visually Grounded Video Question Answering (CVPR'24, Highlight)
☆89Jul 1, 2024Updated 2 years ago
InterDigitalInc / DialogSummary-VideoQA
View on GitHub
☆10Mar 30, 2022Updated 4 years ago
IMCCretrieval / MomentDiff
View on GitHub
MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023
☆80Nov 2, 2023Updated 2 years ago
LeiJiangJNU / R3FA
View on GitHub
3D Face Alignment ---The 10th International Conference on Image and Graphics(ICIG2019)-Oral
☆11Dec 3, 2019Updated 6 years ago
jhayes14 / advsteg
View on GitHub
Steganography via adversarial training
☆15Dec 1, 2018Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
rohitgandikota / Hiding-Images-using-VAE-Genarative-Adversarial-Networks
View on GitHub
Variational Autoencoder-Generative Adversarial Network (VAE-GAN) to hide data inside images
☆12Nov 9, 2019Updated 6 years ago
VRU-NExT / VideoQA
View on GitHub
☆104Oct 19, 2022Updated 3 years ago
pulkitkumar95 / tats
View on GitHub
☆20Feb 20, 2025Updated last year
alibaba-mmai-research / CLIP-FSAR
View on GitHub
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
☆82Mar 7, 2024Updated 2 years ago
AmeenAli / VideoMatch
View on GitHub
☆14Jan 5, 2022Updated 4 years ago
shashankvkt / video_object_segmentation
View on GitHub
Implementation of "Youtube-VOS: Sequence-to-sequence video object segmentation"
☆14Oct 15, 2019Updated 6 years ago
bhrqw / SADA
View on GitHub
CVPR2023: Few-Shot Learning with Visual Distribution Calibration and Cross-Modal Distribution Alignment
☆14May 19, 2023Updated 3 years ago