ninatu / in_styleLinks

Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023

☆11

Alternatives and similar repositories for in_style

Users that are interested in in_style are comparing it to the libraries listed below

Sorting:

Annusha / xmic
X-MIC: Cross-Modal Instance Conditioning for Egocentric Action Generalization, CVPR 2024
☆11Updated last year
jochemloedeman / PGN
Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M…
☆43Updated last year
dmoltisanti / air-cvpr23
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13Updated 2 years ago
haoosz / ade-czsl
[CVPR 2023] Learning Attention as Disentangler for Compositional Zero-shot Learning
☆39Updated 2 years ago
TencentARC / TaCA
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Updated 2 years ago
yuhangzang / UPT
☆61Updated 8 months ago
Dawn-LX / OpenVoc-VidVRD
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Updated last year
Luoyadan / MM2020_ABG
official PyTorch implementation of paper "Adversarial Bipartite Graph Learning for Video Domain Adaptation" (MM2020 Oral)
☆10Updated 3 years ago
fmthoker / SEVERE-BENCHMARK
☆26Updated 2 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Updated 3 years ago
nirat1606 / OADis
Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022
☆35Updated 2 years ago
KaiyangZhou / on-device-dg
On-Device Domain Generalization
☆45Updated 3 years ago
MotasemAlfarra / Online_Test_Time_Adaptation
Revisiting Test Time Adaptation Under Online Evaluation
☆35Updated last year
sauradip / MUPPET
[ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"
☆15Updated 2 years ago
seonwoo-min / GVRT
[ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization
☆31Updated 2 years ago
gaopengcuhk / BALLAD
☆59Updated 3 years ago
Chuhanxx / helping_hand_for_egocentric_videos
Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'
☆33Updated 2 years ago
geehokim / Combinatorial-Inference
(NeurIPS 2019) Combinatorial Inference against Label Noise
☆11Updated last year
NVlabs / Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
☆73Updated 3 years ago
renjie-liang / HUAL
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning
☆14Updated 2 years ago
UniAdapter / UniAdapter
☆26Updated 2 years ago
skhcjh231 / MATR_codebase
☆22Updated 9 months ago
showlab / CLVQA
[AAAI2023] Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task (Oral)
☆40Updated last year
mengcaopku / DCNet
[ACM MM 22] Correspondence Matters for Video Referring Expression Comprehension
☆15Updated 3 years ago
wds2014 / ALIGN
Repo of NeurIPS23
☆18Updated 2 years ago
showlab / datacentric.vlp
Compress conventional Vision-Language Pre-training data
☆53Updated 2 years ago
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆66Updated 2 years ago
sheng-eatamath / S3A
repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)
☆25Updated last year
MCG-NJU / OCSampler
[CVPR 2022] OCSampler: Compressing Videos to One Clip with Single-step Sampling
☆17Updated 3 years ago
orrzohar / LOVM
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21Updated last year