fmthoker / SEVERE-BENCHMARKLinks

☆26

Alternatives and similar repositories for SEVERE-BENCHMARK

Users that are interested in SEVERE-BENCHMARK are comparing it to the libraries listed below

Sorting:

nirat1606 / OADis
Official code for "Disentangling Visual Embeddings for Attributes and Objects" Published at CVPR 2022
☆35Updated 2 years ago
adobe-research / vaw_dataset
This repository provides data for the VAW dataset as described in the CVPR 2021 paper titled "Learning to Predict Visual Attributes in th…
☆68Updated 3 years ago
showlab / datacentric.vlp
Compress conventional Vision-Language Pre-training data
☆52Updated 2 years ago
RAIVNLab / CREPE
[CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?
☆35Updated 2 years ago
kahnchana / clippy
Perceptual Grouping in Contrastive Vision-Language Models (ICCV'23)
☆37Updated last year
QUVA-Lab / SIGMA
☆20Updated 4 months ago
yuxiaochen1103 / FDT
☆62Updated 2 years ago
microsoft / LAVENDER
A Unified Framework for Video-Language Understanding
☆60Updated 2 years ago
seonwoo-min / GVRT
[ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization
☆31Updated 2 years ago
TengdaHan / TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆118Updated 2 years ago
md-mohaiminul / ViS4mer
☆57Updated 3 years ago
naver / cog
ImageNet-CoG is a benchmark for concept generalization. It provides a full evaluation framework for pre-trained visual representations wh…
☆26Updated 4 years ago
Chuhanxx / Temporal_Query_Networks
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Updated 3 years ago
dmoltisanti / air-cvpr23
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13Updated 2 years ago
klauscc / VindLU
☆110Updated 2 years ago
shvdiwnkozbw / Self-supervised-Video-Concept
Code for Static and Dynamic Concepts for Self-supervised Video Representation Learning.
☆11Updated 3 years ago
showlab / Region_Learner
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆42Updated 3 years ago
ChengHan111 / VPT-or-FT
Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)
☆13Updated last year
NVlabs / Bongard-HOI
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
☆72Updated 3 years ago
gaopengcuhk / BALLAD
☆59Updated 3 years ago
17Skye17 / VideoLT
Official Code for VideoLT: Large-scale Long-tailed Video Recognition (ICCV 2021)
☆34Updated 3 years ago
yuhui-zh15 / drml
Official Code Release for "Diagnosing and Rectifying Vision Models using Language" (ICLR 2023)
☆34Updated 2 years ago
DeLightCMU / ElaborativeRehearsal
This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)
☆36Updated 3 years ago
allenai / reclip
☆88Updated 3 years ago
facebookresearch / CiT
Code for the paper titled "CiT Curation in Training for Effective Vision-Language Data".
☆78Updated 2 years ago
ZhangYuanhan-AI / OmniBenchmark
[ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.
☆110Updated last year
TencentARC / TaCA
Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".
☆16Updated 2 years ago
alibaba-mmai-research / HiCo
CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
☆18Updated 3 years ago
SivanDoveh / TSVLC
Repository for the paper: Teaching Structured Vision & Language Concepts to Vision & Language Models
☆47Updated 2 years ago
Dawn-LX / OpenVoc-VidVRD
Official code for the ICLR2023 paper Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
☆43Updated last year