Shahzadnit/EZ-CLIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shahzadnit/EZ-CLIP)

Shahzadnit / EZ-CLIP

☆24

Alternatives and similar repositories for EZ-CLIP

Users that are interested in EZ-CLIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

engindeniz / vitis
View on GitHub
[ICCV 2023 CLVL Workshop] Zero-Shot and Few-Shot Video Question Answering with Multi-Modal Prompts
☆13Jan 13, 2025Updated last year
Visual-AI / FROSTER
View on GitHub
[ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
☆101Jan 14, 2025Updated last year
R00Kie-Liu / Sampler
View on GitHub
Task-adaptive Spatial-Temporal Video Sampler for Few-shot Action Recognition
☆14Dec 22, 2022Updated 3 years ago
TalalWasim / Vita-CLIP
View on GitHub
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126Jul 1, 2023Updated 3 years ago
nutuniv / SSRL
View on GitHub
☆19Jul 9, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
leonnnop / Locater
View on GitHub
[TPAMI 2023] Local-Global Context Aware Transformer for Language-Guided Video Segmentation
☆47Jan 20, 2024Updated 2 years ago
lovelyqian / AMeFu-Net
View on GitHub
Repository for the paper "Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition"
☆48Mar 22, 2022Updated 4 years ago
sauradip / STALE
View on GitHub
[ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "
☆116Aug 3, 2023Updated 2 years ago
ragor114 / InsertDiffusion
View on GitHub
Official implementation for the paper "InsertDiffusion: Identity Preserving Visualization of Objects through a Training-Free Diffusion Ar…
☆22Aug 12, 2024Updated last year
bellos1203 / TCD
View on GitHub
Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021
☆22Oct 14, 2022Updated 3 years ago
QQBrowserVideoSearch / CBVS-UniCLIP
View on GitHub
A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios
☆13Jan 24, 2024Updated 2 years ago
lilygeorgescu / AED
View on GitHub
A Background-Agnostic Framework with Adversarial Training for Abnormal Event Detection in Video
☆34Apr 12, 2022Updated 4 years ago
huangmozhi9527 / GMMFormer
View on GitHub
[AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval
☆21May 10, 2024Updated 2 years ago
PRS-Organization / PRS-Trial-Version
View on GitHub
Trial version for prs platform (python project). Please note that the complete experience requires downloading the Unity resource.
☆10Jun 26, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
JCruan519 / GIST
View on GitHub
(ACM MM24) This is the offical repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.
☆11Jan 28, 2024Updated 2 years ago
whwu95 / BIKE
View on GitHub
【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156Sep 9, 2024Updated last year
alibaba-mmai-research / CLIP-FSAR
View on GitHub
Code for our IJCV 2023 paper "CLIP-guided Prototype Modulating for Few-shot Action Recognition".
☆82Mar 7, 2024Updated 2 years ago
AmeenAli / VideoMatch
View on GitHub
☆14Jan 5, 2022Updated 4 years ago
Ray2OUC / AANet
View on GitHub
demo and pre-trained weight of AANet --- a dense descriptor for image matching.
☆10Jan 9, 2024Updated 2 years ago
muzairkhattak / ViFi-CLIP
View on GitHub
[CVPR 2023] Official repository of paper titled "Fine-tuned CLIP models are efficient video learners".
☆309Apr 3, 2024Updated 2 years ago
mondalanindya / MSQNet
View on GitHub
Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]
☆24Oct 20, 2023Updated 2 years ago
visipedia / ssw60
View on GitHub
Sapsucker Woods 60 Audiovisual Dataset
☆19Oct 7, 2022Updated 3 years ago
naver-ai / lut
View on GitHub
[ECCV 2024] Official PyTorch implementation of LUT "Learning with Unmasked Tokens Drives Stronger Vision Learners"
☆14Dec 1, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
openrsgis / SDCluster
View on GitHub
☆14Mar 28, 2025Updated last year
XiaoBuL / OmniCLIP
View on GitHub
[ECAI-2024] OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning
☆16Jan 7, 2025Updated last year
ThomasWangY / 2024-AAAI-HPT
View on GitHub
Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models (AAAI 2024)
☆75Feb 3, 2025Updated last year
hedongxiao-tju / NSLM
View on GitHub
Code & data accompanying the paper ["Unveiling Implicit Deceptive Patterns in Multi-modal Fake News via Neuro-Symbolic Reasoning"].
☆13Dec 21, 2023Updated 2 years ago
Yuting-Gao / PyramidCLIP
View on GitHub
Implementation of PyramidCLIP(NeurIPS2022).
☆31Nov 15, 2022Updated 3 years ago
jellyho / TwinVLA
View on GitHub
[ICLR 2026] TwinVLA : Data-Efficient Bimanual Manipulation with Twin Single-Arm Vision-Language-Action Models
☆16May 29, 2026Updated last month
Beyond-Zw / DLAN-AC
View on GitHub
☆11Feb 1, 2024Updated 2 years ago
XinyanLi2016 / ND-NER
View on GitHub
This is a named entity recognition (NER) dataset for OSINT towards the national defense domain.
☆10Apr 21, 2023Updated 3 years ago
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆614Dec 6, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ninatu / in_style
View on GitHub
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11Oct 5, 2023Updated 2 years ago
crodriguezo / TMLGA
View on GitHub
Repository of proposal-free temporal moment localization work
☆33Jun 11, 2024Updated 2 years ago
IlyaGusev / HeadlineCause
View on GitHub
A dataset of news headlines for detecting causalities
☆14May 9, 2022Updated 4 years ago
Ziyeeee / Policy-Lightning
View on GitHub
Policy-Lightning is a PyTorch Lightning-based implementation of GauDP, DP, DP3, etc.
☆16Jan 25, 2026Updated 6 months ago
RongKaiWeskerMA / INSTA
View on GitHub
The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
☆13Apr 14, 2024Updated 2 years ago
xh5a5n6k6 / image-stitching
View on GitHub
Produce panoramic image from multiple photographs with overlapping fields of view written in C++17.
☆10Feb 19, 2020Updated 6 years ago
dmoltisanti / air-cvpr23
View on GitHub
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13May 25, 2023Updated 3 years ago