yukw777/EILEV

EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties

☆133

Alternatives and similar repositories for EILEV

Users who are interested in EILEV are comparing it to the repositories listed below.

yukw777 / VideoBLIP
Supercharged BLIP-2 that can handle videos
☆123 · Dec 1, 2023 · Updated 2 years ago
alanaai / EVUD
Egocentric Video Understanding Dataset (EVUD)
☆34 · Jul 4, 2024 · Updated 2 years ago
lbaermann / qaego4d
Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"
☆30 · Aug 28, 2023 · Updated 2 years ago
facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆110 · Jul 2, 2024 · Updated 2 years ago
deep-diver / Vid2Persona
This project breathes life into video characters by using AI to describe their personality and then chat with you as them.
☆48 · Mar 12, 2024 · Updated 2 years ago
Becomebright / GroundVQA
Official PyTorch code of GroundVQA (CVPR'24)
☆63 · Sep 13, 2024 · Updated last year
camenduru / DragNUWA
☆19 · Jan 8, 2024 · Updated 2 years ago
houzhijian / GroundNLQ
The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023
☆18 · Jan 23, 2024 · Updated 2 years ago
LgQu / TIGeR
Code for paper: Unified Text-to-Image Generation and Retrieval
☆16 · Updated this week
facebookresearch / ego4d-goalstep
Ego4D Goal-Step: Toward Hierarchical Understanding of Procedural Activities (NeurIPS 2023)
☆61 · Apr 15, 2024 · Updated 2 years ago
sayakpaul / single-video-curation-svd
Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.
☆81 · Dec 30, 2023 · Updated 2 years ago
CeeZh / LLoVi
Official implementation for "A Simple LLM Framework for Long-Range Video Question-Answering"
☆106 · Oct 27, 2024 · Updated last year
m-bain / webvid
Large-scale text-video dataset. 10 million captioned short videos.
☆684 · Aug 14, 2024 · Updated last year
UCSC-VLAA / CLIPS
An Enhanced CLIP Framework for Learning with Synthetic Captions
☆40 · Apr 18, 2025 · Updated last year
adobe-research / llava-score
☆11 · Oct 2, 2024 · Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11 · May 24, 2023 · Updated 3 years ago
zeyofu / BLINK_Benchmark
This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.or…
☆171 · Sep 27, 2025 · Updated 9 months ago
ExponentialML / Video-BLIP2-Preprocessor
A simple script that reads a directory of videos, grabs a random frame, and automatically discovers a prompt for it
☆142 · Jan 22, 2024 · Updated 2 years ago
yiyixuxu / TimeSformer-rolled-attention
Visualizing the learned space-time attention using Attention Rollout
☆41 · Apr 1, 2022 · Updated 4 years ago
Vision-CAIR / LongVU
[ICML 2025] Official PyTorch implementation of LongVU
☆429 · May 8, 2025 · Updated last year
LHL3341 / ContextBLIP
ContextBLIP : Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]
☆11 · May 17, 2024 · Updated 2 years ago
Cuberick-Orion / CIRPLANT
Official implementation of the Composed Image Retrieval using Pretrained LANguage Transformers (CIRPLANT) | ICCV 2021 - Image Retrieval o…
☆40 · Jun 26, 2024 · Updated 2 years ago
OpenGVLab / Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
☆3,343 · Updated this week
BAAI-DCAI / Visual-Instruction-Tuning
SVIT: Scaling up Visual Instruction Tuning
☆167 · Jun 20, 2024 · Updated 2 years ago
tinyvision / SOLIDER-PersonSearch
☆14 · Apr 3, 2023 · Updated 3 years ago
Buzz-Beater / EgoTaskQA
Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos.
☆44 · Apr 17, 2023 · Updated 3 years ago
facebookresearch / open-eqa
OpenEQA Embodied Question Answering in the Era of Foundation Models
☆366 · Sep 20, 2024 · Updated last year
TengdaHan / AutoAD
[CVPR'23 Highlight] AutoAD: Movie Description in Context.
☆104 · Nov 6, 2024 · Updated last year
Synteraction-Lab / PANDALens
[CHI24] AI-Assisted In-Context Writing on OHMD During Travels
☆12 · Dec 19, 2024 · Updated last year
gyxxyg / VTG-LLM
[AAAI 2025] VTG-LLM: Integrating Timestamp Knowledge into Video LLMs for Enhanced Video Temporal Grounding
☆130 · Dec 10, 2024 · Updated last year
text-video-edit / shape-aware-text-driven-layered-video-editing-release
☆17 · Sep 25, 2023 · Updated 2 years ago
linzhiqiu / visual_gpt_score
VisualGPTScore for visio-linguistic reasoning
☆27 · Oct 7, 2023 · Updated 2 years ago
facebookresearch / EgoObjects
[ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
☆84 · Oct 6, 2023 · Updated 2 years ago
anonymous0769 / DreamVideo
☆17 · Jul 30, 2024 · Updated last year
bfshi / scaling_on_scales
When do we not need larger vision models?
☆420 · Feb 8, 2025 · Updated last year
cvlab-columbia / DoubleRight
☆27 · Jan 25, 2024 · Updated 2 years ago
showlab / Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
☆5,722 · Jun 16, 2026 · Updated last month
Ziyang412 / UCoFiA
Pytorch Code for "Unified Coarse-to-Fine Alignment for Video-Text Retrieval" (ICCV 2023)
☆66 · Jun 7, 2024 · Updated 2 years ago
OpenGVLab / EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
☆85 · Aug 26, 2025 · Updated 10 months ago