yukw777 / EILEVView external linksLinks
EILeV: Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties
☆131Nov 10, 2024Updated last year
Alternatives and similar repositories for EILEV
Users that are interested in EILEV are comparing it to the libraries listed below
Sorting:
- Supercharged BLIP-2 that can handle videos☆123Dec 1, 2023Updated 2 years ago
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆102Jul 2, 2024Updated last year
- This project breathes life into video characters by using AI to describe their personality and then chat with you as them.☆49Mar 12, 2024Updated last year
- Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.☆81Dec 30, 2023Updated 2 years ago
- Official PyTorch code of GroundVQA (CVPR'24)☆64Sep 13, 2024Updated last year
- Code and Dataset for the CVPRW Paper "Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos"☆29Aug 28, 2023Updated 2 years ago
- An Enhanced CLIP Framework for Learning with Synthetic Captions☆39Apr 18, 2025Updated 9 months ago
- I know Kung Fu☆21Mar 27, 2025Updated 10 months ago
- SVIT: Scaling up Visual Instruction Tuning☆166Jun 20, 2024Updated last year
- Large-scale text-video dataset. 10 million captioned short videos.☆676Aug 14, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVU☆422May 8, 2025Updated 9 months ago
- The champion solution for Ego4D Natural Language Queries Challenge in CVPR 2023☆18Jan 23, 2024Updated 2 years ago
- ☆19Jan 8, 2024Updated 2 years ago
- Visualizing the learned space-time attention using Attention Rollout☆40Apr 1, 2022Updated 3 years ago
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration☆26Oct 17, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 7 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated last month
- ☆11May 17, 2024Updated last year
- [CHI24] AI-Assisted In-Context Writing on OHMD During Travels☆11Dec 19, 2024Updated last year
- [ACL 2023] Code and data for our paper "Measuring Progress in Fine-grained Vision-and-Language Understanding"☆13Jun 11, 2023Updated 2 years ago
- A Next.js v15+ template with Tailwind v3+, featuring Microsoft Entra ID authentication via Next-Auth v5+ and a Microsoft Graph Client int…