KangsanKim07/VideoICL

[CVPR2025] VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding

☆23

Alternatives and similar repositories for VideoICL

Users that are interested in VideoICL are comparing it to the libraries listed below.

db-Lee / Multi-RM
☆17 · Jul 17, 2026 · Updated last week
Vinoground / Vinoground
☆13 · Apr 13, 2026 · Updated 3 months ago
GaryJiajia / OFv2_ICL_VQA
[CVPR 2024] How to Configure Good In-Context Sequence for Visual Question Answering
☆21 · May 28, 2025 · Updated last year
Nardien / KALA
Official Code Repository for the paper "KALA: Knowledge-Augmented Language Model Adaptation" (NAACL 2022)
☆35 · Oct 17, 2023 · Updated 2 years ago
ys-zong / VL-ICL
[ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning
☆69 · Sep 20, 2025 · Updated 10 months ago
jiasenlu / vit-vqgan-jax
Jax implementation of VIT-VQGAN
☆10 · Jan 25, 2024 · Updated 2 years ago
YannDubs / Mini_Decodable_Information_Bottleneck
Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.
☆12 · Oct 20, 2020 · Updated 5 years ago
TIGER-AI-Lab / VISTA
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20 · Feb 27, 2025 · Updated last year
mlbio-epfl / joint-inference
[ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners
☆22 · Jun 6, 2025 · Updated last year
jessemelpolio / LMM_CL
Codes for: How to Teach Large Multimodal Models New Skills?
☆30 · Oct 10, 2025 · Updated 9 months ago
OpenGVLab / TimeSuite
[ICLR 2025] TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning
☆74 · Apr 7, 2025 · Updated last year
Dozi01 / MetaSPO
☆83 · Oct 1, 2025 · Updated 9 months ago
GuangyanS / Sys2-LLaVA
☆31 · Feb 10, 2025 · Updated last year
NVlabs / FRAG
☆15 · Apr 25, 2025 · Updated last year
KangsanKim07 / MemoryTransferLearning
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
☆31 · Apr 16, 2026 · Updated 3 months ago
longrongyang / STGC
Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model
☆13 · Feb 11, 2025 · Updated last year
WHB139426 / Grounded-Video-LLM
[EMNLP 2025 Findings] Grounded-VideoLLM: Sharpening Fine-grained Temporal Grounding in Video Large Language Models
☆149 · Aug 21, 2025 · Updated 11 months ago
jeffwillette / delta-attention
☆16 · Dec 29, 2025 · Updated 6 months ago
suny-sht / clip-red-circle
Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023
☆12 · Sep 21, 2023 · Updated 2 years ago
Nardien / KARD
Official Code Repository for the paper "Knowledge-Augmented Reasoning Distillation for Small Language Models in Knowledge-intensive Tasks…
☆44 · Nov 24, 2024 · Updated last year
pwnhyo / T-MAP
☆17 · Mar 25, 2026 · Updated 4 months ago
JPShi12 / VideoLoom
[ICML 2026] VideoLoom: A Video Large Language Model for Joint Spatial-Temporal Understanding
☆27 · Jul 3, 2026 · Updated 3 weeks ago
Open-Galapagos / evolution-fine-tuning
Official code, models, and dataset for "Evolution Fine-Tuning (EFT): Learning to Discover Across 371 Optimization Tasks"
☆25 · Jun 30, 2026 · Updated 3 weeks ago
Kami-code / HandsOnVLM-release
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction
☆41 · Sep 15, 2025 · Updated 10 months ago
WPR001 / Ego-ST
☆16 · Sep 25, 2025 · Updated 10 months ago
zertow / TPNet
☆13 · Oct 25, 2024 · Updated last year
gmlwns2000 / sea-attention
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
☆12 · Jun 20, 2025 · Updated last year
ThunderVVV / RCLSTR
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17 · Sep 22, 2023 · Updated 2 years ago
spacetools / SpaceTools
code release
☆38 · Jun 22, 2026 · Updated last month
ustc-hyin / HiMAP
Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference
☆14 · Jun 7, 2025 · Updated last year
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆19 · Mar 10, 2025 · Updated last year
meetdavidwan / crg
PyTorch code for "Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training"
☆39 · Mar 4, 2024 · Updated 2 years ago
Flowerfan / VistaLLaMA
☆15 · Dec 11, 2024 · Updated last year
DeepAuto-AI / hip-attention
Training-free Post-training Efficient Sub-quadratic Complexity Attention. Implemented with OpenAI Triton.
☆153 · Mar 31, 2026 · Updated 3 months ago
MzeroMiko / XDLM
[ICML 2026 Spotlight] Code for miXed Discrete Diffusion Language Model
☆27 · Mar 16, 2026 · Updated 4 months ago
zhoujiahuan1991 / CVPR2025-STOP
☆19 · May 8, 2025 · Updated last year
yongchao98 / PROMST
Automatic prompt optimization framework for multi-step agent tasks.
☆37 · Nov 12, 2024 · Updated last year
TOM-tym / APG
Official PyTorch implementation of our ICCV2023 paper “When Prompt-based Incremental Learning Does Not Meet Strong Pretraining”
☆16 · Jan 8, 2024 · Updated 2 years ago
aashi7 / NearCollision
Project repo for Forecasting Time-to-Collision from Monocular Video: Feasibility, Dataset and Challenges
☆15 · Sep 6, 2021 · Updated 4 years ago