SaraGhazanfari/CoF

SaraGhazanfari / CoF

Chain-of-Frames [CVPR 2026]

☆40

Alternatives and similar repositories for CoF

Users who are interested in CoF are comparing it to the libraries listed below.

SaraGhazanfari / EMMA
EMMA [TMLR 2025]
☆14 · Sep 25, 2025 · Updated 10 months ago
SaraGhazanfari / R-LPIPS
R-LPIPS [ICML W 2023]
☆18 · Nov 14, 2023 · Updated 2 years ago
SaraGhazanfari / SpotEdit
SpotEdit [NeurIPS 2025 W]
☆17 · Sep 24, 2025 · Updated 10 months ago
guikunchen / SDSGG
[NeurIPS'24] Scene Graph Generation with Role-Playing Large Language Models
☆15 · Oct 10, 2025 · Updated 9 months ago
SaraGhazanfari / lipsim
LipSim [ICLR 2024]
☆23 · Mar 19, 2024 · Updated 2 years ago
huiwon-jang / RSP
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆28 · Nov 27, 2024 · Updated last year
Ziyang412 / Video-RTS
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24 · Feb 18, 2026 · Updated 5 months ago
OpenGVLab / VideoChat-R1
[NIPS2025] VideoChat-R1 & R1.5: Enhancing Spatio-Temporal Perception and Reasoning via Reinforcement Fine-Tuning
☆268 · Oct 18, 2025 · Updated 9 months ago
valentyn1boreiko / SVCEs_code
☆13 · Jun 23, 2022 · Updated 4 years ago
zhengrongz / AoTD
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58 · Updated this week
HuiGuanLab / RaTSG
This repository contains the implementation of our NeurIPS'24 paper "Temporal Sentence Grounding with Relevance Feedback in Videos"
☆13 · Aug 22, 2025 · Updated 11 months ago
OpenGVLab / VKnowU
[ECCV 2026] VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
☆16 · Feb 3, 2026 · Updated 5 months ago
davidstutz / robust-generalization-flatness
Implementation of average- and worst-case robust flatness measures for adversarial training.
☆15 · Nov 5, 2021 · Updated 4 years ago
SooLab / EyeWO
[NeurIPS2025] The official PyTorch implementation of the "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆35 · Dec 25, 2025 · Updated 7 months ago
chs20 / fuselip
FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens
☆17 · Sep 8, 2025 · Updated 10 months ago
hshjerry / VideoEspresso
[CVPR 2025 Oral] VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection
☆140 · Jul 28, 2025 · Updated 11 months ago
yafeng19 / T-CORE
[CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…
☆19 · Nov 4, 2025 · Updated 8 months ago
pro-assist / ProAssist
☆20 · Jul 21, 2025 · Updated last year
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 · Aug 27, 2025 · Updated 10 months ago
hy0Y / ST-GT
[CVPR 2024] Official repository of ST_GT
☆10 · Sep 15, 2024 · Updated last year
alibaba / ReWatch-R1
[ICLR 2026] ReWatch-R1: Boosting Complex Video Reasoning in Large Vision-Language Models through Agentic Data Synthesis
☆30 · Mar 27, 2026 · Updated 3 months ago
MCG-NJU / CaReBench
A Fine-grained Benchmark for Video Captioning and Retrieval
☆30 · Jul 16, 2025 · Updated last year
LaVi-Lab / Rethink_CoT_Video
Official code for "Rethinking Chain-of-Thought Reasoning for Videos"
☆21 · Dec 14, 2025 · Updated 7 months ago
sail-sg / Video-Next-Event-Prediction
☆28 · Aug 9, 2025 · Updated 11 months ago
nmndeep / Robust-Segmentation
[ECCV 2024] Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models
☆21 · Jul 17, 2024 · Updated 2 years ago
marinero4972 / Open-o3-Video
[ICML 2026] Official implementation of "Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence"
☆157 · May 1, 2026 · Updated 2 months ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33 · Jan 23, 2025 · Updated last year
daniel-cores / tvbench
TVBench: Redesigning Video-Language Evaluation
☆15 · Jun 9, 2025 · Updated last year
maifoundations / Streamo
Streaming Video Instruction Tuning
☆83 · Feb 25, 2026 · Updated 5 months ago
epic-kitchens / VISOR-VOS
☆13 · Nov 2, 2023 · Updated 2 years ago
locuslab / robust_union
[ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models.
☆25 · Jul 25, 2024 · Updated 2 years ago
araujoalexandre / Lipschitz-SLL-Networks
☆10 · Oct 27, 2023 · Updated 2 years ago
BeyondScene / BeyondScene
[ECCV 2024] BeyondScene: Higher-Resolution Human-Centric Scene Generation With Pretrained Diffusion
☆21 · Jul 2, 2024 · Updated 2 years ago
TencentARC / ARC-Chapter
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
☆44 · Nov 19, 2025 · Updated 8 months ago
Tongji-MIC-Lab / KAGS
[TPAMI'2023] Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
☆11 · Jan 3, 2023 · Updated 3 years ago
KlingAIResearch / VANS
[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆119 · Feb 28, 2026 · Updated 4 months ago
xiaomi-research / time-r1
[NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding
☆95 · Dec 14, 2025 · Updated 7 months ago
kokolerk / TON
[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models
☆58 · Sep 29, 2025 · Updated 9 months ago
apple / ml-streambridge
☆40 · Nov 5, 2025 · Updated 8 months ago