deep-spin/Infinite-Video

∞-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation

☆21

Alternatives and similar repositories for Infinite-Video

Users who are interested in Infinite-Video are comparing it to the libraries listed below.

zzhhfut / CCNet-AAAI2025
This repository contains code for AAAI2025 paper "Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal …
☆24 · Aug 18, 2025 · Updated 11 months ago
Araachie / yoda
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation. In AAAI, 2024.
☆12 · Feb 17, 2025 · Updated last year
akhilkedia / TranformersGetStable
[ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"
☆11 · Jul 19, 2024 · Updated 2 years ago
TIGER-AI-Lab / PixelWorld
The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]
☆15 · Sep 12, 2025 · Updated 10 months ago
snap-research / VIMI
☆13 · Jul 10, 2024 · Updated 2 years ago
64327069 / LVAgent
Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents
☆39 · Nov 24, 2025 · Updated 7 months ago
YaNgZhAnG-V5 / attention_regulation
[ECCV24] Attention Regulation on T2I Diffusion Models
☆19 · Jul 8, 2024 · Updated 2 years ago
TencentARC / Video-Holmes
[ECCV 2026] Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?
☆94 · Jul 13, 2025 · Updated last year
asuprem / ODIN
☆11 · Sep 1, 2020 · Updated 5 years ago
traveler-framework / TraveLER
[EMNLP 2024] TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering
☆18 · Oct 31, 2024 · Updated last year
deep-spin / uncertainties_MT_eval
Code and data for the paper "Disentangling Uncertainty in Machine Translation Evaluation", accepted at EMNLP 2022.
☆23 · Jun 23, 2023 · Updated 3 years ago
deep-spin / sparse-marginalization-lvm
Official PyTorch (Lightning) implementation of the NeurIPS 2020 paper "Efficient Marginalization of Discrete and Structured Latent Variab…
☆27 · May 3, 2021 · Updated 5 years ago
lih627 / MLMSNet
Lightweight Multi-Level Multi-Scale Feature Fusion Network for Semantic Segmentation
☆11 · May 31, 2021 · Updated 5 years ago
NVlabs / FRAG
☆15 · Apr 25, 2025 · Updated last year
edward3862 / Analogist
Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)
☆38 · Sep 10, 2024 · Updated last year
tianyi-lab / DisCL
[ICCV 2025] Diffusion Curriculum (DisCL)
☆18 · Sep 26, 2025 · Updated 9 months ago
riccizz / HRF
☆18 · May 13, 2025 · Updated last year
look4u-ok / video-slicer
☆18 · Jun 18, 2024 · Updated 2 years ago
deep-spin / tower-eval
☆29 · Nov 14, 2025 · Updated 8 months ago
LijunZhang01 / Octopus
☆33 · Apr 18, 2025 · Updated last year
rezashkv / diffusion_pruning
[ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.
☆15 · Feb 1, 2025 · Updated last year
marinero4972 / CyberV
☆20 · Jun 10, 2025 · Updated last year
fansunqi / VideoTool
Official Repository for NeurIPS'25 Paper "Tool-Augmented Spatiotemporal Reasoning for Streamlining Video Question Answering Task"
☆23 · May 18, 2026 · Updated 2 months ago
MGitHubL / TMac
☆14 · Feb 26, 2024 · Updated 2 years ago
daeunni / Video-Skill-CoT
Code for "Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning [EMNLP 2025 Findings]"
☆18 · Aug 27, 2025 · Updated 10 months ago
path2generalist / General-Level
On Path to Multimodal Generalist: General-Level and General-Bench
☆21 · Jul 11, 2025 · Updated last year
snap-research / AVLink
AV-Link: Temporally-Aligned Diffusion Features for Cross-Modal Audio-Video Generation
☆17 · Aug 3, 2025 · Updated 11 months ago
Hongcheng-Gao / HAVEN
Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".
☆25 · Oct 22, 2025 · Updated 8 months ago
lfedgeai / eda
Data on-Prem, Code on-the-Fly
☆15 · Nov 22, 2025 · Updated 7 months ago
thejonaslab / vonmises-icml-2023
☆10 · Jun 24, 2023 · Updated 3 years ago
schowdhury671 / meerkat
☆35 · Jul 9, 2025 · Updated last year
lfedgeai / yomo
🦖 Stateful Serverless Framework for Edge AI Infra
☆15 · Sep 3, 2025 · Updated 10 months ago
Tony-Lowe / RotationDrag
☆35 · Jan 23, 2024 · Updated 2 years ago
AVoCaDO-Captioner / AVoCaDO
https://avocado-captioner.github.io/
☆37 · Oct 16, 2025 · Updated 9 months ago
INFINIQ-AI1 / CLIPVQDiffusion
official implementation of "CLIP-VQDiffusion : Langauge Free Training of Text To Image generation using CLIP and vector quantized diffusi…
☆19 · Sep 5, 2024 · Updated last year
Jialuo-Li / DIG
[CVPR 2026] Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding
☆21 · Feb 21, 2026 · Updated 4 months ago
Qichuzyy / POA
Official implementation of ECCV24 paper: POA
☆24 · Aug 8, 2024 · Updated last year
OpenGVLab / VRBench
[ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos
☆28 · Jun 4, 2026 · Updated last month
showlab / TPDiff
TPDiff: Temporal Pyramid Video Diffusion Model
☆25 · Mar 13, 2025 · Updated last year