JiuTian-VL/LION-FS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JiuTian-VL/LION-FS)

JiuTian-VL / LION-FS

[CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant

☆27

Alternatives and similar repositories for LION-FS

Users that are interested in LION-FS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MCG-NJU / StreamForest
View on GitHub
[NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
☆144Nov 4, 2025Updated 4 months ago
xinding-sys / StreamMind
View on GitHub
[ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
☆63Jun 25, 2025Updated 9 months ago
SaraGhazanfari / CoF
View on GitHub
Chain-of-Frames [CVPR 2026]
☆38Jul 2, 2025Updated 8 months ago
aurooj / SHG-VQA
View on GitHub
Learning Situation Hyper-Graphs for Video Question Answering
☆22Feb 16, 2024Updated 2 years ago
lwpyh / CoS_codes
View on GitHub
CoS: Chain-of-Shot Prompting for Long Video Understanding
☆53Feb 13, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
iankur / vqllm
View on GitHub
Residual vector quantization for KV cache compression in large language model
☆12Oct 22, 2024Updated last year
TencentYoutuResearch / HighlightDetection-CLC
View on GitHub
Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"
☆18Mar 21, 2023Updated 3 years ago
pabloruizponce / Interact2Ar
View on GitHub
[CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".
☆16Feb 23, 2026Updated last month
gccnlp / Light-PEFT
View on GitHub
[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
RobinWitch / DyStream
View on GitHub
☆27Feb 7, 2026Updated last month
hiteshhedwig / yolov8-reid-tracking-demo
View on GitHub
☆19Mar 9, 2023Updated 3 years ago
JiuTian-VL / MoME
View on GitHub
[NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
☆81Dec 27, 2025Updated 3 months ago
fast3r-3d / fast3r-3d.github.io
View on GitHub
☆11Mar 4, 2025Updated last year
JiuTian-VL / CogVLA
View on GitHub
[NeurIPS 2025] CogVLA: Cognition-Aligned Vision-Language-Action Models via Instruction-Driven Routing & Sparsification
☆141Dec 10, 2025Updated 3 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
ychensu / LRANet-PP
View on GitHub
[IEEE TPAMI] LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting
☆32Dec 2, 2025Updated 3 months ago
Zhenhao-Zhang / OpenHOI
View on GitHub
☆45Jan 29, 2026Updated last month
CXU-TRI / FAIL-Detect
View on GitHub
Code for RSS 2025 paper "Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning …
☆38Jun 18, 2025Updated 9 months ago
HumanMLLM / ViSpeak
View on GitHub
(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"
☆47Jul 1, 2025Updated 8 months ago
alibaba / Deep-Vision
View on GitHub
☆37Apr 7, 2022Updated 3 years ago
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 2 months ago
junha1125 / Vision-Language-Model-in-ECCV-2024
View on GitHub
☆17Oct 1, 2024Updated last year
JiuTian-VL / Optimus-2
View on GitHub
[CVPR 2025] Official Implementation for Optimus-2: Multimodal Minecraft Agent with Goal-Observation-Action Conditioned Policy
☆24Jun 17, 2025Updated 9 months ago
nico0704 / GS_ICP_SLAM
View on GitHub
[ECCV 2024] RGBD GS-ICP SLAM
☆14Nov 5, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
Tacxels / MCTac-2.0
View on GitHub
Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.
☆16Feb 14, 2024Updated 2 years ago
zoezheng126 / Spatio-Temporal-LLM
View on GitHub
☆18Aug 7, 2025Updated 7 months ago
fansunqi / AKeyS
View on GitHub
Agentic Keyframe Search for Video Question Answering
☆16Apr 7, 2025Updated 11 months ago
humanx-interaction / Human-X-Interaction
View on GitHub
official code repository for papar: "Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthes…
☆20Jul 29, 2025Updated 7 months ago
sisuolv / 2022-China-Collegiate-Computing-Contest-WeChat-Big-Data-Challenge--12rd
View on GitHub
https://algo.weixin.qq.com/
☆14Mar 7, 2023Updated 3 years ago
CognitiveAISystems / Dynamic-Neural-Potential-Field
View on GitHub
Approach where the repulsive potential in an MPC pipeline is estimated by a neural model.
☆24Mar 5, 2026Updated 3 weeks ago
epic-kitchens / C1-Action-Recognition
View on GitHub
Evaluation metrics and submission file creation scripts the Action Recognition challenge
☆15Feb 9, 2026Updated last month
hyungjin-chung / VPS
View on GitHub
☆14Sep 11, 2025Updated 6 months ago
JiuTian-VL / SimpAgent
View on GitHub
[ICCV 2025 Highlight] Less is More: Empowering GUI Agent with Context-Aware Simplification
☆44Mar 12, 2026Updated 2 weeks ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zihuixue / ProgCaptioner
View on GitHub
Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)
☆21Jul 16, 2025Updated 8 months ago
sisuolv / CVPR--Sorghum--100-Cultivar-Identification--FGVC-9--3rd
View on GitHub
https://www.kaggle.com/competitions/sorghum-id-fgvc-9
☆19Mar 1, 2023Updated 3 years ago
dipika-singhania / ICC-Semi-Supervised-TAS
View on GitHub
Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation
☆11Jul 24, 2023Updated 2 years ago
mufeiyu-ayu / VJade
View on GitHub
🚀 基于 Vue3、TypeScript、Vite 的企业级中后台快速开发框架，采用模块化设计，内置丰富的业务组件。
☆27Oct 16, 2025Updated 5 months ago
snap-research / VIMI
View on GitHub
☆13Jul 10, 2024Updated last year
callsys / DynRefer
View on GitHub
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
☆59Mar 4, 2025Updated last year
minjoong507 / Consistency-of-Video-LLM
View on GitHub
[CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"
☆16Oct 13, 2025Updated 5 months ago