JiuTian-VL / LION-FS
[CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant
☆26Dec 2, 2025Updated 2 months ago
Alternatives and similar repositories for LION-FS
Users that are interested in LION-FS are comparing it to the libraries listed below
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- [NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory☆148Nov 4, 2025Updated 3 months ago
- Code for RSS 2025 paper "Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning …☆31Jun 18, 2025Updated 7 months ago
- [ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition☆59Jun 25, 2025Updated 7 months ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆18Mar 21, 2023Updated 2 years ago
- Chain-of-Frames Reasoning Traces☆38Jul 2, 2025Updated 7 months ago
- Learning Situation Hyper-Graphs for Video Question Answering☆22Feb 16, 2024Updated last year
- Searches Google Scholar for each keyword in a list and batch-fetches the first matching BibTeX entry.☆34Dec 8, 2025Updated 2 months ago
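- A completely free VPN. A personally tested, working way to bypass internet censorship that supports Windows, macOS, Linux, iOS, and Android, and provides browser extensions for Chrome, Firefox, Opera, and others.☆10Jul 17, 2018Updated 7 years ago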
- Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact…☆42Feb 5, 2025Updated last year
- [ICCV 2025 Oral] Official implementation of Learning Streaming Video Representation via Multitask Training.☆82Dec 24, 2025Updated last month
- A benchmark of Python Library Migration☆14Apr 5, 2025Updated 10 months ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆19Jan 6, 2026Updated last month
- Implementation for "StyleGAN-Canvas: Augmenting StyleGAN3 for Real-Time Human-AI Co-Creation"☆11May 24, 2023Updated 2 years ago
- Our repo contains an efficient RGB-D feature extractor for category-level and instance-level 6D pose estimation.☆14Oct 29, 2025Updated 3 months ago
- [ICCV 2025] Official code for Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation☆56Sep 12, 2025Updated 5 months ago
- ☆99Dec 4, 2025Updated 2 months ago
- [CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".☆54May 25, 2025Updated 8 months ago
- ☆11Dec 24, 2024Updated last year
- ReSemAct: Advancing Fine-Grained Robotic Manipulation via Semantic Structuring and Affordance Refinement☆17Jan 5, 2026Updated last month
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025☆25Nov 28, 2025Updated 2 months ago
- TESGNN: 3D Temporal Equivariant Scene Graph Neural Networks (published at TMLR)☆14Nov 2, 2025Updated 3 months ago
- [ICCV 2025] "Fine-grained Spatiotemporal Grounding on Egocentric Videos"☆22Nov 23, 2025Updated 2 months ago
- Initial commit☆12Aug 14, 2023Updated 2 years ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 2 months ago
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- Agentic Keyframe Search for Video Question Answering☆15Apr 7, 2025Updated 10 months ago
- E-commerce shopping-guide chatbot: natural language understanding (NLU), text error correction, and word sense disambiguation☆12May 5, 2020Updated 5 years ago
- ☆29Nov 15, 2025Updated 2 months ago
- Virtual character locomotion system. See article “Motion Graphs”, Lucas Kovar, 2002☆12Mar 1, 2012Updated 13 years ago
- [2023 CoRL] Leveraging 3D Reconstruction for Mechanical Search on Cluttered Shelves☆11Dec 12, 2024Updated last year
- Improvement for Modular Camera based Tactile Sensor, with integrated circuit, optimized illumination, and biomimetic markers.☆15Feb 14, 2024Updated last year
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- ☆15Sep 11, 2025Updated 5 months ago
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning☆27Nov 17, 2025Updated 2 months ago
- ☆12Nov 4, 2024Updated last year
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- ☆18Jun 4, 2024Updated last year