showlab / livecc
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
☆418 Updated 3 months ago
Alternatives and similar repositories for livecc
Users who are interested in livecc are comparing it to the repositories listed below.
- The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing" ☆578 Updated 2 weeks ago
- 🧠 VideoMind: A Chain-of-LoRA Agent for Temporal-Grounded Video Reasoning (ICLR 2026) ☆305 Updated 2 weeks ago
- Ming - facilitating advanced multimodal understanding and generation capabilities built upon the Ling LLM. ☆580 Updated 3 months ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆301 Updated 4 months ago
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance" ☆131 Updated last year
- ☆370 Updated 10 months ago
- [ICML 2025] Official PyTorch implementation of LongVU ☆421 Updated 9 months ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planning ☆288 Updated 10 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆348 Updated 3 weeks ago