OpenMOSS/MOSS-Video-Preview

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMOSS/MOSS-Video-Preview)

OpenMOSS / MOSS-Video-Preview

A real-time video understanding foundation model with gated cross-attention. Offline & real-time inference.

☆163

Alternatives and similar repositories for MOSS-Video-Preview

Users that are interested in MOSS-Video-Preview are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fnlp-vision / UnifiedVisual
View on GitHub
Official repository for the EMNLP 2025 paper “UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets”.
☆16Sep 19, 2025Updated 10 months ago
fnlp-vision / DPA
View on GitHub
[EMNLP Findings'25] Official PyTorch Implementation of Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Align…
☆16Sep 19, 2025Updated 10 months ago
OpenMOSS / MOSS-VL
View on GitHub
MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.
☆398Updated this week
Berdyanskov / CargoDash
View on GitHub
A Python library for building simple, modular, multifunctional, and efficient large model training data synthesis/augmentation pipelines.
☆34May 29, 2026Updated last month
xinghaow99 / prism
View on GitHub
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27May 22, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xinghaow99 / pbs-attn
View on GitHub
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31May 22, 2026Updated 2 months ago
ydyhello / Awesome-VLM-Streaming-Video
View on GitHub
📚 A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for str…
☆189Updated this week
OpenMOSS / MOSS-Audio-Tokenizer
View on GitHub
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248Jun 16, 2026Updated last month
OpenMOSS / claude-codex-handoff
View on GitHub
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30Jul 4, 2026Updated 3 weeks ago
SooLab / EyeWO
View on GitHub
[NeurIPS2025] The official PyTorch implementation of the "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆35Dec 25, 2025Updated 7 months ago
Jihuai-wpy / InferAligner
View on GitHub
Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.
☆38Oct 2, 2024Updated last year
OpenMOSS / MOSS-Speech
View on GitHub
MOSS-Speech is a true speech-to-speech large language model without text guidance.
☆138Feb 13, 2026Updated 5 months ago
maifoundations / Streamo
View on GitHub
Streaming Video Instruction Tuning
☆83Feb 25, 2026Updated 5 months ago
JingYiJun / awesome-inspire
View on GitHub
一个面向启智平台（Inspire）的 awesome list
☆37Mar 29, 2026Updated 3 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
OpenMOSS / OurClaw
View on GitHub
Institutional OpenClaw Solution. Share One Claw with Others.
☆25Mar 30, 2026Updated 3 months ago
OpenMOSS / MOVA
View on GitHub
MOVA: Towards Scalable and Synchronized Video–Audio Generation
☆1,083Jun 18, 2026Updated last month
sotayang / Awesome-Streaming-Video-Understanding
View on GitHub
🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖
☆416Jul 2, 2026Updated 3 weeks ago
xinghaow99 / DenoSent
View on GitHub
[AAAI 2024] DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning
☆15Apr 29, 2024Updated 2 years ago
wanglu-cs / Think_While_Watching
View on GitHub
☆19Jun 26, 2026Updated last month
Twilight92z / Quantize-Watermark
View on GitHub
☆19Nov 6, 2023Updated 2 years ago
aiha-lab / InfiniPot-V
View on GitHub
[NeurIPS 25] InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
☆20Jan 25, 2026Updated 6 months ago
EmbodiedForge / Inspire-cli
View on GitHub
A tool for better use of Inspire platform (Beta: Codeberg version is more up-to-date)
☆28Apr 2, 2026Updated 3 months ago
Linxi000 / MEDS
View on GitHub
☆142Jun 24, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
opendatalab / RxnCaption
View on GitHub
[CVPR 2026] SOTA Chemical Reaction Diagram Parsing Framework
☆25Mar 24, 2026Updated 4 months ago
EIT-NLP / Awesome-Streaming-LLMs
View on GitHub
🔥This is a repository of paper list for streaming LLMs/MLLMs.
☆24Apr 19, 2026Updated 3 months ago
sii-research / OpenMOSS
View on GitHub
OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.
☆30Updated this week
OpenMOSS / MOSS-Audio
View on GitHub
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoni…
☆617Jun 2, 2026Updated last month
OpenLMLab / ParallelTokenizer
View on GitHub
Use the tokenizer in parallel to achieve superior acceleration
☆20Mar 21, 2024Updated 2 years ago
OpenMOSS / MOSS-Transcribe-Diarize
View on GitHub
MOSS-Transcribe-Diarize 0.9B is an open-source SOTA end-to-end audio understanding model for long-form multi-speaker transcription, diari…
☆1,238Updated this week
haowei-freesky / HERMES
View on GitHub
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
☆92May 8, 2026Updated 2 months ago
aurateam2026 / AURA
View on GitHub
☆118Jun 5, 2026Updated last month
yellow-binary-tree / ProactiveVideoQA
View on GitHub
ProactiveBench: A Comprehensive Benchmark for VideoLLM Proactive Interaction Evaluation
☆18Jan 8, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yellow-binary-tree / MMDuet2
View on GitHub
[ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning
☆42Jan 14, 2026Updated 6 months ago
netokeep / netokeep
View on GitHub
Create SSH and TCP Proxy to your company container.
☆29Jun 10, 2026Updated last month
JoeLeelyf / OVO-Bench
View on GitHub
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
☆154Jul 24, 2025Updated last year
Becomebright / ReKV
View on GitHub
[ICLR'25] Streaming Video Question-Answering with In-context Video KV-Cache Retrieval
☆122Nov 4, 2025Updated 8 months ago
MCG-NJU / StreamForest
View on GitHub
[NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
☆133Nov 4, 2025Updated 8 months ago
OpenMOSS / GAOKAO-MM
View on GitHub
[ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation
☆82Mar 13, 2024Updated 2 years ago
OpenMOSS / BandPO
View on GitHub
Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…
☆49Apr 8, 2026Updated 3 months ago