OpenMOSS/MOSS-VL



MOSS-VL is the core multimodal model series within the OpenMOSS ecosystem, dedicated to visual understanding.

☆398

Alternatives and similar repositories for MOSS-VL

Users interested in MOSS-VL often compare it to the repositories listed below.


OpenMOSS / MOSS-Video-Preview
A real-time video understanding foundation model with gated cross-attention. Offline & real-time inference.
☆163 · Jul 16, 2026 · Updated last week
fnlp-vision / UnifiedVisual
Official repository for the EMNLP 2025 paper “UnifiedVisual: A Framework for Constructing Unified Vision-Language Datasets”.
☆16 · Sep 19, 2025 · Updated 10 months ago
fnlp-vision / DPA
[EMNLP Findings'25] Official PyTorch Implementation of Decoupled Proxy Alignment: Mitigating Language Prior Conflict for Multimodal Align…
☆16 · Sep 19, 2025 · Updated 10 months ago
Linxi000 / MEDS
☆142 · Jun 24, 2026 · Updated last month
EmbodiedForge / Inspire-cli
A tool for better use of the Inspire platform (Beta: the Codeberg version is more up to date)
☆28 · Apr 2, 2026 · Updated 3 months ago
xinghaow99 / prism
[ICML 2026] Prism: Spectral-Aware Block-Sparse Attention
☆27 · May 22, 2026 · Updated 2 months ago
OpenMOSS / claude-codex-handoff
Drop-in async file-based handoff protocol for two AI coding agents (Claude Code + Codex), installed as one shared .handoff/ in your proje…
☆30 · Jul 4, 2026 · Updated 3 weeks ago
OpenMOSS / MOSS-Audio
MOSS-Audio is an open-source foundation model for unified audio understanding, enabling speech, sound, music, captioning, QA, and reasoni…
☆617 · Jun 2, 2026 · Updated last month
tongjingqi / Thinking-with-Video
We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…
☆315 · Jun 21, 2026 · Updated last month
Berdyanskov / CargoDash
A Python library for building simple, modular, multifunctional, and efficient large model training data synthesis/augmentation pipelines.
☆34 · May 29, 2026 · Updated last month
sii-research / OpenMOSS
OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.
☆30 · Updated this week
OpenMOSS / MOVA
MOVA: Towards Scalable and Synchronized Video–Audio Generation
☆1,083 · Jun 18, 2026 · Updated last month
JingYiJun / awesome-inspire
An awesome list for the Inspire platform (启智平台)
☆37 · Mar 29, 2026 · Updated 3 months ago
OpenMOSS / OurClaw
Institutional OpenClaw Solution. Share One Claw with Others.
☆25 · Mar 30, 2026 · Updated 3 months ago
tongjingqi / AI-Can-Learn-Scientific-Taste
We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervis…
☆425 · Updated this week
Phospheneser / Phospheneser-awesome-academic-template
An open-source personal academic homepage template characterized by its user-friendly design and extensive scalability.
☆37 · Oct 6, 2025 · Updated 9 months ago
OpenMOSS / FutureOmni
☆26 · Jan 22, 2026 · Updated 6 months ago
OpenMOSS / MOSS-Speech
MOSS-Speech is a true speech-to-speech large language model without text guidance.
☆138 · Feb 13, 2026 · Updated 5 months ago
tongjingqi / Awesome-Agent-RL
A curated list of awesome resources about reward construction for AI agents. This repository covers cutting-edge research, and practical …
☆60 · Sep 1, 2025 · Updated 10 months ago
xinghaow99 / pbs-attn
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31 · May 22, 2026 · Updated 2 months ago
OpenMOSS / MOSS-Audio-Tokenizer
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248 · Jun 16, 2026 · Updated last month
OpenMOSS / MOSS-TTS
MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fi…
☆3,901 · Jun 22, 2026 · Updated last month
OpenMOSS / VehicleWorld
VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …
☆24 · Sep 16, 2025 · Updated 10 months ago
JingYiJun / qz_ssh_starter
☆21 · Mar 2, 2026 · Updated 4 months ago
tianyilt / qzcli_tool
Task management CLI for the Inspire platform (启智平台): resource queries, job submission, log viewing, and MCP/agent workflows
☆109 · Jul 17, 2026 · Updated last week
OpenMOSS / BandPO
Official implementation of BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning.…
☆49 · Apr 8, 2026 · Updated 3 months ago
netokeep / netokeep
Create an SSH and TCP proxy to your company container.
☆29 · Jun 10, 2026 · Updated last month
OpenMOSS / MOSS-TTSD
MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…
☆1,362 · Mar 23, 2026 · Updated 4 months ago
Ruiqi-Yan / Awesome-Audio-Editing
A curated list of models, benchmarks, tools and guides for audio editing
☆34 · Jul 7, 2026 · Updated 2 weeks ago
Jihuai-wpy / InferAligner
Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.
☆38 · Oct 2, 2024 · Updated last year
EnigmaYYYY / SocialClaw
SocialClaw is a screen-aware social copilot that watches live chat windows, builds personalized memory and profile context, and suggests …
☆40 · Apr 9, 2026 · Updated 3 months ago
yxzwang / FamilyTool
FamilyTool benchmark
☆14 · Sep 10, 2025 · Updated 10 months ago
OpenMOSS / Awesome-WAM
A curated, continuously updated reading list, paper blogs, and resources for World Action Models (WAMs) in embodied AI.
☆1,170 · Updated this week
OpenMOSS / Thus-Spake-Long-Context-LLM
A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation
☆62 · Mar 31, 2025 · Updated last year
OpenMOSS / MOSS-TTS-Nano
MOSS-TTS-Nano is an open-source multilingual tiny speech generation model from MOSI.AI and the OpenMOSS team. With only 0.1B parameters, …
☆4,001 · Jul 14, 2026 · Updated last week
OmegaCombinator / suzumio
A Docker-first, non-preemptive multi-agent coordination runtime
☆16 · Jul 9, 2026 · Updated 2 weeks ago
OpenMOSS / Lorsa
☆30 · Nov 9, 2025 · Updated 8 months ago
OpenMOSS / FRoM-W1
[arXiv 26] FRoM-W1: Towards General Humanoid Whole-Body Control with Language Instructions
☆183 · Jun 5, 2026 · Updated last month
OpenMOSS / rope_pp
[ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs
☆33 · Dec 9, 2025 · Updated 7 months ago