MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
☆3,353Jun 11, 2026Updated last week
Alternatives and similar repositories for MOSS-TTS
Users that are interested in MOSS-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenEAI Platform for Embodied Intelligence☆483Mar 5, 2026Updated 3 months ago
- A Claude-Code-like CLI coding agent scaffold (TypeScript + oclif)☆50Feb 26, 2026Updated 3 months ago
- EvoCorps:面向网络舆论去极化的进化式多 Agent 框架,在传播过程中主动 介入,协同降温情绪、对抗极端化、推动理性讨论。☆136Apr 13, 2026Updated 2 months ago
- An improved and reproducible implementation of a Silver Medal Kaggle NeurIPS Open Polymer Prediction solution, featuring SMILES canonical…☆362May 1, 2026Updated last month
- FreeFuse: Multi-Subject LoRA Fusion via Adaptive Token-Level Routing at Test Time☆208Mar 17, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Openclaw based trading system,core of CyberMolt☆144Feb 26, 2026Updated 3 months ago
- One of the early China-built lightweight Claude Code-inspired terminal coding agents, with autonomous execution, tool calling, custom age…☆54May 28, 2026Updated 3 weeks ago
- An open-source AI visualization tool that transforms natural language into Mind Maps, Mermaid diagrams, and Echarts. Turn your ideas into…☆909Jun 1, 2026Updated 2 weeks ago
- Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation☆42Mar 6, 2026Updated 3 months ago
- 🐙 Give your AI a life — open-source agent infrastructure for team collaboration.☆1,147Updated this week
- ☆24Jul 20, 2025Updated 10 months ago
- ☆18Feb 28, 2026Updated 3 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆233Updated this week
- FreeCite: A Judge-Free Benchmark for Granular Citation Evaluation in Large Language Models☆51Feb 22, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MOVA: Towards Scalable and Synchronized Video–Audio Generation☆1,044Updated this week
- 让宅在家里的电脑,变成桌面优先的 AI Agent 工作站☆64Jun 3, 2026Updated 2 weeks ago
- Open-source AI research assistant for biomedicine — chat to run RNA-seq, drug discovery, clinical analysis, and more. Built on Claude Cod…☆656Mar 12, 2026Updated 3 months ago
- ☆43May 15, 2026Updated last month
- This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs☆94Sep 19, 2025Updated 9 months ago
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,353Mar 23, 2026Updated 2 months ago
- From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models☆35Feb 27, 2026Updated 3 months ago
- 🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆24Dec 10, 2025Updated 6 months ago
- [ICML 2026] Code2Worlds: Empowering Coding LLMs for 4D World Generation☆115Jun 3, 2026Updated 2 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LongCat Audio Tokenizer and Detokenizer☆302May 9, 2026Updated last month
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Mar 31, 2025Updated last year
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆91Apr 30, 2026Updated last month
- ☆11Apr 25, 2026Updated last month
- MultiModal Audio Generation in Raw Waveform Space.☆153May 26, 2026Updated 3 weeks ago
- ☃企业门户系统,基于Springboot和Thymeleaf🎪,使用layui前后端分离☆18Feb 25, 2026Updated 3 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆306Jun 3, 2026Updated 2 weeks ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆32Dec 9, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆95Nov 30, 2025Updated 6 months ago
- Official repo for paper "SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation".☆56Mar 22, 2026Updated 2 months ago
- [CVPR'26 Highlight] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset☆608Jun 1, 2026Updated 2 weeks ago
- ComfyUI custom nodes for Fish Audio S2-Pro TTS — voice clone, multi-speaker, and text-to-speech☆238Apr 27, 2026Updated last month
- ☆89Dec 31, 2025Updated 5 months ago
- [CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models☆130Feb 26, 2026Updated 3 months ago
- ☆2,054Apr 11, 2026Updated 2 months ago