MOSS‑TTS Family is an open‑source speech and sound generation model family from MOSI.AI and the OpenMOSS team. It is designed for high‑fidelity, high‑expressiveness, and complex real‑world scenarios, covering stable long‑form speech, multi‑speaker dialogue, voice/character design, environmental sound effects, and real‑time streaming TTS.
☆1,872May 26, 2026Updated this week
Alternatives and similar repositories for MOSS-TTS
Users that are interested in MOSS-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- OpenEAI Platform for Embodied Intelligence☆491Mar 5, 2026Updated 2 months ago
- A Claude-Code-like CLI coding agent scaffold (TypeScript + oclif)☆94Feb 26, 2026Updated 3 months ago
- EvoCorps:面向网络舆论去极化的进化式多 Agent 框架,在传播过程中主动 介入,协同降温情绪、对抗极端化、推动理性讨论。☆136Apr 13, 2026Updated last month
- An improved and reproducible implementation of a Silver Medal Kaggle NeurIPS Open Polymer Prediction solution, featuring SMILES canonical…☆369May 1, 2026Updated 3 weeks ago
- FreeFuse: Multi-Subject LoRA Fusion via Adaptive Token-Level Routing at Test Time☆193Mar 17, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- One of the early China-built lightweight Claude Code-inspired terminal coding agents, with autonomous execution, tool calling, custom age…☆54Updated this week
- Openclaw based trading system,core of CyberMolt☆147Feb 26, 2026Updated 3 months ago
- An open-source AI visualization tool that transforms natural language into Mind Maps, Mermaid diagrams, and Echarts. Turn your ideas into…☆905Feb 4, 2026Updated 3 months ago
- Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation☆40Mar 6, 2026Updated 2 months ago
- 🐙 Give your AI a life — open-source agent infrastructure for team collaboration.☆1,174Updated this week
- ☆24Jul 20, 2025Updated 10 months ago
- ☆17Feb 28, 2026Updated 3 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆219May 6, 2026Updated 3 weeks ago
- FreeCite: A Judge-Free Benchmark for Granular Citation Evaluation in Large Language Models☆51Feb 22, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Starfish-FL is an essential component of the STARFISH project. It focuses on federated learning and analysis for the Analysis Mandate fun…☆10May 6, 2026Updated 3 weeks ago
- MOVA: Towards Scalable and Synchronized Video–Audio Generation☆1,017May 6, 2026Updated 3 weeks ago
- 宅在家里的电脑,努力做一个openclaw☆60Mar 26, 2026Updated 2 months ago
- Open-source AI research assistant for biomedicine — chat to run RNA-seq, drug discovery, clinical analysis, and more. Built on Claude Cod…☆647Mar 12, 2026Updated 2 months ago
- ☆41May 15, 2026Updated 2 weeks ago
- This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs☆92Sep 19, 2025Updated 8 months ago
- MOSS-TTSD is a spoken dialogue generation model designed for expressive multi-speaker synthesis. It features long-context modeling, flex…☆1,326Mar 23, 2026Updated 2 months ago
- From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models☆95Feb 27, 2026Updated 3 months ago
- [ICML 2026] Code2Worlds: Empowering Coding LLMs for 4D World Generation☆112May 17, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation☆61Mar 31, 2025Updated last year
- [ICML2026] From Statics to Dynamics: Physics-Aware Image Editing with Latent Transition Priors☆92Apr 30, 2026Updated last month
- ☆11Apr 25, 2026Updated last month
- GraTAG — Production AI Search via Graph-Based Query Decomposition and Triplet-Aligned Generation with Rich Multimodal Representations☆55Updated this week
- ☃企业门户系统,基于Springboot和Thymeleaf🎪,使用layui前后端分离☆18Feb 25, 2026Updated 3 months ago
- We introduce 'Thinking with Video', a new paradigm leveraging video generation for multimodal reasoning. Our VideoThinkBench shows that S…☆301Mar 21, 2026Updated 2 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- [ICLR26] Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs☆33Dec 9, 2025Updated 5 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆95Nov 30, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official repo for paper "SK-Adapter: Skeleton-Based Structural Control for Native 3D Generation".☆54Mar 22, 2026Updated 2 months ago
- ☆88Dec 31, 2025Updated 4 months ago
- [CVPR'26] VecGlypher: Unified Vector Glyph Generation with Language Models☆126Feb 26, 2026Updated 3 months ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆143Sep 19, 2025Updated 8 months ago
- Official Codebase for our CVPR 2026 paper UniSH: Unifying Scene and Human Reconstruction in a Feed-Forward Pass☆145Feb 24, 2026Updated 3 months ago
- 公司知识库助手☆74Mar 15, 2026Updated 2 months ago
- Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion'☆261May 11, 2026Updated 2 weeks ago