FireRedTeam / FireRedTTSLinks
An Open-Sourced LLM-empowered Foundation TTS System
☆892Updated 3 months ago
Alternatives and similar repositories for FireRedTTS
Users that are interested in FireRedTTS are comparing it to the libraries listed below
Sorting:
- ☆1,502Updated last year
- Added vLLM support to IndexTTS for faster inference.☆987Updated 2 months ago
- 基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。☆574Updated 7 months ago
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆1,069Updated last month
- ☆375Updated last year
- 使用vllm加速cosyvoice2的推理☆465Updated 8 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆756Updated last month
- ☆297Updated last year
- A fundamental toolkit designed for music, song, and audio generation☆1,276Updated 7 months ago
- 🍦 Speech-AI-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.☆1,372Updated 3 months ago
- GPT-SoVITS2☆229Updated last year
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆842Updated 3 weeks ago
- ☆473Updated 7 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆689Updated last month
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆699Updated last year
- CosyVoice2 功能扩充(预训练音色推理/3s极速复刻/自然语言控制/自动识别/音色模型保存/API)☆184Updated 9 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆631Updated this week
- ☆338Updated 8 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆526Updated 2 years ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,284Updated 3 months ago
- GPT-4o-level, real-time spoken dialogue system.☆363Updated 11 months ago
- ☆482Updated 8 months ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching☆1,217Updated 3 weeks ago
- 一个用于CosyVoice的api接口项目☆326Updated 4 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Updated last year
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆208Updated last year
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆459Updated last year
- [IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆637Updated last year
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,706Updated 2 weeks ago
- OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.☆466Updated last month