xiaomingnio / kanttsView external linksLinks
TTS appalication based on modelscope KAN-TTS
☆41Apr 11, 2024Updated last year
Alternatives and similar repositories for kantts
Users that are interested in kantts are comparing it to the libraries listed below
Sorting:
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆525Dec 28, 2023Updated 2 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago
- Train the next generation of TTS systems.☆171Sep 13, 2024Updated last year
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆415Nov 20, 2025Updated 2 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 2 months ago
- Copied from official repo of VITS. Added some comments.☆19Sep 24, 2024Updated last year
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆49Sep 2, 2025Updated 5 months ago
- Streaming Text to Speech Web UI☆22May 6, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Updated this week
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆88Apr 2, 2024Updated last year
- singing voice conversion without f0☆23May 10, 2023Updated 2 years ago
- a lightweight voice conversion☆86Sep 2, 2024Updated last year
- ☆23Oct 17, 2024Updated last year
- Text Normalization & Inverse Text Normalization☆726Feb 3, 2026Updated last week
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆24Sep 12, 2024Updated last year
- 44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。☆24Sep 1, 2023Updated 2 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- ☆29Feb 4, 2025Updated last year
- freeswitch百度语音识别模块☆25Feb 16, 2021Updated 4 years ago
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆55Jan 17, 2024Updated 2 years ago
- Real-time Hand Shape and Motion Capture with RGB Camera☆29Jul 22, 2021Updated 4 years ago
- ☆25Mar 6, 2024Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Apr 4, 2024Updated last year
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音 更生动且富有节奏☆277Sep 10, 2023Updated 2 years ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- ICASSP 2022☆61Oct 12, 2021Updated 4 years ago
- A one-page WebUI integrating VITS inference, training, and output in Sherpa-Onnx format.☆12Feb 2, 2025Updated last year
- Identify speakers with stable voice timbre.☆32Jun 20, 2024Updated last year
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆40Jun 12, 2025Updated 8 months ago
- Unoffical implementation of Megatts2☆288Mar 23, 2024Updated last year
- ☆204Sep 24, 2024Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32May 10, 2023Updated 2 years ago
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆145Jan 1, 2025Updated last year