C++ code for Merlin TTS
☆22Oct 19, 2019Updated 6 years ago
Alternatives and similar repositories for merlin-tts
Users that are interested in merlin-tts are comparing it to the libraries listed below.
- Chinese Prosodic Structure Prediction☆10May 18, 2019Updated 7 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- Predict prosody labels for Chinese sentences.☆42Jul 7, 2022Updated 3 years ago
- Chinese prosody prediction model based on random forests and conditional random fields☆28Jul 25, 2024Updated last year
- TTS inference in C++ based on TFlite model☆20Jan 18, 2021Updated 5 years ago
- C++ implementation of end-to-end TTS which combines Tacotron2 and the LPCNet vocoder.☆32Oct 1, 2019Updated 6 years ago
- A Demo of Mandarin/Chinese TTS frontend☆284Apr 18, 2022Updated 4 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆230Aug 17, 2020Updated 5 years ago
- Tacotron + Griffin-Lim synthetic Mandarin voice☆26Jul 6, 2023Updated 2 years ago
- MultiSpeaker Tacotron2 using LifeLong Learning.☆13Sep 27, 2019Updated 6 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Jul 6, 2023Updated 2 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Jan 17, 2018Updated 8 years ago
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago
- Open-source text-to-speech model from KRAFTON trained exclusively on public speech data, with curated datasets and reproducible training …☆69May 21, 2026Updated 3 weeks ago
- ☆17Oct 22, 2020Updated 5 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- Chinese text normalization for speech processing☆732Mar 18, 2023Updated 3 years ago
- Inverse normalization of normalized Chinese text; specifically, it implements the reverse of chinese_text_normalization.☆13Apr 7, 2021Updated 5 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- This repository is a collection of TTS Models in TFLite☆200Feb 12, 2021Updated 5 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Feb 20, 2018Updated 8 years ago
- ☆23Jul 8, 2019Updated 6 years ago
- Simulation of parallel synthesis with LPCNet vocoder☆14May 5, 2020Updated 6 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 7 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- TTS model based on Transformer.☆57Aug 2, 2019Updated 6 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 7 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆49Mar 10, 2021Updated 5 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆36Apr 25, 2025Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 5 years ago
- Text frontend for ESPnet tts recipes☆35Jun 1, 2021Updated 5 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 8 years ago
- mnn tts demo.☆19May 7, 2025Updated last year
- Multiband version of HiFi-GAN☆19Nov 6, 2020Updated 5 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62May 6, 2023Updated 3 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 10 years ago
- ☆40Nov 18, 2025Updated 6 months ago