rhasspy / piper-phonemizeLinks
C++ library for converting text to phonemes for Piper
☆121Updated last year
Alternatives and similar repositories for piper-phonemize
Users that are interested in piper-phonemize are comparing it to the libraries listed below
Sorting:
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆170Updated last year
- Desktop application for neural speech synthesis written in C++☆215Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆218Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 8 months ago
- Implementation of Spear-TTS - multi-speaker text-to-speech attention network, in Pytorch☆271Updated last year
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆124Updated 10 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆245Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆127Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆316Updated 7 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆126Updated 7 months ago
- On-device noise suppression powered by deep learning☆72Updated last week
- Official Implementation of StyleTTS☆435Updated 5 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆256Updated 5 months ago
- ONNX Inference of Pyannote Segmentation☆90Updated 6 months ago
- Official Implementation of StyleTTS-VC☆184Updated 5 months ago
- A ggml (C++) re-implementation of tortoise-tts☆186Updated 10 months ago
- Unofficial implementation of NVIDIA P-Flow TTS paper☆225Updated 6 months ago
- NeMo text processing for ASR and TTS☆342Updated this week
- Provides training, inference and voice conversion recipes for RADTTS and RADTTS++: Flow-based TTS models with Robust Alignment Learning, …☆286Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆115Updated 2 years ago
- VALL-E 2 reproduction☆129Updated 11 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆180Updated 8 months ago
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆253Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆78Updated 7 months ago
- VoiceBox neural network implementation☆109Updated 10 months ago
- Create an LJSpeech structured voice dataset on wave input☆30Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- On-device speaker diarization powered by deep learning☆50Updated last week
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆202Updated 2 years ago