rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
★ 117 · Updated last year
Alternatives and similar repositories for piper-phonemize:
Users interested in piper-phonemize are comparing it to the libraries listed below:
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. ★ 243 · Updated 10 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ★ 126 · Updated 5 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ★ 95 · Updated 6 months ago
- Open models for Coqui STT ★ 138 · Updated last year
- ONNX Inference of Pyannote Segmentation ★ 86 · Updated 4 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★ 246 · Updated 3 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. ★ 166 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ★ 208 · Updated this week
- Faster Tortoise inference than Tortoise Fast Fork ★ 128 · Updated last year
- [WIP] VoiceSmith makes training text to speech models easy. ★ 224 · Updated 2 years ago
- ★ 96 · Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ★ 39 · Updated 8 months ago
- Desktop application for neural speech synthesis written in C++ ★ 215 · Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input ★ 29 · Updated 7 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io ★ 67 · Updated last year
- Repository for the paper: VoiceMe: Personalized voice generation in TTS ★ 126 · Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ★ 62 · Updated last month
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ★ 148 · Updated last year
- ★ 359 · Updated 8 months ago
- VoiceBox neural network implementation ★ 106 · Updated 9 months ago
- Official Implementation of StyleTTS ★ 431 · Updated 3 months ago
- On-device noise suppression powered by deep learning ★ 69 · Updated 2 weeks ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ★ 119 · Updated 2 years ago
- Train the next generation of TTS systems. ★ 165 · Updated 7 months ago
- A ggml (C++) re-implementation of tortoise-tts ★ 178 · Updated 8 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ★ 310 · Updated 5 months ago
- Unofficial implementation of NVIDIA P-Flow TTS paper ★ 222 · Updated 4 months ago
- ONNX wrapper for espnet inference model ★ 162 · Updated 6 months ago
- On-device speaker diarization powered by deep learning ★ 44 · Updated last month
- zero-shot realtime TTS system, fully offline, free and open source ★ 34 · Updated 2 weeks ago