rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
☆128, updated last year
Alternatives and similar repositories for piper-phonemize
Users who are interested in piper-phonemize are comparing it to the libraries listed below.
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. (☆246, updated last year)
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning (☆160, updated 11 months ago)
- Open models for Coqui STT (☆141, updated 2 years ago)
- A ggml (C++) re-implementation of tortoise-tts (☆188, updated 10 months ago)
- On-device voice activity detection (VAD) powered by deep learning (☆219, updated this week)
- openvino version of openai/whisper (☆168, updated last year)
- Running the F5-TTS by ONNX Runtime (☆161, updated last week)
- A tokenizer, text cleaner, and phonemizer for many human languages. (☆318, updated 7 months ago)
- Desktop application for neural speech synthesis written in C++ (☆215, updated 2 years ago)
- Faster Tortoise inference than Tortoise Fast Fork (☆128, updated last year)
- Official Implementation of StyleTTS (☆439, updated 6 months ago)
- Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation (☆254, updated last year)
- ONNX Inference of Pyannote Segmentation (☆92, updated 6 months ago)
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion (☆184, updated 9 months ago)
- [WIP] VoiceSmith makes training text to speech models easy. (☆225, updated 2 years ago)
- ☆272, updated last year
- ☆239, updated 3 weeks ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… (☆42, updated 10 months ago)
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… (☆100, updated 9 months ago)
- A general purpose model trainer, as flexible as it gets (☆220, updated last year)
- Application of MB-iSTFT-VITS components to vits2_pytorch (☆128, updated 7 months ago)
- On-device noise suppression powered by deep learning (☆73, updated 3 weeks ago)
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023. (☆172, updated last year)
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. (☆82, updated 8 months ago)
- Port of Meta's Encodec in C/C++ (☆226, updated 7 months ago)
- VoiceBox neural network implementation (☆108, updated 11 months ago)
- ☆369, updated 10 months ago
- eSpeak NG is an open source speech synthesizer that supports more than a hundred languages and accents. (☆21, updated last year)
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io (☆68, updated last year)
- A TTS model capable of generating ultra-realistic dialogue in one pass. (☆189, updated 2 months ago)