rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
☆119Updated last year
Alternatives and similar repositories for piper-phonemize
Users who are interested in piper-phonemize are comparing it to the libraries listed below.
- On-device voice activity detection (VAD) powered by deep learning☆216Updated 3 weeks ago
- On-device noise suppression powered by deep learning☆70Updated 3 weeks ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆126Updated 6 months ago
- Open models for Coqui STT☆138Updated 2 years ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆168Updated last year
- A tokenizer, text cleaner, and phonemizer for many human languages.☆315Updated 6 months ago
- Port of Meta's Encodec in C/C++☆218Updated 5 months ago
- ONNX Inference of Pyannote Segmentation☆90Updated 5 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆40Updated 9 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆244Updated 11 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆251Updated 4 months ago
- Faster Tortoise inference than Tortoise Fast Fork☆126Updated last year
- Desktop application for neural speech synthesis written in C++☆214Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- Running the F5-TTS by ONNX Runtime☆155Updated 2 weeks ago
- ☆229Updated 2 months ago
- A ggml (C++) re-implementation of tortoise-tts☆183Updated 9 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆179Updated 8 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 10 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆39Updated last month
- 🐤 Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillation☆253Updated last year
- VoiceBox neural network implementation☆108Updated 10 months ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆162Updated last year
- Monotonic Alignment Search☆91Updated 2 years ago
- ☆95Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.☆225Updated 2 years ago
- A curated list of awesome voice activity detection☆54Updated 6 months ago