rhasspy / piper-phonemizeLinks
C++ library for converting text to phonemes for Piper
β138Updated 6 months ago
Alternatives and similar repositories for piper-phonemize
Users that are interested in piper-phonemize are comparing it to the libraries listed below
Sorting:
- Open models for Coqui STTβ150Updated 2 years ago
- π Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. π§π₯π Advanced audio processing.β258Updated last year
- ONNX Inference of Pyannote Segmentationβ97Updated last year
- Faster Tortoise inference then Tortoise Fast Forkβ127Updated last year
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ161Updated last year
- A ggml (C++) re-implementation of tortoise-ttsβ193Updated last year
- On-device noise suppression powered by deep learningβ81Updated last week
- On-device voice activity detection (VAD) powered by deep learningβ243Updated last week
- Port of Meta's Encodec in C/C++β227Updated last year
- Desktop application for neural speech synthesis written in C++β212Updated last week
- πΈ - A general purpose model trainer, as flexible as it getsβ233Updated last year
- [WIP] VoiceSmith makes training text to speech models easy.β228Updated 3 years ago
- openvino version of openai/whisperβ180Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-aiβ129Updated 2 years ago
- IPA Phonemizer/Dephonemizer for 140 human languagesβ52Updated 3 weeks ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β332Updated last year
- Fine Tune the Style-TTS2 Voice Modelβ266Updated 7 months ago
- Official Implementation of StyleTTSβ460Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPβ¦β104Updated last year
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobileβ¦β43Updated last year
- Running the F5-TTS by ONNX Runtimeβ191Updated 3 weeks ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionβ187Updated last year
- Application of MB-iSTFT-VITS components to vits2_pytorchβ131Updated last month
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.β179Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ154Updated last year
- A curated list of awesome voice activity detectionβ71Updated last year
- Experiments to test different speech recognition systems for SEPIA Frameworkβ63Updated 2 years ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversionβ258Updated 2 years ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ260Updated 2 months ago
- β258Updated last year