guest271314 / SSMLParser
Implement SSML parsing for Web Speech API
☆38Updated 4 years ago
Alternatives and similar repositories for SSMLParser
Users who are interested in SSMLParser are comparing it to the libraries listed below.
- Putting flows on top of neural transducers for better TTS☆64Updated this week
- Labeled data for homograph disambiguation☆60Updated 2 years ago
- An even smaller speech recognizer / force aligner☆36Updated 9 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- ☆43Updated last year
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)☆15Updated 7 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆132Updated 2 years ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆102Updated last year
- A simple voice conversion tool☆19Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Heteronym to Phoneme Parser☆18Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆123Updated 10 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆176Updated this week
- PnG BERT: Augmented BERT on Phonemes and Graphemes for Neural TTS☆24Updated 3 years ago
- Accelerate Whisper tasks such as transcription, by multiprocessing through parallelization☆25Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- a lightweight voice conversion☆85Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Updated 2 years ago
- Convert English text from written expressions into spoken forms☆26Updated 3 years ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 6 months ago
- ☆19Updated 7 months ago
- ☆25Updated last year
- UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)☆12Updated last year
- 4G GPU & 10 Minutes for train☆12Updated 2 years ago
- Open TTS models, built for streaming on the edge☆43Updated 6 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated last year