neurlang / goruut
IPA Phonemizer/Dephonemizer for 136 human languages
☆19Updated this week
Alternatives and similar repositories for goruut:
Users that are interested in goruut are comparing it to the libraries listed below
- StyleTTS 2 Optimized Training Fork☆26Updated last month
- (WIP) A retrain of F5-TTS on permissively-licensed data☆10Updated 2 weeks ago
- Forced alignment decoder for Whisper.☆14Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆24Updated 11 months ago
- ☆10Updated last month
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 2 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- ☆13Updated 7 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆12Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆42Updated 7 months ago
- ☆11Updated 2 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆20Updated last year
- ☆26Updated last month
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆15Updated 5 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated last week
- ☆17Updated this week
- Open TTS models, built for streaming on the edge☆38Updated last week
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 7 months ago
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆44Updated last week
- Simple PyTorch Denoisers for Waveform Audio☆35Updated last month
- Convert English text from written expressions into spoken forms☆24Updated 2 years ago
- ☆28Updated last year
- ☆25Updated 4 months ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆20Updated last week
- ☆12Updated 2 years ago