alphacep / awesome-speech
Resources that make every language unique
☆24 Updated this week
Alternatives and similar repositories for awesome-speech
Users who are interested in awesome-speech are comparing it to the libraries listed below
- Transfer learning approach to pronunciation scoring ☆11 Updated last year
- StyleTTS2 + Vocos as a Decoder ☆13 Updated 9 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆13 Updated 8 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Updated 4 months ago
- Colab notebooks for Next-gen Kaldi ☆29 Updated 2 months ago
- ☆17 Updated 4 years ago
- Text-to-Speech Latency Benchmark ☆22 Updated 6 months ago
- PyTorch model for contextless phoneme prediction from speech audio ☆30 Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆14 Updated last month
- ☆16 Updated 8 months ago
- Pybind11 bindings for Kaldi ☆15 Updated 3 months ago
- ☆29 Updated 10 months ago
- ☆57 Updated 2 years ago
- Open Source Crimean Tatar Text-to-Speech datasets ☆14 Updated 10 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices" ☆21 Updated 6 months ago
- IPA Phonemizer/Dephonemizer for 144 human languages ☆49 Updated this week
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆21 Updated 3 months ago
- Phoneme alignment representation compatible with multiple forced aligners ☆21 Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆16 Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ☆62 Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆41 Updated 2 weeks ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆33 Updated 2 years ago
- Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr… ☆28 Updated 2 years ago
- ☆14 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 Updated 7 months ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis… ☆16 Updated 9 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆17 Updated last year
- Universal multilingual automatic speech transcription into IPA ☆72 Updated 9 months ago
- Supervoice diffusion enhance ☆28 Updated last year
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 Updated 7 years ago