alphacep / awesome-speechLinks
Resources that make every language unique
☆16Updated 9 months ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- StyleTTS2 + Vocos as a Decoder☆13Updated 5 months ago
- VITS Inference using ONNX Runtime on C++☆13Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last month
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆20Updated 2 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 4 months ago
- Whisper finetuning☆14Updated 4 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆19Updated 4 months ago
- ☆17Updated 4 years ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated last year
- Russian accentuator and IPA transcriber☆14Updated 11 months ago
- High quality text-to-speech based on StyleTTS 2.☆60Updated last week
- Target speaker automatic speech recognition (TS-ASR)☆11Updated last year
- Colab notebooks for Next-gen Kaldi☆28Updated last week
- Forced alignment decoder for Whisper.☆14Updated last year
- Simple audio AE☆12Updated 9 months ago
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆14Updated 5 months ago
- ☆13Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆14Updated last year
- Java Bindings for the C++ library DeepSpeech☆10Updated 5 years ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆30Updated 4 months ago
- ☆29Updated 6 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 10 months ago
- Pybind11 bindings for Kaldi☆14Updated last month
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆61Updated 2 years ago
- ☆14Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 2 years ago
- Text-to-Speech Latency Benchmark☆18Updated 2 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆23Updated 9 months ago