alphacep / awesome-speech
Resources that make every language unique
☆24 Updated last week
Alternatives and similar repositories for awesome-speech
Users who are interested in awesome-speech are comparing it to the libraries listed below.
- StyleTTS2 + Vocos as a Decoder ☆13 Updated 6 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data ☆12 Updated 6 months ago
- StyleTTS 2 Optimized Training Fork ☆33 Updated 8 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model ☆35 Updated 5 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning ☆17 Updated last year
- High-quality text-to-speech based on StyleTTS 2. ☆65 Updated last week
- Text-to-Speech Latency Benchmark ☆18 Updated 3 months ago
- PyTorch model for contextless phoneme prediction from speech audio ☆29 Updated last month
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS ☆18 Updated last month
- A package for NeuCodec: a 50 Hz, 0.8 kbps, 24 kHz audio codec. ☆99 Updated last week
- Using OpenVINO to speed up MeloTTS inference ☆13 Updated 11 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA ☆15 Updated last year
- ☆47 Updated last week
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated last year
- IPA Phonemizer/Dephonemizer for 139 human languages ☆42 Updated 2 weeks ago
- ☆15 Updated 3 months ago
- VITS Inference using ONNX Runtime on C++ ☆13 Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆17 Updated 2 months ago
- ☆16 Updated 5 months ago
- Colab notebooks for Next-gen Kaldi ☆29 Updated last week
- Voice activity detection and speaker gender segmentation audiovisual corpus ☆16 Updated 8 months ago
- ☆10 Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation ☆16 Updated last year
- Python implementation of a few speech intelligibility prediction algorithms ☆14 Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline ☆13 Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model ☆63 Updated 2 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations ☆20 Updated 3 weeks ago
- ☆14 Updated last year
- ☆29 Updated 8 months ago
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming ☆19 Updated 2 years ago