alphacep / awesome-speechLinks
Resources that make every language unique
☆25Updated 2 weeks ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- Simple audio AE☆13Updated last year
- Colab notebooks for Next-gen Kaldi☆29Updated 3 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆53Updated 3 weeks ago
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated last year
- Tunable pipelines☆41Updated 4 months ago
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- Open Source Crimean Tatar Text-to-Speech datasets☆14Updated 11 months ago
- Text-to-Speech Latency Benchmark☆22Updated 2 weeks ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 6 months ago
- ☆29Updated 11 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Updated 7 months ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆35Updated 2 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Updated last year
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated 3 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- StyleTTS 2 Optimized Training Fork☆33Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆20Updated last year
- Whisper finetuning☆15Updated 9 months ago
- Universal multilingual automatic speech transcription into IPA☆74Updated 11 months ago
- ☆16Updated 9 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆27Updated 3 weeks ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- VITS Inference using ONNX Runtime on C++☆13Updated 2 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Updated 3 years ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆16Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆22Updated last year