alphacep / awesome-speechLinks
Resources that make every language unique
☆24Updated last week
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- IPA Phonemizer/Dephonemizer for 139 human languages☆46Updated last month
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 7 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 8 months ago
- Простой IPA фонемизатор на базе ruaccent-encoder☆24Updated 7 months ago
- StyleTTS 2 Optimized Training Fork☆34Updated 9 months ago
- pytorch model for contexless-phoneme prediction from speech audio☆30Updated last month
- ☆16Updated 7 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆35Updated 7 months ago
- High quality text-to-speech based on StyleTTS 2.☆70Updated 3 weeks ago
- ⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.☆34Updated last year
- ☆50Updated last week
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆22Updated 10 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 2 months ago
- Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆23Updated 2 weeks ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- Simple audio AE☆13Updated last year
- ☆14Updated last year
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- Russian accentuator and IPA transcriber☆16Updated last year
- ☆13Updated 3 months ago
- VoiceBox neural network implementation☆110Updated last year
- 🎵 muse: Music Separation☆10Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ☆92Updated 3 weeks ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆15Updated last year
- VALL-E 2 reproduction☆132Updated last year
- ☆29Updated 9 months ago
- A multilingual text-to-speech synthesis system for ten lower-resourced Turkic languages: Azerbaijani, Bashkir, Kazakh, Kyrgyz, Sakha, Tat…☆75Updated 2 years ago
- On-device speaker diarization powered by deep learning☆57Updated this week
- ☆28Updated 2 years ago