alphacep / awesome-speechLinks
Resources that make every language unique
☆25Updated 3 weeks ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 10 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 10 months ago
- VITS Inference using ONNX Runtime on C++☆13Updated 2 years ago
- Open Source Crimean Tatar Text-to-Speech datasets☆14Updated 11 months ago
- Simple audio AE☆13Updated last year
- Whisper finetuning☆15Updated 10 months ago
- ☆16Updated 9 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Updated 6 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Updated 9 months ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Updated 2 years ago
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Updated last week
- 🎵 muse: Music Separation☆11Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆24Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS model☆63Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Updated 2 months ago
- High quality text-to-speech based on StyleTTS 2.☆71Updated last month
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 8 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆34Updated 2 years ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Updated 4 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Updated last year
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Updated 10 months ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆20Updated 8 months ago
- Text-to-Speech Latency Benchmark☆22Updated 3 weeks ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- Supervoice diffusion enhance☆28Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Updated 4 months ago