alphacep / awesome-speechLinks
Resources that make every language unique
☆22Updated 10 months ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆14Updated last year
- Whisper finetuning☆14Updated 5 months ago
- StyleTTS2 + Vocos as a Decoder☆13Updated 6 months ago
- ☆13Updated last month
- VITS Inference using ONNX Runtime on C++☆13Updated last year
- Simple audio AE☆12Updated 10 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆12Updated 5 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last month
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Updated last year
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆20Updated 3 months ago
- Colab notebooks for Next-gen Kaldi☆28Updated last month
- ☆17Updated 4 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆27Updated last month
- ☆10Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated 8 months ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆10Updated 6 months ago
- Normalize Text in Russian☆27Updated last year
- Forced alignment decoder for Whisper.☆14Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆42Updated this week
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 11 months ago
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆12Updated last year
- Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆15Updated 6 months ago
- Using OpenVINO to speed up MeloTTS inference☆13Updated 10 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Updated 11 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Ai…☆18Updated 11 months ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆20Updated this week
- Getting confidences from any end-to-end systems☆11Updated 2 years ago
- ☆29Updated 7 months ago
- Transfer learning approach to pronunciation scoring☆10Updated last year