alphacep / awesome-speechLinks
Resources that make every language unique
☆12Updated 6 months ago
Alternatives and similar repositories for awesome-speech
Users that are interested in awesome-speech are comparing it to the libraries listed below
Sorting:
- StyleTTS2 + Vocos as a Decoder☆12Updated 2 months ago
- ☆25Updated last week
- ☆13Updated 9 months ago
- StyleTTS 2 Optimized Training Fork☆29Updated 4 months ago
- ☆14Updated last month
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆24Updated last year
- Text-to-Speech Latency Benchmark☆12Updated 9 months ago
- Collection of scripts from mHuBERT-147.☆25Updated 6 months ago
- ☆12Updated 4 months ago
- Simple audio AE☆12Updated 6 months ago
- Viterbi decoding in PyTorch☆34Updated 3 weeks ago
- High quality text-to-speech based on StyleTTS 2.☆48Updated last week
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated 3 weeks ago
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 7 months ago
- An extension of PHOIBLE that includes features for allophones.☆10Updated 2 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆24Updated last month
- ☆11Updated last year
- ☆26Updated 4 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated 2 years ago
- A collection of all our phonemeizers for dataset construction and inference☆23Updated 3 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆17Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆23Updated 2 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆19Updated 4 months ago
- Llasa Speed Up☆33Updated last week
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆21Updated last year
- Russian phonetical transcription☆10Updated last year