just-ai / speechflow
☆28 Updated 2 months ago
Alternatives and similar repositories for speechflow
Users who are interested in speechflow are comparing it to the libraries listed below.
- Official repository of Wavehax vocoder ☆54 Updated last week
- Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆17 Updated 2 months ago
- ☆125 Updated 11 months ago
- ☆25 Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors ☆53 Updated 11 months ago
- Normalize Text in Russian ☆27 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆101 Updated 9 months ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆50 Updated 3 weeks ago
- A TTS model that makes a speaker speak new languages ☆76 Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆76 Updated 2 years ago
- Unofficial implementation of wavenext vocoder ☆48 Updated 11 months ago
- The VoxTube dataset official repository ☆70 Updated last year
- Putting flows on top of neural transducers for better TTS ☆62 Updated last month
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆63 Updated 2 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆77 Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing ☆70 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Implementation of TTS model based on NVIDIA P-Flow TTS Paper ☆74 Updated last year
- ☆41 Updated 10 months ago
- ☆48 Updated 11 months ago
- An unofficial PyTorch implementation of VALL-E ☆87 Updated this week
- Collection of scripts from mHuBERT-147. ☆29 Updated 8 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆15 Updated 2 months ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS). ☆49 Updated last year
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo… ☆32 Updated 2 years ago
- ☆63 Updated last year
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ☆13 Updated 9 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆99 Updated last year
- Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models ☆57 Updated last month
- ☆80 Updated last year