scart97 / thunder-speech
A Hackable speech recognition library.
☆25Updated 4 months ago
Alternatives and similar repositories for thunder-speech:
Users that are interested in thunder-speech are comparing it to the libraries listed below
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- ☆34Updated this week
- Speaker change detection using SincNet and an LSTM/Transformer☆47Updated 8 months ago
- ☆56Updated 2 years ago
- ☆56Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- This is the M-AILABS Speech Dataset☆43Updated 3 months ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 5 months ago
- asr2k☆49Updated 9 months ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Clustering-based methods for overlapping diarization☆76Updated last year
- ☆38Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- The VoxTube dataset official repository☆68Updated last year
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆51Updated 6 months ago
- A sequence-to-sequence voice conversion toolkit.☆94Updated 8 months ago
- ☆62Updated 10 months ago
- A TTS model that makes a speaker speak new languages☆76Updated 8 months ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- ☆80Updated 9 months ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆82Updated 2 weeks ago
- ☆74Updated 3 years ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆32Updated 5 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆62Updated 11 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆44Updated 3 years ago
- Convert English text from written expressions into spoken forms☆24Updated 2 years ago
- multilingual speech aligner☆72Updated last year