ag1988 / mel-asr
☆13Updated 7 months ago
Related projects: ⓘ
- ☆16Updated 2 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆25Updated last year
- A list of podcast URLs scraped from the Apple podcast database in late 2021, including a script for downloading those podcasts.☆32Updated 2 years ago
- Temporary anonymous version☆22Updated 6 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆60Updated 6 months ago
- ☆15Updated last month
- ☆16Updated 2 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆8Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- A library of speech gadgets.☆13Updated last year
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆33Updated last week
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- ☆19Updated 5 months ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- A simple command line tool to calculate WER for ASR.☆13Updated last year
- Prosodic Speech Segmentation with Transformers☆22Updated 6 months ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆18Updated 11 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning and Rye☆12Updated 4 months ago
- ☆11Updated 2 weeks ago
- ☆22Updated 2 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- Survey on speech generation work.☆11Updated 9 months ago
- GPT for FACodec☆13Updated 5 months ago
- ☆25Updated 2 years ago
- ☆28Updated this week
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆12Updated 2 weeks ago
- A SPMI Lab toolkit for language models.☆11Updated 7 years ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago