sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆45Updated 3 years ago
Related projects:
- Source code for the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models"☆44Updated 2 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆63Updated 2 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆33Updated 2 years ago
- PyTorch implementation of Google's Non-Attentive Tacotron.☆57Updated last year
- PyTorch implementation of RNN-Transducer (RNN-T).☆68Updated 3 years ago
- ☆37Updated 3 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆18Updated 11 months ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆51Updated 4 years ago
- neural network based speaker embedder☆25Updated last year
- Refactored version of https://github.com/ming024/FastSpeech2☆13Updated 2 years ago
- A package for crawling audio from YouTube☆41Updated last year
- A simple tool for using the Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.☆42Updated last year
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆78Updated last month
- ☆56Updated last year
- Repository for speech paper reading☆32Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 3 years ago
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch).☆15Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 2 years ago
- ☆69Updated this week
- ☆9Updated last year
- Explores different ways to combine speech models (wav2vec2, HuBERT) with NLP models (BART, T5, GPT)☆42Updated last year
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆117Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- CTC decoder implemented in pure Python. Also supports language model decoding using KenLM.☆36Updated 4 months ago
- UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation☆69Updated 3 years ago
- Example code for a neural transducer model.☆58Updated 7 months ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago