rusiaaman / PCPM
Presenting Collection of Pretrained Models. Links to pretrained models in NLP and voice.
☆21 Updated 4 years ago
Related projects
Alternatives and complementary repositories for PCPM
- ASR project with pytorch-lightning ☆20 Updated 4 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆64 Updated last year
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf ☆31 Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆48 Updated 2 months ago
- Code for AccentDB. ☆19 Updated 3 years ago
- Anonymous ICLR Submission ☆14 Updated 5 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch ☆51 Updated 3 years ago
- Example implementation of Monotonic Chunkwise Attention. ☆50 Updated 6 years ago
- Online (real-time) decoder to be used with DeepSpeech2 model ☆24 Updated 4 years ago
- Tensor2tensor experiment with SpecAugment ☆47 Updated 5 years ago
- Sound augmentation using Large-scale audio dataset (Audioset) ☆44 Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR ☆33 Updated 4 years ago
- A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech") ☆120 Updated 8 months ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 Updated 3 months ago
- ☆74 Updated 3 years ago
- An implementation of Tacotron and Tacotron2 ☆81 Updated 3 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021 ☆57 Updated 2 years ago
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi… ☆29 Updated last year
- PyTorch end-to-end speech recognition ☆49 Updated 3 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data. ☆10 Updated 4 years ago
- An extensible speech synthesis system, built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery ☆26 Updated 5 years ago
- An adaptation of Fairseq to (End-to-end) speech translation. ☆22 Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆103 Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 Updated 2 years ago
- ☆12 Updated last year
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech" ☆51 Updated 4 years ago