ShivamRajSharma / Transformer-Text-To-Speech
Pytorch implementation of Transformer-TTS for converting text into speech.
☆18Updated 3 years ago
Alternatives and similar repositories for Transformer-Text-To-Speech:
Users that are interested in Transformer-Text-To-Speech are comparing it to the libraries listed below
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆66Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆136Updated 2 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆86Updated 2 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆75Updated 3 years ago
- ASR project with pytorch-lightning☆20Updated 4 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆111Updated 3 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 5 months ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 3 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆91Updated 3 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 4 years ago
- ☆163Updated 2 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated last year
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆45Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆64Updated 3 years ago
- A Pytorch Implementation of MelGAN☆67Updated 5 years ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆34Updated 2 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Updated 3 years ago
- Example code for a neural transducer model.☆61Updated last year
- PyTorch based speaker embedding model☆15Updated 10 months ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Updated 3 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆74Updated 4 years ago
- ☆56Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- ☆42Updated 3 years ago