ShivamRajSharma / Transformer-Text-To-Speech
Pytorch implementation of Transformer-TTS for converting text into speech.
☆19Updated 3 years ago
Alternatives and similar repositories for Transformer-Text-To-Speech
Users that are interested in Transformer-Text-To-Speech are comparing it to the libraries listed below
Sorting:
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- PyTorch implementation of automatic speech recognition models.☆38Updated 4 years ago
- ASR project with pytorch-lightning☆20Updated last month
- Official PyTorch implementation of TTS Style Transfer☆23Updated 2 years ago
- PyTorch based speaker embedding model☆16Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper☆18Updated 2 years ago
- GSoC'2021 | TensorFlow implementation of Wav2Vec2☆90Updated 3 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- ☆43Updated 2 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37Updated last year
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Updated 4 years ago
- Repository containing experimentation platform on how to train, infer on wav2vec2 models.☆87Updated 2 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆22Updated 4 years ago
- Using speaker embedding for diarization in PyTorch☆18Updated 4 years ago
- Refactored version of https://github.com/ming024/FastSpeech2☆14Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆75Updated 4 years ago
- Deep Speech Distances PyTorch☆28Updated 3 years ago
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Updated 5 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 4 years ago
- Unofficial Pytorch Implementation of WaveGrad2☆112Updated 3 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Updated 2 years ago
- ☆46Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆141Updated 2 years ago