ShivamRajSharma / Transformer-Text-To-SpeechLinks

Pytorch implementation of Transformer-TTS for converting text into speech.

☆19

Alternatives and similar repositories for Transformer-Text-To-Speech

Users that are interested in Transformer-Text-To-Speech are comparing it to the libraries listed below

Sorting:

Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Updated 2 years ago
clam004 / unsupervised-speech-representation-learning
This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…
☆10Updated 4 years ago
dipjyoti92 / TTS-Style-Transfer
Official PyTorch implementation of TTS Style Transfer
☆24Updated 3 years ago
sooftware / speech-transformer
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆64Updated 3 years ago
CODEJIN / Speaker_Embedding_Torch
PyTorch based speaker embedding model
☆16Updated last year
Open-Speech-EkStep / vakyansh-wav2vec2-experimentation
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
☆87Updated 2 years ago
CODEJIN / SPEECHSPLIT
An implement of SPEECHSPLIT
☆15Updated 4 years ago
bepierre / SpeechVGG
Feature extractor for DL speech processing.
☆66Updated 3 years ago
chuachinhon / wav2vec2_transformers
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆32Updated 4 years ago
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Updated 2 years ago
tongjinle123 / speech-transformer-pytorch_lightning
ASR project with pytorch-lightning
☆20Updated 3 months ago
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆13Updated 2 years ago
Rishit-dagli / Conformer
An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras
☆44Updated 3 years ago
adiyoss / AutoVowelDuration
Automatic Measurement of Vowel Duration for Consonant Vowel Consonant (CVC) sound files (JASA 2016)
☆14Updated 8 years ago
an-tran528 / wavetransformer
Code base for WaveTransformer: A novel architecture for automated audio captioning
☆44Updated 4 years ago
Speech-Lab-IITM / Hindi-ASR-Challenge
🎯 Speech Recognition Challenge by Speech Lab - IIT Madras
☆11Updated 4 years ago
xinjli / asr2k
asr2k
☆51Updated last year
AccentDB / code
Code for AccentDB.
☆22Updated 4 years ago
f90 / Seq-U-Net
Official implementation of the Seq-U-Net for efficient sequence modelling
☆79Updated 11 months ago
ronggong / mispronunciation-detection
Mispronunciation detection code for jingju singing voice
☆20Updated 6 years ago
shangeth / wavencoder
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…
☆91Updated 4 years ago
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 4 years ago
andi611 / Mockingjay-Speech-Representation
Official Implementation of Mockingjay in Pytorch
☆55Updated 2 years ago
AndreevP / speech_distances
Deep Speech Distances PyTorch
☆29Updated 3 years ago
zerospeech / zerospeech2021_baseline
BERT and LSTM baseline models of the ZeroSpeech Challenge 2021
☆60Updated 2 years ago
keonlee9420 / WaveGrad2
PyTorch Implementation of Google Brain's WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis
☆69Updated 3 years ago
sooftware / RNN-Transducer
PyTorch implementation of RNN-Transducer(RNN-T).
☆77Updated 4 years ago
andi611 / TTS-Tacotron-Pytorch
Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.
☆29Updated 6 years ago
seujung / WaveNet-gluon
Implementation of WaveNet with Gluon
☆16Updated 6 years ago
Open-Speech-EkStep / audio-to-speech-pipeline
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
☆32Updated 2 years ago