zzw922cn / awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
β3,032Updated last year
Alternatives and similar repositories for awesome-speech-recognition-speech-synthesis-papers:
Users that are interested in awesome-speech-recognition-speech-synthesis-papers are comparing it to the libraries listed below
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflowβ2,842Updated 2 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,324Updated 11 months ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.β1,863Updated 2 years ago
- Speech Recognition using DeepSpeech2.β2,115Updated 2 years ago
- End-to-End Speech Processing Toolkitβ9,053Updated last week
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,β¦β2,385Updated 3 years ago
- This is now the official location of the Merlin project.β1,313Updated 5 years ago
- Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLPβ1,559Updated 3 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Textβ758Updated last year
- Deep Speaker: an End-to-End Neural Speaker Embedding System.β926Updated last year
- A Python wrapper for Kaldiβ1,015Updated 3 months ago
- List of speech synthesis papers.β1,039Updated last year
- The Implementation of FastSpeech based on pytorch.β871Updated last year
- The official repository of the Eesen projectβ829Updated 5 years ago
- Speech Enhancement Generative Adversarial Network in TensorFlowβ840Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,603Updated last year
- WaveNet vocoderβ2,357Updated last year
- A Flow-based Generative Network for Speech Synthesisβ2,327Updated last year
- A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Modelβ1,832Updated 3 years ago
- A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)β2,975Updated last year
- This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )β537Updated 3 years ago
- Deep Learning & 3D Convolutional Neural Networks for Speaker Verificationβ785Updated 5 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis modelsβ1,978Updated last year
- DeepMind's Tacotron-2 Tensorflow implementationβ2,310Updated last year
- πSpeech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networksβ2,170Updated last year
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.β788Updated 2 years ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwβ¦β968Updated this week
- End-to-end ASR/LM implementation with PyTorchβ596Updated 3 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pytβ¦β1,200Updated 4 years ago
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β680Updated last year