30stomercury / Automatic-Speech-Recognition
End-to-End Speech Recognition Using Tensorflow
☆41Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Automatic-Speech-Recognition
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- Paper: https://arxiv.org/abs/1702.02285☆62Updated 5 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- transformer for ASR-systerm (via tensorflow2.0)☆114Updated 5 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆137Updated 4 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆166Updated last year
- ☆59Updated 4 years ago
- This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…☆87Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆44Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆89Updated 3 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆53Updated 5 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 5 years ago
- A Python toolbox for speech features extraction☆159Updated last year
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago
- Implementation of audio degradation processes☆101Updated 9 years ago
- Feature extractor for DL speech processing.☆65Updated 2 years ago
- An implementation of RNN-Transducer loss in TF-2.0.☆45Updated last year
- tf 2.0 implementation of Listen, attend and spell☆21Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- BERT and LSTM baseline models of the ZeroSpeech Challenge 2021☆57Updated 2 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago