hoaaoh / Audio2VecLinks
Audio2Vec with multi lingual
☆8Updated 7 years ago
Alternatives and similar repositories for Audio2Vec
Users that are interested in Audio2Vec are comparing it to the libraries listed below
Sorting:
- Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)☆52Updated 6 years ago
- A database of clean and noisy speech for audio research☆9Updated 7 years ago
- Tools for Ahocoder data processing and evaluation metrics☆14Updated last year
- speech-to-text in pytorch☆83Updated 6 years ago
- A listen attend and spell reimplementation in tensorflow, using a custom attention mechanism.☆44Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Updated 7 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆109Updated 6 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Updated 5 years ago
- ☆58Updated 6 years ago
- End-2-end speech synthesis with recurrent neural networks☆225Updated last year
- ASR with PyTorch☆139Updated 6 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 7 years ago
- implement end-to-end asr algorithm with tensorflow☆40Updated 6 years ago
- TTS model based on Transformer.☆58Updated 5 years ago
- A modified version of Speech Signal Processing Toolkit (SPTK)☆89Updated 3 years ago
- pytorch implementation of lyre.ai's char2wav model☆32Updated 8 years ago
- Open Source WFST-based Decoder Toolkit☆77Updated 9 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆125Updated 6 years ago
- Python wrappers for Kaldi data☆33Updated 7 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Updated 7 years ago
- Aalto Automatic Speech Recognition tools☆88Updated 8 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆198Updated 2 years ago
- Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).☆62Updated 9 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Updated 3 years ago
- Custom decoders for Kaldi☆79Updated 6 years ago
- Tacotron text to speech in C++(synthesize only)☆76Updated 5 years ago
- ☆48Updated 4 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Updated 7 years ago