hoaaoh / Audio2Vec

Audio2Vec with multilingual support

☆8

Alternatives and similar repositories for Audio2Vec

Users interested in Audio2Vec are comparing it to the repositories listed below.

akashmjn / cs224n-gpu-that-talks
Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)
☆52 Updated 6 years ago
dingzeyuli / SpEAR-speech-database
A database of clean and noisy speech for audio research
☆9 Updated 7 years ago
santi-pdp / ahoproc_tools
Tools for Ahocoder data processing and evaluation metrics
☆14 Updated last year
ryanleary / patter
speech-to-text in pytorch
☆83 Updated 6 years ago
v0lta / Listen-attend-and-spell
A listen attend and spell reimplementation in tensorflow, using a custom attention mechanism.
☆44 Updated 6 years ago
Totoketchup / Adaptive-MultiSpeaker-Separation
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
☆51 Updated 7 years ago
vrenkens / nabu
Code for end-to-end ASR with neural networks, built with TensorFlow
☆109 Updated 6 years ago
CSTR-Edinburgh / magphase
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
☆80 Updated 5 years ago
CSTR-Edinburgh / Ossian
☆58 Updated 6 years ago
tiberiu44 / TTS-Cube
End-2-end speech synthesis with recurrent neural networks
☆225 Updated last year
jinserk / pytorch-asr
ASR with PyTorch
☆139 Updated 6 years ago
weedwind / CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
☆50 Updated 7 years ago
cdyangbo / end2endASR
Implementation of an end-to-end ASR algorithm with TensorFlow
☆40 Updated 6 years ago
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆58 Updated 5 years ago
r9y9 / SPTK
A modified version of Speech Signal Processing Toolkit (SPTK)
☆89 Updated 3 years ago
geyang / char2wav_pytorch
pytorch implementation of lyre.ai's char2wav model
☆32 Updated 8 years ago
opendcd / opendcd
Open Source WFST-based Decoder Toolkit
☆77 Updated 9 years ago
G-Wang / WaveRNN-Pytorch
Fatcord's Alternative WaveRNN (Faster training)
☆125 Updated 6 years ago
dmitriy-serdyuk / kaldi-python
Python wrappers for Kaldi data
☆33 Updated 7 years ago
Idlak / Living-Audio-Dataset
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆41 Updated 2 years ago
glecorve / rnnlm2wfst
Conversion of recurrent neural network language models to weighted finite state transducers
☆58 Updated 7 years ago
aalto-speech / AaltoASR
Aalto Automatic Speech Recognition tools
☆88 Updated 8 years ago
awni / transducer
A Fast Sequence Transducer Implementation with PyTorch Bindings
☆198 Updated 2 years ago
idiap / juicer
Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).
☆62 Updated 9 years ago
zh217 / torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
☆51 Updated 3 years ago
jpuigcerver / kaldi-decoders
Custom decoders for Kaldi
☆79 Updated 6 years ago
syoyo / tacotron-tts-cpp
Tacotron text-to-speech in C++ (synthesis only)
☆76 Updated 5 years ago
YiwenShaoStephen / pychain_example
☆48 Updated 4 years ago
azraelkuan / FFTNet
FFTNet: a Real-Time Speaker-Dependent Neural Vocoder
☆64 Updated 6 years ago
sequence-labeling / rnn-transducer
An implementation of the RNN transducer for sequence labeling problems
☆22 Updated 7 years ago