igormq / aes-lac-2018
Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitted to AES-LAC 2018
☆21Updated 5 years ago
Alternatives and similar repositories for aes-lac-2018:
Users that are interested in aes-lac-2018 are comparing it to the libraries listed below
- ☆12Updated 4 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆144Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- FFTNet vocoder implementation☆81Updated 6 years ago
- Deep learning for Text to Speech☆27Updated 4 years ago
- FFTNet: a Real-Time Speaker-Dependent Neural Vocoder☆64Updated 6 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 10 months ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- ☆38Updated 4 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 7 years ago
- A test bed for updates and new features | pytorch/audio☆169Updated 4 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago
- Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"☆25Updated 5 years ago
- Python library for handling audio datasets.☆137Updated last year
- Some notes on Kaldi☆31Updated 10 years ago
- Fatcord's Alternative WaveRNN (Faster training)☆132Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- ☆45Updated 5 years ago
- Vocode spectrograms to audio with generative adversarial networks☆63Updated 5 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Collection of machine learning demos for Automatic Speech Recognition☆55Updated 3 years ago
- Pytorch Implementation of FFTNet☆86Updated 6 years ago
- ☆25Updated 7 years ago
- Deep Neural Network for Speaker Count Estimation☆148Updated 4 years ago
- Python functions to convert between different speech quality metrics☆54Updated 6 years ago
- A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.☆40Updated 6 years ago
- DeepMind's Tacotron-2 Tensorflow implementation☆34Updated 6 years ago