igormq / aes-lac-2018
Pytorch code of "A new automatic speech recognizer for Brazilian Portuguese based on deep neural networks and transfer learning" submitted to AES-LAC 2018
☆21Updated 5 years ago
Alternatives and similar repositories for aes-lac-2018:
Users that are interested in aes-lac-2018 are comparing it to the libraries listed below
- ☆11Updated 3 years ago
- Wav2vec resources and models for Brazilian Portuguese☆32Updated 2 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.☆22Updated 4 years ago
- Implementation of all-neural speech recognition systems using Keras and Tensorflow☆144Updated 7 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- Python library for handling audio datasets.☆136Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆41Updated 2 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆72Updated 3 years ago
- In this work I investigate the speech command task developing and analyzing deep learning models. The state of the art technology uses co…☆19Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 7 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- ☆25Updated 7 years ago
- A neural attention model for speech command recognition☆183Updated last year
- ☆15Updated last year
- 6th place solution to Freesound Audio Tagging 2019 kaggle competition☆25Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 8 months ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Updated 7 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 10 months ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- This is a legacy repo. Dev occurs now on GitHub.☆11Updated 3 years ago
- collaborative audio module for fast.ai☆98Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 6 years ago
- Attacking Speaker Recognition with Deep Generative Models☆34Updated last year
- Uma base de dados para estudo de regionalismos brasileiros através da voz.☆7Updated last year
- PyTorch end-to-end speech recognition☆49Updated 4 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆71Updated 5 years ago
- Utils and data sets for audio and PyTorch☆85Updated 3 years ago
- Articulatory features estimation using Listen Attend and Spell architecture.☆32Updated 4 years ago