tomasz-oponowicz / spoken_language_identificationLinks
Identify a spoken language using artificial intelligence (LID).
☆124Updated 7 years ago
Alternatives and similar repositories for spoken_language_identification
Users that are interested in spoken_language_identification are comparing it to the libraries listed below
Sorting:
- Making a TTS model with 1 minute of speech samples within 10 minutes☆184Updated 7 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.☆130Updated 4 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆82Updated last year
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆292Updated 4 years ago
- End-2-end speech synthesis with recurrent neural networks☆224Updated last year
- An opensource speech-to-text software written in tensorflow☆159Updated 2 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 6 years ago
- Speech-to-text based on wav2letter built for transfer learning☆98Updated 2 years ago
- A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis☆205Updated 6 years ago
- C++ Code to run waveglow inference in cuda☆131Updated 6 years ago
- Deep neural networks for getting text-independent speaker embedding written in TensorFlow☆310Updated 6 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46Updated 8 years ago
- Speaker diarization scripts, based on AaltoASR☆190Updated 6 years ago
- Speech Recognition model based off of FAIR research paper built using Pytorch.☆85Updated 6 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 6 years ago
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Updated 8 years ago
- Speech Recognition Using Tacotron☆164Updated 8 years ago
- ☆40Updated 7 years ago
- Upsample speech audio in wav format using deep learning☆195Updated 8 years ago
- Cross-lingual Voice Conversion☆97Updated 7 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆474Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Updated 5 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activities☆228Updated 4 years ago
- A Pytorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")☆126Updated last year
- A Pytorch Implementation of ClariNet☆292Updated 6 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆242Updated 7 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Updated 5 years ago
- A Collection of Speech Corpus for ASR and TTS☆114Updated 8 years ago
- Text Independent Speaker Verification Using GE2E Loss☆84Updated 6 years ago
- Paper: https://arxiv.org/abs/1702.02285☆64Updated 6 years ago