anjandeepsahni / automatic_speech_recognition
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Updated 5 years ago
Alternatives and similar repositories for automatic_speech_recognition
Users who are interested in automatic_speech_recognition are comparing it to the repositories listed below
- Speaker diarization done on a toy dataset and tested on the TIMIT dataset☆8Updated 3 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.☆36Updated 5 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 4 years ago
- Simple speaker recognition project☆7Updated 5 years ago
- ☆12Updated 3 months ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- TensorFlow 2.0 implementation of Listen, Attend and Spell☆21Updated 4 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignments, and dec…☆38Updated 7 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 6 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆92Updated 4 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆13Updated 5 years ago
- Language identification using Siamese network based on i-vector☆7Updated 7 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 8 months ago
- Baseline Kaldi script for UA-SPEECH corpus☆30Updated 7 months ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I…☆16Updated 2 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- Companion repository for the blog article: https://www.endpointdev.com/blog/2019/01/speech-recognition-with-tensorflow/☆22Updated 3 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago