anjandeepsahni / automatic_speech_recognition
Speech-to-text transcription using an RNN (Listen, Attend and Spell).
☆11 Updated 5 years ago
Related projects
Alternatives and complementary repositories for automatic_speech_recognition
- End-to-End Speech Recognition Using TensorFlow ☆41 Updated last year
- speaker_diarization done on a toy dataset and tested on the TIMIT dataset ☆8 Updated 2 years ago
- Mispronunciation detection code for jingju singing voice ☆20 Updated 6 years ago
- This is an intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use… ☆10 Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting ☆23 Updated 2 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36 Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks. ☆35 Updated 2 months ago
- Audio preprocessing and fine-tuning of the wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data. ☆17 Updated 3 years ago
- ☆40 Updated last year
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems ☆19 Updated 3 years ago
- Companion repository for the blog article: https://www.endpointdev.com/blog/2019/01/speech-recognition-with-tensorflow/ ☆22 Updated 2 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 Updated 4 years ago
- Improving the Goodness of Pronunciation with DNNs and RNNs ☆32 Updated 6 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- Code and data repository for the paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆64 Updated 2 years ago
- Source code of the paper "End-to-End Language Diarization for Bilingual Code-switching Speech" ☆18 Updated 2 years ago
- TF 2.0 implementation of Listen, Attend and Spell ☆21 Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects ☆12 Updated 5 years ago
- Classification of 11 types of audio clips using MFCC features and LSTM. Pretrained on Speech Command Dataset with intensive data augment… ☆38 Updated last year
- Various algorithms for voice activity detection ☆22 Updated 7 years ago
- Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆64 Updated 3 years ago
- Data preparation code for building a Kaldi ASR system ☆14 Updated 7 years ago
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021. ☆34 Updated 2 years ago
- Machine learning experiment to perform gender classification from raw audio. ☆23 Updated 6 years ago
- ☆10 Updated last year
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras. ☆23 Updated 4 years ago
- A neural language modeling toolkit built on PyTorch ☆18 Updated last year
- Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clust… ☆43 Updated 4 years ago
- Fast SpecAugmentation code with NumPy and SciPy ☆30 Updated 5 years ago