anjandeepsahni / automatic_speech_recognition
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Updated 5 years ago
Related projects: ⓘ
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 2 years ago
- End-to-End Speech Recognition Using Tensorflow☆40Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 3 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 3 weeks ago
- This project is about performing Speaker diarization for Hindi Language.☆44Updated 3 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 3 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 2 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆91Updated 4 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 3 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 2 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆20Updated 6 years ago
- End to End Dialect Identification using Convolutional Neural Network☆51Updated 4 years ago
- Audio data augmentation examples☆35Updated 6 years ago
- Language identification using Siamese network based on i-vector☆7Updated 6 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆24Updated 5 years ago
- Simple speaker recognition project☆7Updated 5 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆22Updated 3 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Audio classification via transfer learning☆32Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 3 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆35Updated 4 years ago