anjandeepsahni / automatic_speech_recognition
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Updated 5 years ago
Alternatives and similar repositories for automatic_speech_recognition:
Users that are interested in automatic_speech_recognition are comparing it to the libraries listed below
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 3 years ago
- End-to-End Speech Recognition Using Tensorflow☆41Updated 2 years ago
- tf 2.0 implementation of Listen, attend and spell☆21Updated 4 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 4 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 2 years ago
- ☆12Updated 2 months ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 4 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 7 months ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Updated 3 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year
- Keyword spotting using RNNs + Edit distance☆9Updated 4 years ago
- A Kaldi/ESPnet based approach to perform automatic speech recognition on low resource languages☆9Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- Various algorithms for voice activity detection☆22Updated 8 years ago
- Repository hosting code and slides of the Audio Data Augmentation series on The Sound of AI YT channel.☆37Updated 3 years ago
- Transformer implementation speciaized in speech recognition tasks using Pytorch.☆64Updated 3 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated last year
- ☆46Updated 2 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 6 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago