anjandeepsahni / automatic_speech_recognition
Speech to text transcription using RNN (Listen, Attend and Spell).
☆11Updated 5 years ago
Alternatives and similar repositories for automatic_speech_recognition:
Users that are interested in automatic_speech_recognition are comparing it to the libraries listed below
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Updated 5 years ago
- End-to-End Speech Recognition using Neural Networks.☆35Updated 7 months ago
- End-to-End Speech Recognition Using Tensorflow☆42Updated 2 years ago
- speaker_diarization done on toy dataset and tested on timit dataset☆8Updated 3 years ago
- tf 2.0 implementation of Listen, attend and spell☆21Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆18Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆20Updated 3 years ago
- ☆12Updated 2 months ago
- speech recognition using Kaldi framework☆12Updated 5 years ago
- Audio data augmentation examples☆34Updated 6 years ago
- This is a intuitive explanation of Representation Learning with Contrastive Predictive Coding using code provided by jefflai108 that use…☆10Updated 4 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- https://www.kaggle.com/c/tensorflow-speech-recognition-challenge/☆21Updated 7 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆92Updated 4 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?☆34Updated 6 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆19Updated 3 years ago
- Mispronunciation detection code for jingju singing voice☆20Updated 6 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 4 years ago
- ☆45Updated 2 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 2 years ago
- Language identification using Siamese network based on i-vector☆7Updated 7 years ago
- Pytorch implementation of Tacotron, a speech synthesis end-to-end generative TTS model.☆29Updated 6 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆66Updated 3 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features☆25Updated 7 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Some tutorials used for ASR class☆31Updated 3 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 2 years ago
- The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project descript…☆28Updated 5 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 4 years ago