zvadaadam / speech-recognition
End to End Speech Recognition with Tensorflow
☆9Updated 6 years ago
Alternatives and similar repositories for speech-recognition:
Users that are interested in speech-recognition are comparing it to the libraries listed below
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 3 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆73Updated 3 years ago
- This repository provides you the details of how speech recognition is done from end to end.☆25Updated 6 years ago
- A neural attention model for speech command recognition☆185Updated 2 years ago
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆56Updated last month
- Pytorch implementation of conformer with with training script for end-to-end speech recognition on the LibriSpeech dataset.☆27Updated 11 months ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor…☆101Updated 2 years ago
- An implementation of MatchboxNet☆10Updated 2 years ago
- An implementation of Conformer: Convolution-augmented Transformer for Speech Recognition, a Transformer Variant in TensorFlow/Keras☆44Updated 3 years ago
- ☆60Updated last year
- Final project for the Speaker Recognition course on Udemy, 机器之心, 深蓝学院 and 语音之家☆44Updated 11 months ago
- Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)☆16Updated 4 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST…☆51Updated 2 years ago
- Implementation of IEEE Access paper - Lung Sound Recognition Algorithm Based on VGGish-BiGRU☆28Updated 5 years ago
- A unified dataset of multilingual emotional human utterances☆25Updated 3 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆43Updated last year
- ☆19Updated last year
- Learning Efficient Representations for Keyword Spotting with Triplet Loss☆109Updated 2 years ago
- Classification of 11 types of audio clips using MFCCs features and LSTM. Pretrained on Speech Command Dataset with intensive data augment…☆42Updated 2 years ago
- Conformer RNN-Transducer☆15Updated 2 years ago
- A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.☆15Updated 3 years ago
- Sound Classification using Librosa, ffmpeg, CNN, Keras, XGBOOST, Random Forest.☆68Updated last year
- Toolkit to asses speech impairments in patients with neurological disorders☆55Updated 6 years ago
- VArious audio processing tasks☆21Updated 2 years ago
- ☆29Updated 2 years ago
- Unofficial PyTorch implementation of "Keyword Transformer: A Self-Attention Model for Keyword Spotting", Berg et al. 2021.☆37Updated 2 years ago
- A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'☆41Updated 6 years ago
- CNN 1D vs 2D audio classification☆104Updated 6 years ago
- In this project, the performance of speech emotion recognition is compared between two methods (SVM vs Bi-LSTM RNN).Conventional classifi…☆27Updated last year
- Multi-class audio classification with MFCC features using CNN☆29Updated 5 years ago