zvadaadam / speech-recognition
End-to-End Speech Recognition with TensorFlow
☆9 Updated 5 years ago
Related projects
Alternatives and complementary repositories for speech-recognition
- Predicting the labels (spoken languages) of audio files with audio features (MFCC, RASTA, PLP) using ML-based and statistical approaches … ☆10 Updated 5 years ago
- PyTorch implementation of the speech recognition paper "Sequence Transduction with Recurrent Neural Networks" (RNN-T) ☆11 Updated 2 years ago
- End-to-End Speech Recognition using Neural Networks. ☆35 Updated 2 months ago
- Audio data augmentation examples ☆35 Updated 6 years ago
- Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆72 Updated 3 years ago
- Speech Emotion Recognition from raw speech signals using 1D CNN-LSTM ☆102 Updated 3 years ago
- Feature extraction from the speech signal is the initial stage of any speech recognition system. ☆91 Updated 4 years ago
- In this repository, we explore using a hybrid system consisting of a Convolutional Neural Network and a Support Vector Machine for Keywor… ☆92 Updated last year
- It uses a GMM to train a speaker identification model. The training and testing have been done on a subset (34 speakers) from VoxForge data co… ☆55 Updated 5 years ago
- Speech Separation ☆52 Updated 8 months ago
- TensorFlow implementation of "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition" ☆10 Updated 2 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1 ☆110 Updated 5 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I… ☆58 Updated 3 years ago
- Simple, straightforward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth. ☆20 Updated 5 years ago
- Audio classification via transfer learning ☆32 Updated 5 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch. ☆34 Updated last year
- Multi-class audio classification with MFCC features using CNN ☆28 Updated 4 years ago
- Cough detection with Log Mel Spectrogram, Wavelet Transform, Deep learning and Transfer learning techniques ☆15 Updated 3 years ago
- DDAE speech enhancement on spectrogram domain using Keras ☆25 Updated 7 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆15 Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021 ☆64 Updated 2 years ago
- Automatic Speaker Recognition algorithms in Python ☆93 Updated 3 years ago
- Speaker identification/verification models for the Machine Learning for Computer Vision class at UNIBO ☆58 Updated 2 years ago
- Voice Activity Detection (VAD) using deep learning. ☆192 Updated 5 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify the languages using MFCC features ☆27 Updated 3 months ago
- Implementation of "Efficient Keyword Spotting Using Dilated Convolutions and Gating" ☆35 Updated 4 years ago