khannasarthak / Stuttered-Speech-recognition
Final semester project on Stuttered Speech recognition
☆18 · Updated 7 years ago
Alternatives and similar repositories for Stuttered-Speech-recognition:
Users who are interested in Stuttered-Speech-recognition are comparing it to the libraries listed below.
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch. ☆42 · Updated last year
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆92 · Updated 4 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F… ☆13 · Updated 3 years ago
- A unified dataset of multilingual emotional human utterances ☆25 · Updated 3 years ago
- Music genre classification: LSTM vs Transformer ☆60 · Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025 ☆22 · Updated 3 months ago
- StammerClipper: A deep learning approach for automatic stutter detection ☆9 · Updated 3 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor… ☆20 · Updated 2 years ago
- StutterFormer is an AI model that aims to be able to receive a speech sample with stuttering disfluencies, and return it with the disflue… ☆18 · Updated 2 years ago
- Self-supervised Speech Enhancement network ☆11 · Updated 4 years ago
- Keras-based Python framework to compute phonological posterior probabilities from audio files ☆41 · Updated 2 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u… ☆24 · Updated 3 years ago
- Reading notes of speech or deep learning related papers, including Automatic Speech Recognition (ASR), Speech Enhancement and Dereverbera… ☆23 · Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. ☆14 · Updated 2 years ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022). ☆132 · Updated 2 years ago
- ☆97 · Updated last year
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆89 · Updated 3 years ago
- Matlab tools for pathological voice analysis ☆13 · Updated last year
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning … ☆33 · Updated 2 years ago
- Toolkit to assess speech impairments in patients with neurological disorders ☆54 · Updated 6 years ago
- Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch ☆36 · Updated 11 months ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆118 · Updated 9 months ago
- Real-time speech enhancement mobile app using Nested U-Net ☆48 · Updated last year
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id… ☆66 · Updated 4 years ago
- ☆29 · Updated 2 years ago
- In this repository, I implement a system for detecting specific spoken words in speech signal. When reading a speech signal, I detect not… ☆17 · Updated 3 years ago
- A set of scripts that extract speech features (so far MFCCs, FBANKs, STFT, and kinda dominant frequency) and trains CNN, LSTM, or CNN+LST… ☆51 · Updated 2 years ago
- [Research] Monaural Speech Enhancement through Wave-U-Net (SEWUNet) ☆30 · Updated 2 years ago
- Various audio processing tasks ☆21 · Updated 2 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw… ☆29 · Updated 3 weeks ago