WiraDKP / pytorch_gru_speaker_diarizationLinks
Speaker Diarization using GRU in PyTorch
☆11Updated 5 years ago
Alternatives and similar repositories for pytorch_gru_speaker_diarization
Users that are interested in pytorch_gru_speaker_diarization are comparing it to the libraries listed below
Sorting:
- ☆90Updated 3 years ago
- A neural attention model for speech command recognition☆187Updated 4 months ago
- Simple d-vector based Speaker Recognition (verification and identification) using Pytorch☆212Updated 5 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆74Updated 4 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆141Updated 4 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆125Updated last year
- Time series course Fall 2019 project☆53Updated 5 years ago
- Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…☆68Updated 4 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆423Updated 2 years ago
- Identifying people from small audio fragments☆170Updated 5 years ago
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …☆208Updated 2 years ago
- Code and slides for the "Deep learning (audio) application: From design to deployment" tutorials.☆175Updated last year
- Data repository of Project Coswara☆194Updated 2 years ago
- Detecting emotions using MFCC features of human speech using Deep Learning☆130Updated 4 years ago
- An in-depth analysis of audio classification on the RAVDESS dataset. Feature engineering, hyperparameter optimization, model evaluation, …☆78Updated 5 years ago
- A collection of Audio and Speech pre-trained models.☆194Updated 5 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Deep learning based speech source separation using Pytorch☆319Updated 5 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system.☆94Updated 5 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆267Updated 3 years ago
- Speaker identification using voice MFCCs and GMM☆54Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Updated 4 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 5 years ago
- It uses GMM to train a speaker identification model. The training and testing has been done on subset (34 speakers) from VoxForge data co…☆58Updated 6 years ago
- This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1☆114Updated 6 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Updated 4 years ago
- Include some core functions and model to handle speech separation☆156Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library☆212Updated 5 years ago
- For our Smart Media Player (detecting time period(s) inside audio/video during which specific person(s) is/are speaking) project☆18Updated 5 years ago