WiraDKP / pytorch_gru_speaker_diarization
Speaker Diarization using GRU in PyTorch
☆11 Updated 4 years ago
Related projects:
- Speaker detection for our Smart Media Player project: finds the time period(s) in audio/video during which specific person(s) are speaking ☆18 Updated 4 years ago
- Neural network based similarity scoring for diarization (PyTorch implementation of "LSTM based Similarity Measurement with Spectral Clust… ☆43 Updated 3 years ago
- Estimating the age, height, and gender of a speaker from their speech signal. https://arxiv.org/pdf/2110.13653.pdf ☆57 Updated 3 years ago
- Speaker diarization for the Hindi language. ☆44 Updated 3 years ago
- Keras (TensorFlow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet) ☆72 Updated 3 years ago
- Multi-class audio classification using deep learning (MLP, CNN): The objective of this project is to build a multi-class classifier to id… ☆64 Updated 3 years ago
- Lightweight neural speaker embedding extraction based on Kaldi and PyTorch. ☆135 Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi… ☆88 Updated 3 years ago
- Urdu Language Speech Emotional Corpus ☆43 Updated 5 years ago
- Speaker diarization on a toy dataset, tested on the TIMIT dataset ☆8 Updated 2 years ago
- Augmented Audio Data Generator for 1D-Convolutional Neural Networks ☆49 Updated 3 years ago
- Detection of emotion in speech using a convolutional neural network ☆20 Updated 4 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes). ☆97 Updated last year
- Urban sounds classification with Convolutional Neural Networks ☆36 Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP… ☆59 Updated 3 years ago
- Library of TensorFlow layers for audio data processing and data augmentation ☆20 Updated 2 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging ☆30 Updated 3 years ago
- A TensorFlow implementation of GhostVLAD for speaker recognition ☆14 Updated 5 years ago
- Multi-class audio classification with MFCC features using CNN ☆26 Updated 4 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification) ☆98 Updated 4 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e… ☆63 Updated 4 years ago
- Implement a GRU/LSTM model using Keras, and train it to classify languages using MFCC features ☆26 Updated last month
- [deprecated] Pretrained models for pyannote-audio 1.x ☆70 Updated 2 years ago
- End-to-End Speech Recognition Using TensorFlow ☆40 Updated last year
- [InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei … ☆208 Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language. ☆112 Updated 3 months ago
- Feature extractor for DL speech processing. ☆65 Updated 2 years ago
- Speaker recognition, voiceprint recognition ☆51 Updated 4 years ago