WiraDKP / pytorch_gru_speaker_diarizationLinks
Speaker Diarization using GRU in PyTorch
☆11Updated 4 years ago
Alternatives and similar repositories for pytorch_gru_speaker_diarization
Users that are interested in pytorch_gru_speaker_diarization are comparing it to the libraries listed below
Sorting:
- For our Smart Media Player (detecting time period(s) inside audio/video during which specific person(s) is/are speaking) project☆18Updated 5 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Updated 4 years ago
- ☆90Updated 2 years ago
- Time series course Fall 2019 project☆53Updated 4 years ago
- Using speaker embedding for diarization in PyTorch☆17Updated 4 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆91Updated 4 years ago
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆122Updated last year
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆99Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- 📁 This repo makes it easy to download the raw audio files from AudioSet (32.45 GB, 632 classes).☆103Updated last year
- speaker_diarization done on toy dataset and tested on timit dataset☆7Updated 3 years ago
- Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)☆73Updated 4 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆62Updated last year
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- For our speech emotion recognition project☆28Updated 4 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆16Updated 2 years ago
- ☆10Updated 5 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Updated 5 years ago
- A neural attention model for speech command recognition☆185Updated 2 years ago
- Urban Sound Classification : striving towards a fair comparison☆17Updated 4 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆65Updated 4 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Updated 3 years ago
- Audio data augmentation examples☆34Updated 7 years ago