itmo-mbss-lab / sr_labs_book
The project is related to the development of labs for the ITMO Speaker Recognition Course.
☆10Updated 2 years ago
Alternatives and similar repositories for sr_labs_book:
Users that are interested in sr_labs_book are comparing it to the libraries listed below
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆39Updated 4 years ago
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 3 months ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated last year
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆23Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 2 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Constrained Permutation Invariant Training, Speech Separation☆44Updated 3 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆38Updated 6 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 4 years ago
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 3 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆38Updated 2 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆15Updated last year
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆16Updated last year
- Clustering-based methods for overlapping diarization☆74Updated last year
- A PyTorch 1.0 implementation of the convolutions described in SincNet☆32Updated 5 years ago
- ☆59Updated 4 years ago
- Experiments on speech recognition robustness to accents and dialects☆12Updated 5 years ago
- Python toolkit for speech processing☆68Updated last week
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 5 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 4 months ago
- ☆27Updated 2 years ago
- ☆53Updated 4 years ago