itmo-mbss-lab / sr_labs_bookLinks
The project is related to the development of labs for the ITMO Speaker Recognition Course.
☆10Updated 2 months ago
Alternatives and similar repositories for sr_labs_book
Users that are interested in sr_labs_book are comparing it to the libraries listed below
Sorting:
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Updated 10 months ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 3 years ago
- ☆14Updated last year
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆17Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆28Updated last year
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆11Updated 7 months ago
- Spectra extraction tutorials based on torch and torchaudio.☆41Updated last year
- Implementation of the paper "Keyword Transformer: A Self-Attention Model for Keyword Spotting"☆23Updated 4 years ago
- Official Implementation of Mockingjay in Pytorch☆55Updated 2 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆45Updated last year
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆60Updated 4 years ago
- Code for 4th Place Solution in Spoofing-Aware Speaker Verification Challenge 2022☆9Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆91Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 4 years ago
- Discriminative Condition-Aware PLDA☆44Updated 11 months ago
- Constrained Permutation Invariant Training, Speech Separation☆47Updated 4 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆65Updated 3 years ago
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆18Updated 3 years ago
- ☆21Updated 4 years ago
- FastAudio is a Learnable Audio Frontend team Magnum's designed for the ASVspoof 2021 challenge☆46Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆41Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 3 years ago
- ☆60Updated 4 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆55Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Updated last year