magnumresearchgroup / Fastaudio
FastAudio is a learnable audio frontend designed by team Magnum for the ASVspoof 2021 challenge
☆45Updated 2 years ago
Alternatives and similar repositories for Fastaudio
Users that are interested in Fastaudio are comparing it to the libraries listed below
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Updated 4 years ago
- ☆32Updated 2 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆66Updated 3 years ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆71Updated 2 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆50Updated 3 years ago
- ☆45Updated 2 years ago
- Discriminative Condition-Aware PLDA☆44Updated last year
- Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning☆23Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆81Updated last year
- Official implementation of our ASVspoof 2021 paper, "UR Channel-Robust Synthetic Speech Detection System for ASVspoof 2021"☆56Updated 3 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Updated 2 years ago
- Python package for combining diarization system outputs.☆92Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- This GitHub repo is for NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆105Updated 2 years ago
- ☆32Updated 3 years ago
- ☆58Updated 9 months ago
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆25Updated 2 months ago
- This repository surveys papers focusing on Prompting and Adapters for Speech Processing.☆111Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- A PyTorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆58Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Updated last year
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆34Updated 4 years ago
- ☆37Updated 4 years ago
- A simple package for Guided source separation (GSS)☆132Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆157Updated 3 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆53Updated last year
- PyTorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated 2 years ago
- Python toolkit for speech processing☆72Updated 3 weeks ago