zycv / Speaker-Recognition-Based-on-Deep-Learning-An-Overview
This repo lists the reference papers of "Speaker Recognition Based on Deep Learning: An Overview"
☆41Jun 26, 2021Updated 4 years ago
Alternatives and similar repositories for Speaker-Recognition-Based-on-Deep-Learning-An-Overview
Users that are interested in Speaker-Recognition-Based-on-Deep-Learning-An-Overview are comparing it to the libraries listed below
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Using Kaldi x-vector method to train speaker recognition model on aishell database.☆17Aug 19, 2021Updated 4 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆66Feb 16, 2022Updated 4 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Python code for training and testing of GMM-UBM and maximum a posteriori (MAP) adaptation based speaker verification☆20Jul 31, 2020Updated 5 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Synthetic speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 5 months ago
- Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- The dataset of Speech Recognition☆449Jan 4, 2026Updated last month
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)☆787Apr 11, 2024Updated last year
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Jul 25, 2024Updated last year
- Incorporate Image, Text and Tabular Data with HuggingFace Transformers☆12Mar 1, 2022Updated 3 years ago
- ☆30Dec 23, 2025Updated last month
- A data processing module implemented with numpy☆10Aug 16, 2022Updated 3 years ago
- Multimodal data loader compatible with pytorch and tensorflow☆12Aug 14, 2024Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- Speaker feature (voiceprint) extraction tool, based on the VGG-SR pretrained model.☆38Mar 7, 2020Updated 5 years ago
- A web based command line interface in a Docker container, based on ttyd.☆11Mar 15, 2021Updated 4 years ago
- ☆10Jun 2, 2021Updated 4 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- GAN-BASED DATA AUGMENTATION for RAMAN SPECTRA☆10Jul 11, 2025Updated 7 months ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- Wav2vec2 Large XLSR 53 fine-tuned for Malayalam☆11Sep 7, 2021Updated 4 years ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- ☆103Sep 2, 2021Updated 4 years ago
- Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022☆15Jan 6, 2026Updated last month
- ☆10Mar 22, 2022Updated 3 years ago
- [MM'23] ProTegO: Protect Text Content against OCR Extraction Attack☆14Mar 12, 2024Updated last year
- A small library that wraps Keras models to pickle them.☆14Jul 17, 2018Updated 7 years ago
- Pytorch implementation of MDensenet and sparse NMF. Made for my undergraduate thesis "Music Source Separation with Supervised Learning Me…☆11Jan 31, 2021Updated 5 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- An example of rendering table from MySQL in Django☆10Sep 27, 2019Updated 6 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ellipsoid method python code☆12Feb 12, 2024Updated 2 years ago
- Cyclical Curriculum Learning Code☆12Jul 29, 2023Updated 2 years ago
- A Survey of Spoken Dialogue Models (60 pages)☆316Nov 28, 2024Updated last year
- ☆16Dec 17, 2024Updated last year
- Audio samples of our paper "PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network" (accepted by ICASSP2020).☆11Apr 14, 2020Updated 5 years ago