This repo lists the reference papers of "Speaker Recognition Based on Deep Learning: An Overview"
☆41Jun 26, 2021Updated 4 years ago
Alternatives and similar repositories for Speaker-Recognition-Based-on-Deep-Learning-An-Overview
Users that are interested in Speaker-Recognition-Based-on-Deep-Learning-An-Overview are comparing it to the libraries listed below
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- An implementation of adaptive score normalization (AS-Norm) for speaker verification/recognition in PyTorch☆10Oct 12, 2022Updated 3 years ago
- Using the Kaldi x-vector method to train a speaker recognition model on the AISHELL database.☆17Aug 19, 2021Updated 4 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆66Feb 16, 2022Updated 4 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- Python code for training and testing GMM-UBM and maximum a posteriori (MAP) adaptation-based speaker verification☆20Jul 31, 2020Updated 5 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- Synthetic speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 6 months ago
- The dataset of Speech Recognition☆453Jan 4, 2026Updated 2 months ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when trained only on Vox2)☆791Apr 11, 2024Updated last year
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure Python and PyTorch☆26Jul 25, 2024Updated last year
- Incorporate Image, Text and Tabular Data with HuggingFace Transformers☆12Mar 1, 2022Updated 4 years ago
- Multimodal data loader compatible with PyTorch and TensorFlow☆12Aug 14, 2024Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- In defence of metric learning for speaker recognition☆1,165Mar 26, 2024Updated last year
- Speaker feature (voiceprint) extraction tool based on the VGG-SR pretrained model.☆39Mar 7, 2020Updated 6 years ago
- Voice Music Separation competing for 6th Huawei Cup in ZJU☆11Jun 2, 2015Updated 10 years ago
- ☆10Jun 2, 2021Updated 4 years ago
- ☆11Oct 24, 2022Updated 3 years ago
- GAN-BASED DATA AUGMENTATION for RAMAN SPECTRA☆10Jul 11, 2025Updated 7 months ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- A web based command line interface in a Docker container, based on ttyd.☆11Mar 15, 2021Updated 4 years ago
- Experiments with GAN, WGAN, WGAN-GP, DC-GAN, cGAN, AC-GAN and pix2pix☆10May 28, 2019Updated 6 years ago
- ☆105Sep 2, 2021Updated 4 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- An example of rendering a table from MySQL in Django☆10Sep 27, 2019Updated 6 years ago
- Cyclical Curriculum Learning Code☆12Jul 29, 2023Updated 2 years ago
- ellipsoid method python code☆12Feb 12, 2024Updated 2 years ago
- ☆10Mar 22, 2022Updated 3 years ago
- A small library that wraps Keras models to pickle them.☆14Jul 17, 2018Updated 7 years ago
- Prosody Predict☆10Jan 4, 2021Updated 5 years ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022☆15Jan 6, 2026Updated 2 months ago
- [MM'23] ProTegO: Protect Text Content against OCR Extraction Attack☆14Mar 12, 2024Updated last year
- A Survey of Spoken Dialogue Models (60 pages)☆315Nov 28, 2024Updated last year
- Complementary code for our paper Automatic punctuation restoration with BERT models☆50Nov 6, 2023Updated 2 years ago
- This is a TensorFlow implementation of semi-supervised learning with the following methods: Pseudo-label, Pi_model, VAT, mean_teacher, M…☆12Jul 23, 2020Updated 5 years ago
- ☆16Apr 27, 2025Updated 10 months ago
- Implementation of the paper "Emotion Identification from raw speech signals using DNNs"☆14Jun 11, 2020Updated 5 years ago