CSLT-THU / IS2019-VAEView external linksLinks
Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 2 years ago
Alternatives and similar repositories for IS2019-VAE
Users that are interested in IS2019-VAE are comparing it to the libraries listed below
Sorting:
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆54Feb 26, 2020Updated 5 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Jul 28, 2017Updated 8 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Sep 4, 2019Updated 6 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks☆19Feb 29, 2020Updated 5 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Jun 24, 2023Updated 2 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 5 months ago
- University of Edinbrugh-Johns Hopkins University's system for ASVspoof 2017 Version 2.0 dataset.☆50May 1, 2019Updated 6 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Apr 7, 2021Updated 4 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 5 years ago
- 東北きりたん歌唱データベースの最新ラベルデータ☆148May 1, 2021Updated 4 years ago
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆24Jul 13, 2019Updated 6 years ago
- Deep multi-metric learning for text-independent speaker verification☆24Dec 6, 2019Updated 6 years ago
- Fork of the official kaldi.☆22Mar 22, 2022Updated 3 years ago
- ☆29May 4, 2020Updated 5 years ago
- ICASSP 2019 official Latex template☆23May 11, 2021Updated 4 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆102Apr 15, 2017Updated 8 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- Jdit is a research processing oriented framework based on pytorch. The docs is here!☆30Apr 27, 2021Updated 4 years ago
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Python implementation of simple GMM and HMM models for isolated digit recognition.☆67Feb 7, 2021Updated 5 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Sep 4, 2022Updated 3 years ago
- Unofficial implementation of ECAPA-TDNN☆30Feb 28, 2021Updated 4 years ago
- Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".☆30Nov 13, 2021Updated 4 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告☆33Dec 28, 2018Updated 7 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- PyTorch implementation of Tacotron and Tacotron2☆34Jul 19, 2022Updated 3 years ago
- Code for paper "Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion"☆36Jan 15, 2020Updated 6 years ago
- Simple speech recognition using dynamic time warping with examples☆29Mar 3, 2020Updated 5 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆67Sep 9, 2019Updated 6 years ago