Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11Mar 24, 2023Updated 3 years ago
Alternatives and similar repositories for IS2019-VAE
Users that are interested in IS2019-VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Translation and draft of Machine Learning Yearning for chapter 1-22.该书1-22章的翻译及原稿。☆10Aug 1, 2018Updated 7 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆54Feb 26, 2020Updated 6 years ago
- 2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告☆33Dec 28, 2018Updated 7 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Fork of the official kaldi.☆22Mar 22, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- ☆18Aug 9, 2018Updated 7 years ago
- 一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.☆24Jul 13, 2019Updated 6 years ago
- ☆29May 4, 2020Updated 6 years ago
- MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks☆19Feb 29, 2020Updated 6 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Apr 7, 2021Updated 5 years ago
- Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a baye…☆23May 8, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering☆15Jul 28, 2017Updated 8 years ago
- 東北きりたん歌唱データベースの最新ラベルデータ☆148May 1, 2021Updated 5 years ago
- Lightweight speaker anonymization [IEEE SLT2021]☆27Jun 6, 2022Updated 3 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆201Sep 4, 2022Updated 3 years ago
- Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear☆19Jun 24, 2023Updated 2 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆57Apr 9, 2026Updated 3 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Audio data augmentation examples☆34May 27, 2018Updated 7 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- GoLang 版本的京东茅台脚本☆17Jan 25, 2021Updated 5 years ago
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆44Dec 17, 2020Updated 5 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- ICASSP 2019 official Latex template☆23May 11, 2021Updated 4 years ago
- Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily☆10Updated this week
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago
- Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".☆30Nov 13, 2021Updated 4 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Autonomy is Organization☆17Sep 16, 2015Updated 10 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆102Apr 15, 2017Updated 9 years ago
- Unofficial implementation of ECAPA-TDNN☆30Feb 28, 2021Updated 5 years ago