CSLT-THU/IS2019-VAE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CSLT-THU/IS2019-VAE)

CSLT-THU / IS2019-VAE

Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"

☆11

Alternatives and similar repositories for IS2019-VAE

Users that are interested in IS2019-VAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yucc2018 / machine-learning-yearning
View on GitHub
Translation and draft of Machine Learning Yearning for chapter 1-22.该书1-22章的翻译及原稿。
☆10Aug 1, 2018Updated 7 years ago
Deepest-Project / FastSpeech
View on GitHub
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆54Feb 26, 2020Updated 6 years ago
lmingde / speech-emotion-recognition-exercise
View on GitHub
2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告
☆33Dec 28, 2018Updated 7 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
vimalmanohar / kaldi
View on GitHub
Fork of the official kaldi.
☆22Mar 22, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BiometricVox / DAE_SpeakerID
View on GitHub
Denoising autoencoders for speaker identification on MCE 2018 challenge
☆12Nov 8, 2018Updated 7 years ago
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
AI-Guru / SincNet
View on GitHub
Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)
☆12Aug 5, 2018Updated 7 years ago
xushengyuan / VocalnetOpenDataset
View on GitHub
一个开源的中文歌声合成数据集。An open-source Chinese singing synthesizing dataset.
☆24Jul 13, 2019Updated 7 years ago
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
nperraud / gan_audio_inpainting
View on GitHub
☆29May 4, 2020Updated 6 years ago
nobody996 / FastSVC
View on GitHub
Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"
☆21Apr 7, 2021Updated 5 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
zengchang233 / MTGAN
View on GitHub
MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks
☆19Feb 29, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
thefirebanks / Ensemble-Learning-for-Tweet-Classification-of-Hate-Speech-and-Offensive-Language
View on GitHub
Contains code for a voting classifier that is part of an ensemble learning model for tweet classification (which includes an LSTM, a baye…
☆23May 8, 2018Updated 8 years ago
robin1001 / vad
View on GitHub
simple energy vad
☆19Jun 3, 2017Updated 9 years ago
groadabike / Kaldi-Dsing-task
View on GitHub
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Jul 9, 2026Updated 2 weeks ago
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
kan-bayashi / WaveNetVocoderSamples
View on GitHub
WaveNet Vocoder Samples
☆23Aug 23, 2019Updated 6 years ago
mmorise / kiritan_singing
View on GitHub
東北きりたん歌唱データベースの最新ラベルデータ
☆148May 1, 2021Updated 5 years ago
sarulab-speech / lightweight_spkr_anon
View on GitHub
Lightweight speaker anonymization [IEEE SLT2021]
☆27Jun 6, 2022Updated 4 years ago
gdebayan / Diarization_BIC
View on GitHub
Speaker Diarization library in Python. Performs VAD, Segmentation, Linear Clustering, Hierarchical Clustering
☆15Jul 28, 2017Updated 9 years ago
yerfor / SyntaSpeech
View on GitHub
SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code
☆201Sep 4, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
bioidiap / bob.bio.spear
View on GitHub
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Jun 24, 2023Updated 3 years ago
k2-fsa / kaldifst
View on GitHub
Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files
☆56Apr 9, 2026Updated 3 months ago
alibugra / audio-data-augmentation
View on GitHub
Audio data augmentation examples
☆34May 27, 2018Updated 8 years ago
sunshibao / go-jdmt
View on GitHub
GoLang 版本的京东茅台脚本
☆17Jan 25, 2021Updated 5 years ago
RemiRigal / snreval-python
View on GitHub
This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…
☆12Jun 22, 2022Updated 4 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
marytts / pavoque-data
View on GitHub
PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
orbxball / icassp2019-latex-template
View on GitHub
ICASSP 2019 official Latex template
☆23May 11, 2021Updated 5 years ago
wenet-e2e / WeTextProcessing.deprecated
View on GitHub
☆61Jan 31, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bsxfan / meta-embeddings
View on GitHub
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated 2 years ago
cnlinxi / LLM-paper-daily
View on GitHub
Automatically Update LLM Papers Daily using Github Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily
☆10Updated this week
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
One-Shot-Voice-Conversion-with-WIN / WINVC
View on GitHub
Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".
☆30Nov 13, 2021Updated 4 years ago
iiscleap / NeuralPlda
View on GitHub
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99Apr 20, 2020Updated 6 years ago
dyne / AutOrg
View on GitHub
Autonomy is Organization
☆17Sep 16, 2015Updated 10 years ago