cyrta / voxcelebView external linksLinks
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
☆74Jul 5, 2019Updated 6 years ago
Alternatives and similar repositories for voxceleb
Users that are interested in voxceleb are comparing it to the libraries listed below
Sorting:
- VoxCeleb plugin for pyannote.database☆30Aug 4, 2021Updated 4 years ago
- Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments☆43Mar 29, 2021Updated 4 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition☆12Dec 8, 2015Updated 10 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets☆399Feb 4, 2019Updated 7 years ago
- Interspeech 2019 tutorial materials☆49Sep 26, 2019Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- Speaker identification with VGGVox network☆84Nov 30, 2018Updated 7 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Jan 27, 2020Updated 6 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.☆22Mar 4, 2020Updated 5 years ago
- ☆11Sep 16, 2014Updated 11 years ago
- Model drift detection☆11Jul 22, 2023Updated 2 years ago
- ☆10Apr 8, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- PyTorch implementation of Retriever: Learning Content-Style Representation☆12Jan 27, 2023Updated 3 years ago
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆24Nov 8, 2021Updated 4 years ago
- Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)☆100Apr 20, 2020Updated 5 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆439Aug 12, 2025Updated 6 months ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆43Jul 17, 2020Updated 5 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 2 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆102Apr 15, 2017Updated 8 years ago
- Voice emotion conversion model for DS/ML master's thesis. F0 contour mapping in sequence-to-sequence RNN-LSTM architecture in Tensorflow.☆27Oct 30, 2018Updated 7 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- A basic Python implementation of the Auto-Tune patent. Uses auto-correlation for pitch detection and resampling for pitch correction.☆13Aug 8, 2021Updated 4 years ago
- ☆14Apr 8, 2025Updated 10 months ago
- visual-text to speech☆14Apr 3, 2022Updated 3 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.☆49Jun 12, 2017Updated 8 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- Unofficial implementation of ECAPA-TDNN☆30Feb 28, 2021Updated 4 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- Code & demo for the animation of still facial landmarks from an initial pose.☆15Jan 19, 2023Updated 3 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Sep 5, 2017Updated 8 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago