cyrta/voxceleb

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/cyrta/voxceleb)

cyrta / voxceleb

mirror of VoxCeleb dataset - a large-scale speaker identification dataset

☆77

Alternatives and similar repositories for voxceleb

Users that are interested in voxceleb are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

maxhollmann / voxceleb-luigi
View on GitHub
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
☆43Mar 29, 2021Updated 5 years ago
pyannote / pyannote-db-voxceleb
View on GitHub
VoxCeleb plugin for pyannote.database
☆30Aug 4, 2021Updated 4 years ago
a-nagrani / VGGVox
View on GitHub
VGGVox models for Speaker Identification and Verification trained on the VoxCeleb (1 & 2) datasets
☆401Feb 4, 2019Updated 7 years ago
aishoot / Concurrent_Speakers_Counter
View on GitHub
Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.
☆23Mar 4, 2020Updated 6 years ago
rakshithShetty / dnn-speech
View on GitHub
This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12Dec 8, 2015Updated 10 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
clovaai / voxceleb_trainer
View on GitHub
In defence of metric learning for speaker recognition
☆1,170Apr 22, 2026Updated 2 months ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
linhdvu14 / vggvox-speaker-identification
View on GitHub
Speaker identification with VGGVox network
☆84Nov 30, 2018Updated 7 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
View on GitHub
Interspeech 2019 tutorial materials
☆49Sep 26, 2019Updated 6 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
v-iashin / VoxCeleb
View on GitHub
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
☆12Dec 11, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bioidiap / bob.bio.spear
View on GitHub
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Jun 24, 2023Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
leichtrhino / ChimeraNet
View on GitHub
Unofficial implementation of music separation model by Luo et.al.
☆13Nov 3, 2019Updated 6 years ago
cvqluu / dropclass_speaker
View on GitHub
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
☆22Oct 29, 2020Updated 5 years ago
LouisBearing / UnconditionalHeadMotion
View on GitHub
Code & demo for the animation of still facial landmarks from an initial pose.
☆15Jan 19, 2023Updated 3 years ago
kamo-naoyuki / pytorch_complex
View on GitHub
A temporal module for PyTorch-ComplexTensor
☆44Jun 28, 2024Updated 2 years ago
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
joonson / voxceleb_unsupervised
View on GitHub
Augmentation adversarial training for self-supervised speaker recognition
☆77Aug 15, 2021Updated 4 years ago
jonepatr / lets_face_it
View on GitHub
This is the official implementation for IVA'20 Best Paper Award paper "Let's Face It: Probabilistic Multi-modal Interlocutor-aware Gener…
☆17Feb 14, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
RicherMans / PLDA
View on GitHub
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
☆102Apr 15, 2017Updated 9 years ago
sanjayss34 / lm-listener
View on GitHub
Implementation for the paper "Can Language Models Learn to Listen?"
☆71Jun 4, 2026Updated last month
cschaefer26 / StyleMelGAN
View on GitHub
☆10Apr 8, 2024Updated 2 years ago
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
manojpamk / pytorch_xvectors
View on GitHub
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321Nov 11, 2020Updated 5 years ago
schufo / plla-tisvs
View on GitHub
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
☆24Nov 8, 2021Updated 4 years ago
swshon / voxceleb-ivector
View on GitHub
Voxceleb1 i-vector based speaker recognition system
☆43May 22, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
iiscleap / NeuralPlda
View on GitHub
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99Apr 20, 2020Updated 6 years ago
qqueing / DeepSpeaker-pytorch
View on GitHub
Speaker embedding(verification and recognition) using Pytorch
☆369Jul 24, 2020Updated 5 years ago
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
dr-pato / audio_visual_speech_enhancement
View on GitHub
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
☆112Mar 19, 2024Updated 2 years ago
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16Jun 30, 2023Updated 3 years ago
burhanahmed1 / TaskSphere
View on GitHub
Integrated .NET-based desktop framework for dynamic task lifecycle management, featuring relational database connectivity, status trackin…
☆12May 7, 2025Updated last year
funcwj / ge2e-speaker-verification
View on GitHub
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103Mar 18, 2019Updated 7 years ago