maxhollmann/voxceleb-luigi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/maxhollmann/voxceleb-luigi)

maxhollmann / voxceleb-luigi

Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments

☆43

Alternatives and similar repositories for voxceleb-luigi

Users that are interested in voxceleb-luigi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
cyrta / voxceleb
View on GitHub
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
☆77Jul 5, 2019Updated 7 years ago
changil / avspeech-downloader
View on GitHub
AVSpeech downloader
☆69Jan 30, 2019Updated 7 years ago
juanmc2005 / torch-plda
View on GitHub
PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf
☆15Oct 16, 2020Updated 5 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
Popgun-Labs / SincNetConv
View on GitHub
A PyTorch 1.0 implementation of the convolutions described in SincNet
☆33Jan 30, 2019Updated 7 years ago
Archivoice / nnsvs-chinese-support
View on GitHub
Hed and supporting files for Chinese NNSVS Dataset Creation
☆13Oct 14, 2025Updated 9 months ago
pyannote / pyannote-db-voxceleb
View on GitHub
VoxCeleb plugin for pyannote.database
☆30Aug 4, 2021Updated 4 years ago
matln / voxceleb_triplet-loss
View on GitHub
A Pytorch implementation of triplet loss on VoxCeleb1
☆12Oct 16, 2019Updated 6 years ago
yinruiqing / diarization_with_neural_approach
View on GitHub
☆14Aug 9, 2018Updated 7 years ago
swshon / voxceleb-ivector
View on GitHub
Voxceleb1 i-vector based speaker recognition system
☆43May 22, 2018Updated 8 years ago
markdrayton / wozzwo
View on GitHub
Parses whatsonzwift.com workout pages to produce `zwo` files for use in Zwift.
☆11Jun 19, 2022Updated 4 years ago
feerci / feerci
View on GitHub
FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates
☆12Mar 13, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
t13m / kaldi-readers-for-tensorflow
View on GitHub
readers that enable reading kaldi ark in tensorflow
☆17Mar 7, 2018Updated 8 years ago
laboroai / TEDxJP-10K
View on GitHub
☆26Jan 14, 2021Updated 5 years ago
bookbot-hive / k2-indonesian-asr
View on GitHub
Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).
☆16Jun 30, 2023Updated 3 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
rishikksh20 / Avocodo-pytorch
View on GitHub
Avocodo: Generative Adversarial Network for Artifact-free Vocoder
☆122Jul 14, 2022Updated 4 years ago
bsxfan / PSDA
View on GitHub
Probabilistic Spherical Discriminant Analysis
☆12Oct 29, 2022Updated 3 years ago
xinjli / phonepiece
View on GitHub
phone inventory library
☆17May 15, 2023Updated 3 years ago
Idlak / Living-Audio-Dataset
View on GitHub
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43Aug 3, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bayartsogt-ya / whisper-multiple-hf-datasets
View on GitHub
Whisper fine-tuning event script to use multiple hf datasets
☆32Dec 20, 2022Updated 3 years ago
eloimoliner / audio-inpainting-diffusion
View on GitHub
☆74Apr 4, 2024Updated 2 years ago
xinjli / alqalign
View on GitHub
multilingual speech aligner
☆78Nov 19, 2023Updated 2 years ago
nipponjo / arabic-vocalization
View on GitHub
Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch
☆15May 30, 2023Updated 3 years ago
desh2608 / pytorch-tdnn
View on GitHub
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Dec 18, 2020Updated 5 years ago
resemble-ai / normalise
View on GitHub
A module for normalising text.
☆10Nov 6, 2019Updated 6 years ago
google-research-datasets / WikipediaHomographData
View on GitHub
Labeled data for homograph disambiguation
☆62Jun 1, 2023Updated 3 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
basaldella / bioreddit
View on GitHub
Word embeddings trained on medical subreddits.
☆10Jan 4, 2021Updated 5 years ago
bsxfan / meta-embeddings
View on GitHub
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆23Nov 23, 2018Updated 7 years ago
bsxfan / PYLLR
View on GitHub
Python toolkit for likelihood-ratio calibration of binary classifiers
☆25Feb 21, 2023Updated 3 years ago
bioidiap / bob.bio.spear
View on GitHub
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Jun 24, 2023Updated 3 years ago
talonvoice / speech
View on GitHub
speech engine training projects
☆29Apr 19, 2021Updated 5 years ago
Andong-Li-speech / TaEr
View on GitHub
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14Nov 25, 2022Updated 3 years ago
RicherMans / PLDA
View on GitHub
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
☆102Apr 15, 2017Updated 9 years ago