walkoncross/voxceleb2-download-zyf

Tools for downloading the VoxCeleb2 dataset

☆35

Alternatives and similar repositories for voxceleb2-download-zyf

Users who are interested in voxceleb2-download-zyf are comparing it to the repositories listed below.

aispeech-lab / advr-avss
PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.
☆18 · Jul 11, 2022 · Updated 4 years ago
iiscleap / DIHARD-2019-baseline
☆16 · Mar 7, 2019 · Updated 7 years ago
zcxu-eric / AVA-AVD
☆51 · Nov 24, 2022 · Updated 3 years ago
Tiago-Roxo / WASD
☆20 · Mar 20, 2026 · Updated 4 months ago
zhaoyi2 / xvector-cnceleb
Kaldi-based x-vector trained on CN-Celeb
☆13 · Sep 22, 2020 · Updated 5 years ago
TaoRuijie / Speaker-Recognition-Demo
A ResNet Speaker Recognition & Verification Demo
☆27 · Oct 19, 2021 · Updated 4 years ago
tuanchien / asd
Active Speaker Detection
☆19 · Jun 19, 2020 · Updated 6 years ago
codesavory / IMAGEimate
IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks
☆13 · Dec 9, 2021 · Updated 4 years ago
solmp / VideoMatting
Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python
☆12 · Mar 10, 2022 · Updated 4 years ago
zexupan / reentry
☆18 · Nov 22, 2024 · Updated last year
WiraDKP / pytorch_speaker_embedding_for_diarization
Using speaker embedding for diarization in PyTorch
☆17 · Aug 29, 2020 · Updated 5 years ago
Jiang-Yidi / TS-TalkNet
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
☆61 · May 29, 2023 · Updated 3 years ago
EGO4D / audio-visual
☆69 · Sep 13, 2022 · Updated 3 years ago
joonson / voxceleb_unsupervised
Augmentation adversarial training for self-supervised speaker recognition
☆77 · Aug 15, 2021 · Updated 4 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13 · Oct 2, 2020 · Updated 5 years ago
nghiahhnguyen / CS224W-Stanford
This repository contains homework solutions for the CS224W course at Stanford: Machine Learning with Graphs
☆11 · Jul 19, 2020 · Updated 6 years ago
salmedina / SpeechDrivenTongueAnimation
ML-driven tongue animation (CVPR'22)
☆51 · Mar 29, 2022 · Updated 4 years ago
zabir-nabil / awesome-speaker-recognition-verification
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competitions.
☆15 · Aug 29, 2021 · Updated 4 years ago
zexupan / MuSE
☆42 · Nov 22, 2024 · Updated last year
clovaai / lookwhostalking
Look Who’s Talking: Active Speaker Detection in the Wild
☆76 · Aug 24, 2023 · Updated 2 years ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,170 · Apr 22, 2026 · Updated 3 months ago
zzw922cn / wesinger2
Synthesized singing voice demos of WeSinger 2 paper.
☆26 · Feb 20, 2023 · Updated 3 years ago
lawlict / ECAPA-TDNN
☆106 · Sep 2, 2021 · Updated 4 years ago
simonsuthers / IBM-Separation
Python code to show basic sound separation using Ideal Binary Masks
☆13 · Oct 13, 2018 · Updated 7 years ago
daquinterop / DSSATTools_notebooks
Notebooks showing some examples of DSSATTools usage
☆15 · Apr 13, 2025 · Updated last year
KunZhou9646 / Mixed_Emotions
☆123 · Oct 24, 2022 · Updated 3 years ago
jinny960812 / SyncTalkFace
SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory
☆33 · Nov 3, 2022 · Updated 3 years ago
zhenghuatan / GMM-UBM_MAP_SV
Python code for training and testing of GMM-UBM and maximum a posteriori (MAP) adaptation-based speaker verification
☆20 · Jul 31, 2020 · Updated 5 years ago
kyungmnlee / RenyiCL
Contrastive self-supervised learning using Rényi divergence
☆14 · Oct 21, 2022 · Updated 3 years ago
liyunlongaaa / AD-TUNING
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11 · Feb 23, 2024 · Updated 2 years ago
Snishikant / CreditRating-FeatureSelection-GAW
Feature Selection for Credit Scoring using Genetic Algorithm Wrapper (Information Gain)
☆14 · Dec 1, 2018 · Updated 7 years ago
zjumml / DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆10 · Mar 8, 2022 · Updated 4 years ago
lodeguns / Solids-classification-3D-CNN-3D-GradCam
Here we introduce the problem of 3D solids classification with a CNN (spheres and octahedra). We implemented a 3D GradCam model, in orde…
☆11 · Nov 25, 2019 · Updated 6 years ago
lin9x / AV-Sepformer
☆65 · Jun 28, 2023 · Updated 3 years ago
Sanyuan-Chen / CSS_with_EETransformer
Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer
☆12 · Sep 2, 2021 · Updated 4 years ago
joaosiqueira / dark-mode-gee
Saving nerds' eyes.
☆16 · Jan 15, 2022 · Updated 4 years ago
xuchenglin28 / target_speaker_verification
Target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech
☆15 · Jan 26, 2021 · Updated 5 years ago
linchaobao / hifi3dface
Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".
☆19 · Dec 30, 2024 · Updated last year
adesgautam / clip-search
A search engine implementation using OpenAI's CLIP model
☆10 · Jun 20, 2021 · Updated 5 years ago