StelaBou/voxceleb_preprocessing

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/StelaBou/voxceleb_preprocessing)

StelaBou / voxceleb_preprocessing

Download and preprocess voxceleb datasets.

☆41

Alternatives and similar repositories for voxceleb_preprocessing

Users that are interested in voxceleb_preprocessing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KingJamesSong / HouseholderGAN
View on GitHub
ICCV23 "Householder Projector for Unsupervised Latent Semantics Discovery"
☆17Jul 7, 2026Updated 3 weeks ago
ClaudiaShu / SSL-FER
View on GitHub
[BMVC 2022] This is the official code of our Paper "Revisiting Self-Supervised Contrastive Learning for Facial Expression Recognition"
☆24Jul 8, 2024Updated 2 years ago
aqibahmad / speech2face
View on GitHub
A PyTorch implementation of MIT CSAIL's Speech2Face research paper from IEEE CVPR 2019
☆12Mar 25, 2023Updated 3 years ago
gzoumpourlis / Ensemble-MI
View on GitHub
PyTorch code for "Motor Imagery Decoding Using Ensemble Curriculum Learning and Collaborative Training"
☆22Feb 14, 2024Updated 2 years ago
my-yy / sl_icmr2022
View on GitHub
Code for "Self-Lifting: A Novel Framework For Unsupervised Voice-Face Association Learning,ICMR,2022"
☆15Oct 25, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
negrinho / research_toolbox
View on GitHub
Utilities to help manage a machine learning experimental workflow
☆20Jul 31, 2021Updated 4 years ago
msaadsaeed / SBNet
View on GitHub
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
☆13Aug 28, 2023Updated 2 years ago
ogunlao / glowtts_stdp
View on GitHub
Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor
☆19Jun 5, 2023Updated 3 years ago
my-yy / vfal_papers
View on GitHub
Voice Face Association Learning Paper List
☆17May 20, 2023Updated 3 years ago
my-yy / vfal-eva
View on GitHub
Voice-Face Association Learning Evaluation
☆49Feb 13, 2024Updated 2 years ago
anjieyang / VFHQ-downloader
View on GitHub
VFHQ-downloader is a Python-based utility designed for the easy downloading and processing of videos from the VFHQ dataset.
☆28Apr 15, 2024Updated 2 years ago
Cocoxili / CMPC
View on GitHub
[IJCAI2022] Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
☆21Oct 25, 2023Updated 2 years ago
chi0tzp / ContraCLIP
View on GitHub
Authors official PyTorch implementation of the "ContraCLIP: Interpretable GAN generation driven by pairs of contrasting sentences".
☆42Oct 1, 2022Updated 3 years ago
msaadsaeed / FOP
View on GitHub
Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"
☆23Dec 31, 2025Updated 6 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
james-oldfield / PandA
View on GitHub
[ICLR'23] Code to reproduce the results in the paper "PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs"
☆58Jun 8, 2023Updated 3 years ago
YUCHEN005 / RATS-Channel-A-Speech-Data
View on GitHub
This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…
☆16Oct 22, 2022Updated 3 years ago
YapengTian / CCOL-CVPR21
View on GitHub
Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation
☆26Nov 24, 2021Updated 4 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
JoeHEZHAO / Spatiotemporal-Residual-Propagation
View on GitHub
Code release for ICCV 2019 paper "Spatiotemporal Feature Residual Propagation for Action Prediction"
☆14Sep 20, 2021Updated 4 years ago
AliaksandrSiarohin / video-preprocessing
View on GitHub
☆542Dec 8, 2022Updated 3 years ago
ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
light-and-ray / sd-webui-topaz-photo-ai-integration
View on GitHub
Topaz Photo AI upscaler inside sd-webui
☆12Jul 5, 2024Updated 2 years ago
mmagnuski / borsar
View on GitHub
Various tools for EEG/MEG data analysis.
☆10Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
StelaBou / StyleMask
View on GitHub
Authors official PyTorch implementation of the "StyleMask: Disentangling the Style Space of StyleGAN2 for Neural Face Reenactment" [FG 20…
☆114Aug 10, 2023Updated 2 years ago
ndb796 / Face-Gender-Classification-PyTorch
View on GitHub
Face Gender Classification Tutorial: PyTorch Implementations
☆12Mar 2, 2021Updated 5 years ago
zkzhou126 / AI-for-Research
View on GitHub
From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems
☆20Jun 29, 2026Updated 3 weeks ago
Liu-Feng-deeplearning / CoverHunter
View on GitHub
Official PyTorch implementation of CoverHunter
☆43Nov 21, 2024Updated last year
gzoumpourlis / DEAP_MNE_preprocessing
View on GitHub
Scripts to a) download DEAP EEG dataset b) preprocess its EEG signals and c) perform feature extraction
☆96May 26, 2022Updated 4 years ago
mbzuai-metaverse / VOODOO3D-official
View on GitHub
Official implementation for the paper "VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment"
☆167Jun 11, 2024Updated 2 years ago
Jenine-321 / GenFace
View on GitHub
☆10Jan 13, 2026Updated 6 months ago
AKiessner / TUHAbnormal-Expansion-dataset
View on GitHub
☆14Sep 5, 2023Updated 2 years ago
alakise / Audio-Spectrogram
View on GitHub
Generating sound spectrograms using short-time Fourier transform that can be used for purposes such as sound classification by machine le…
☆37May 9, 2021Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
billpsomas / efficient-probing
View on GitHub
[ICLR 2026] - Official implementation of "Attention, Please! Revisiting Attentive Probing Through the Lens of Efficiency"
☆33Feb 23, 2026Updated 5 months ago
wangyanckxx / FERV39k
View on GitHub
☆67Sep 26, 2022Updated 3 years ago
Vincent-ZHQ / Comprehensive-Long-Video-Understanding-Survey
View on GitHub
A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…
☆23Sep 12, 2025Updated 10 months ago
ZhengyiLuo / SMPL_Renderer
View on GitHub
Rendering SMPL using neural-mesh-render!!
☆12Aug 6, 2020Updated 5 years ago
MichiganCOG / video-frame-inpainting
View on GitHub
Code for "A Temporally-Aware Interpolation Network for Video Frame Inpainting"
☆10Jul 22, 2023Updated 3 years ago
berndporr / deepNeuronalFilter
View on GitHub
Deep Neuronal Filter (DNF): A closed-loop filter to remove noise from signals with the help of a noise reference signal.
☆14Nov 26, 2025Updated 8 months ago
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year