Tools for downloading VoxCeleb2 dataset
☆33Mar 16, 2024Updated 2 years ago
Alternatives and similar repositories for voxceleb2-download-zyf
Users that are interested in voxceleb2-download-zyf are comparing it to the libraries listed below.
- PyTorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆18Jul 11, 2022Updated 3 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆16Mar 7, 2019Updated 7 years ago
- Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- ☆50Nov 24, 2022Updated 3 years ago
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- Active Speaker Detection☆19Jun 19, 2020Updated 5 years ago
- IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks☆13Dec 9, 2021Updated 4 years ago
- ☆20Mar 20, 2026Updated last week
- kaldi based x-vector trained on Cn-Celeb☆13Sep 22, 2020Updated 5 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 4 years ago
- ☆18Nov 22, 2024Updated last year
- Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".☆20Dec 30, 2024Updated last year
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- ☆67Sep 13, 2022Updated 3 years ago
- A curated list of awesome speaker recognition/verification papers, projects, datasets, and competition.☆15Aug 29, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated 2 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Aug 24, 2023Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 3 years ago
- Python code to show basic sound separation using Ideal Binary Masks☆13Oct 13, 2018Updated 7 years ago
- ☆122Oct 24, 2022Updated 3 years ago
- ☆105Sep 2, 2021Updated 4 years ago
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech☆15Jan 26, 2021Updated 5 years ago
- Model See Model Do: Speech-Driven Facial Animation with Style Control☆20May 6, 2025Updated 10 months ago
- Python code for training and testing of GMM-UBM and maximum a posteriori (MAP) adaptation based speaker verification☆20Jul 31, 2020Updated 5 years ago
- A repository to mass generate deepfake video based on DeepFaceLab repository.☆10Aug 10, 2023Updated 2 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code☆10Mar 8, 2022Updated 4 years ago
- ☆64Jun 28, 2023Updated 2 years ago
- Code for the ICASSP-2021 paper: Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer☆12Sep 2, 2021Updated 4 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- Code repository for Blackbox Attacks via Surrogate Ensemble Search (BASES), NeurIPS 2022☆13Aug 6, 2024Updated last year
- Notebooks showing some examples of DSSATTools usage☆14Apr 13, 2025Updated 11 months ago
- ☆16Feb 19, 2026Updated last month
- Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in pytorch☆14Mar 23, 2026Updated last week
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆21Apr 1, 2022Updated 3 years ago