walkoncross / voxceleb2-download-zyfView external linksLinks
Tools for downloading VoxCeleb2 dataset
☆33Mar 16, 2024Updated last year
Alternatives and similar repositories for voxceleb2-download-zyf
Users that are interested in voxceleb2-download-zyf are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of our paper: Audio-Visual Speech Separation with Visual Features Enhanced by Adversarial Training.☆18Jul 11, 2022Updated 3 years ago
- Implementation of SoundtStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"☆13Jan 27, 2025Updated last year
- ☆49Nov 24, 2022Updated 3 years ago
- Windows 💻 RobustVideoMatting with ONNXRuntime/MNN/TNN C++/Python☆12Mar 10, 2022Updated 3 years ago
- IMAGEimate is an end-to-end pipeline to create realistic animatable 3D avatars from a single image using neural networks☆13Dec 9, 2021Updated 4 years ago
- Voice Face Association Learning Paper List☆17May 20, 2023Updated 2 years ago
- ☆18Nov 22, 2024Updated last year
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- ☆20Dec 29, 2024Updated last year
- ☆121Oct 24, 2022Updated 3 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- Look Who’s Talking: Active Speaker Detection in the Wild☆76Aug 24, 2023Updated 2 years ago
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization☆36Dec 20, 2024Updated last year
- This project was built during the competition of Smart India Hackathon 2020. In This I am using a Android device's Camera to detect Garba…☆11Apr 5, 2023Updated 2 years ago
- ☆67Sep 13, 2022Updated 3 years ago
- In defence of metric learning for speaker recognition☆1,161Mar 26, 2024Updated last year
- ☆12Sep 25, 2023Updated 2 years ago
- A Foundation Model for Industrial Signal Comprehensive Representation☆57Updated this week
- SyncTalkFace: Talking Face Generation for Precise Lip-syncing via Audio-Lip Memory☆33Nov 3, 2022Updated 3 years ago
- Virtual news production using Tacotron2 and Wav2Lip☆11Nov 14, 2023Updated 2 years ago
- A pytorch implementation of D3Net.☆11Aug 8, 2021Updated 4 years ago
- [INTERSPEECH 2024] The official implementation of EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for …☆171May 20, 2025Updated 8 months ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆87Dec 20, 2022Updated 3 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- 干中学|| build_mcp_from_scratch☆24Oct 15, 2025Updated 4 months ago
- Amazon S3 tokenizer☆10Updated this week
- Fast Fourier Transform (FFT) implementation for Unity☆10Aug 15, 2017Updated 8 years ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- VexFS is a Linux kernel-native file system with built-in vector search and semantic memory. Designed for AI agents, RAG, and LLM workload…☆24Oct 19, 2025Updated 3 months ago
- Information geometry and its extension information topology☆11Dec 2, 2017Updated 8 years ago
- Where is the "main theme" in an orchestral score?☆12Oct 25, 2025Updated 3 months ago
- Official Implementation for CVPR 2025 paper Instant Adversarial Purification with Adversarial Consistency Distillation.☆14Dec 19, 2025Updated last month
- ☆12Nov 12, 2024Updated last year
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated 3 weeks ago
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago