Purdue-M2 / AI-Synthesized-Voice-GeneralizationView external linksLinks
This repository is the official implementation of our paper "Improving Generalization for AI-Synthesized Voice Detection", which has been accepted by AAAI 2025.
☆22Jan 13, 2026Updated last month
Alternatives and similar repositories for AI-Synthesized-Voice-Generalization
Users that are interested in AI-Synthesized-Voice-Generalization are comparing it to the libraries listed below
Sorting:
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 4 months ago
- The pytorch implementation of BAM for Partialspoof Audio Localization.☆28Aug 16, 2024Updated last year
- Synthesis speech detection based on Breathing-Talking-Silence sounds☆21Sep 3, 2025Updated 5 months ago
- ☆103Nov 14, 2025Updated 3 months ago
- ☆32Dec 24, 2025Updated last month
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Updated this week
- ☆10Dec 22, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- Baselines for IS25 Source Tracing Special Session☆33Jan 3, 2025Updated last year
- This is the pytorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S…☆20Nov 2, 2024Updated last year
- A fully and partially fake speech dataset for evaluation☆13Nov 11, 2025Updated 3 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 10 months ago
- Region-Based Optimization in Continual Learning for Audio Deepfake Detection☆12Dec 17, 2024Updated last year
- A list of tools, papers and code related to Fake Audio Detection.☆222Dec 10, 2025Updated 2 months ago
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization☆36Dec 20, 2024Updated last year
- This repository is related to our Dataset and Detection code from the paper: AI-Synthesized Voice Detection Using Neural Vocoder Artifact…☆142Jun 12, 2025Updated 8 months ago
- ☆24Mar 29, 2025Updated 10 months ago
- ☆36Oct 15, 2024Updated last year
- This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”☆26May 21, 2025Updated 8 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆22Aug 13, 2025Updated 6 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Jun 5, 2023Updated 2 years ago
- ☆14Jul 24, 2025Updated 6 months ago
- ☆19Jan 8, 2025Updated last year
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆37Jul 24, 2023Updated 2 years ago
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆66Dec 13, 2024Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- [INTERSPEECH'24] Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detection☆54Dec 4, 2024Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆25Nov 7, 2023Updated 2 years ago
- [T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations☆30Jul 31, 2024Updated last year
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Aug 24, 2023Updated 2 years ago
- ☆30Oct 29, 2024Updated last year
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆24Sep 22, 2024Updated last year