yzyouzhang/Audio_Research_in_US

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yzyouzhang/Audio_Research_in_US)

yzyouzhang / Audio_Research_in_US

Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD, postdoc in audio research.

☆27

Alternatives and similar repositories for Audio_Research_in_US

Users that are interested in Audio_Research_in_US are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ETZET / SpeechEmotionAVLearning
View on GitHub
☆13Nov 25, 2023Updated 2 years ago
Speech-Arena / speech_df_arena
View on GitHub
☆40Feb 26, 2026Updated 5 months ago
TakHemlata / T-EER
View on GitHub
Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"
☆14Sep 25, 2023Updated 2 years ago
yzyouzhang / SASV_PR
View on GitHub
Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"
☆18Jun 24, 2022Updated 4 years ago
xjchenGit / awesome-audio-visual-deepfake
View on GitHub
awesome-audio-visual-robustness
☆11Jan 27, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yzyouzhang / Awesome-Multimedia-Deepfake-Detection
View on GitHub
Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024
☆17Aug 26, 2024Updated last year
SVDDChallenge / CtrSVDD_Utils
View on GitHub
☆18Jan 10, 2024Updated 2 years ago
IsaacYQH / WildFX
View on GitHub
Official implementation of WildFX Dataset Generating pipeline.
☆21Oct 21, 2025Updated 9 months ago
luferrer / ConfidenceIntervals
View on GitHub
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆99Apr 5, 2024Updated 2 years ago
WangHelin1997 / LibriLightMix-WHAMR
View on GitHub
Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM
☆17Nov 7, 2024Updated last year
xjchenGit / SingGraph
View on GitHub
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
☆24Sep 19, 2025Updated 10 months ago
Yaselley / deepfense-framework
View on GitHub
DeepFense: A Unified, Modular, and Extensible Framework for Robust Deepfake Audio Detection
☆27Jul 11, 2026Updated 2 weeks ago
soham97 / ADIFF
View on GitHub
Explaining audio differences using language
☆16Feb 11, 2025Updated last year
JunyiPeng00 / SLT22_MultiHead-Factorized-Attentive-Pooling
View on GitHub
An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
☆24Sep 22, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
WangHelin1997 / SSR-Speech
View on GitHub
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis
☆154Jan 1, 2025Updated last year
dl4am / tutorial
View on GitHub
Deep learning for automatic mixing
☆32Aug 29, 2024Updated last year
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
ucas-hao / qwen_audio_for_add
View on GitHub
[ACMMM2025] Official released code for ALLM4ADD
☆44Oct 30, 2025Updated 8 months ago
CameronChurchwell / combnet
View on GitHub
☆23Aug 4, 2025Updated 11 months ago
haoheliu / SemantiCodec
View on GitHub
☆45Jun 11, 2024Updated 2 years ago
nii-yamagishilab / PartialSpoof
View on GitHub
☆62Jul 15, 2024Updated 2 years ago
yuhanghe01 / RiTTA
View on GitHub
Event Relation in Text-to-Audio (TTA) Generation
☆21Feb 26, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Pliploop / GDRetriever
View on GitHub
Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…
☆19Sep 25, 2025Updated 10 months ago
zjzser / WMCodec
View on GitHub
PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or…
☆18Jul 31, 2025Updated 11 months ago
ckyang1124 / LALM-Evaluation-Survey
View on GitHub
Collection of works for evaluating (and analyzing) large audio-language models (LALMs)
☆41Aug 11, 2025Updated 11 months ago
gdalsanto / dafx25-ddsp-tutorial
View on GitHub
Companion repository of the DAFx25 tutorial "Building Flexible Audio DDSP Pipelines: A Case Study on Artificial Reverb"
☆19Nov 10, 2025Updated 8 months ago
zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
yongyizang / SingFake
View on GitHub
Official Repository for "SingFake: Singing Voice Deepfake Detection"
☆64Feb 26, 2024Updated 2 years ago
nii-yamagishilab / AntiDeepfake
View on GitHub
Project for training SSL-based deepfake speech detector
☆56Jul 9, 2026Updated 3 weeks ago
zlin0 / wedefense
View on GitHub
WeDefense: A Toolkit to Defend Against Fake Audio
☆33Feb 20, 2026Updated 5 months ago
ErosRos / conformer-based-classifier-for-anti-spoofing
View on GitHub
Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.
☆32Nov 7, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
yzyouzhang / AIR-ASVspoof
View on GitHub
Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection"
☆140Aug 30, 2024Updated last year
WangHelin1997 / SpecAugment-plus
View on GitHub
A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
☆34Jun 25, 2021Updated 5 years ago
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆35May 22, 2026Updated 2 months ago
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
ta012 / SSLAM
View on GitHub
[ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes
☆79Oct 8, 2025Updated 9 months ago
tts-tutorial / interspeech2022
View on GitHub
☆162Sep 19, 2022Updated 3 years ago
mileskuo42 / AudioMarkBench
View on GitHub
Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking
☆48Aug 23, 2024Updated last year