yzyouzhang / Audio_Research_in_US
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD, or postdoc positions in audio research.
☆27 · Nov 13, 2025 · Updated 3 months ago
Alternatives and similar repositories for Audio_Research_in_US
Users who are interested in Audio_Research_in_US are comparing it to the repositories listed below
- ☆32 · Dec 24, 2025 · Updated last month
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification" ☆18 · Jun 24, 2022 · Updated 3 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech ☆11 · May 14, 2025 · Updated 9 months ago
- PyTorch Implementation of [WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification](https://arxiv.or… ☆16 · Jul 31, 2025 · Updated 6 months ago
- ☆13 · Nov 25, 2023 · Updated 2 years ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification ☆24 · Sep 22, 2024 · Updated last year
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆57 · Oct 8, 2025 · Updated 4 months ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM ☆17 · Nov 7, 2024 · Updated last year
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators" ☆14 · Sep 25, 2023 · Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆96 · Apr 5, 2024 · Updated last year
- Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024 ☆17 · Aug 26, 2024 · Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment ☆43 · May 15, 2025 · Updated 9 months ago
- ☆18 · Jan 10, 2024 · Updated 2 years ago
- Event Relation in Text-to-Audio (TTA) Generation ☆20 · Feb 26, 2025 · Updated 11 months ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 · Jun 14, 2021 · Updated 4 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆33 · Sep 9, 2025 · Updated 5 months ago
- FreeCodec: A Disentangled Neural Speech Codec with Fewer Tokens ☆24 · Sep 9, 2024 · Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025. ☆47 · Apr 14, 2025 · Updated 10 months ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification" ☆34 · Jun 25, 2021 · Updated 4 years ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation". ☆23 · Sep 27, 2025 · Updated 4 months ago
- Official Repository for "SingFake: Singing Voice Deepfake Detection" ☆63 · Feb 26, 2024 · Updated last year
- [T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations ☆30 · Jul 31, 2024 · Updated last year
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing", published at Interspeech 2023. ☆25 · Nov 7, 2023 · Updated 2 years ago
- The PyTorch implementation of BAM for Partialspoof Audio Localization. ☆28 · Aug 16, 2024 · Updated last year
- ☆11 · Aug 11, 2023 · Updated 2 years ago
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking ☆45 · Aug 23, 2024 · Updated last year
- Official repository for the paper "Singing Voice Graph Modeling for SingFake Detection" (Interspeech 2024). ☆25 · Sep 19, 2025 · Updated 4 months ago
- PyTorch implementation of the paper "MultiSpeech: Multi-Speaker Text to Speech with Transformer" ☆21 · Jun 23, 2022 · Updated 3 years ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields" ☆25 · Dec 3, 2023 · Updated 2 years ago
- Official implementation of the SPL paper "One-class Learning Towards Synthetic Voice Spoofing Detection" ☆135 · Aug 30, 2024 · Updated last year
- ☆45 · Jun 11, 2024 · Updated last year
- Vox-Profile Benchmark ☆67 · Sep 12, 2025 · Updated 5 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆25 · Aug 11, 2024 · Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025 ☆28 · Dec 4, 2024 · Updated last year
- ☆30 · Jul 18, 2024 · Updated last year
- A unified dataset of multilingual emotional human utterances ☆29 · Jan 16, 2026 · Updated last month
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis ☆147 · Jan 1, 2025 · Updated last year
- awesome-audio-visual-robustness ☆11 · Jan 27, 2024 · Updated 2 years ago
- ☆10 · Apr 17, 2024 · Updated last year