xinghua-qu / AudioQRLinks
☆10Updated 2 years ago
Alternatives and similar repositories for AudioQR
Users that are interested in AudioQR are comparing it to the libraries listed below
Sorting:
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking☆39Updated 11 months ago
- VoiceLDM: Text-to-Speech with Environmental Context☆181Updated last year
- Official Repository for "SingFake: Singing Voice Deepfake Detection"☆58Updated last year
- ☆62Updated last year
- ☆40Updated 2 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆92Updated last year
- ☆15Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 10 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆70Updated 2 years ago
- ☆24Updated last year
- ☆43Updated 2 months ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Updated 2 months ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆102Updated 2 years ago
- Training code and trained checkpoints for ASGAN.☆62Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆70Updated last year
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆92Updated last year
- HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.☆16Updated 6 months ago
- ☆20Updated 10 months ago
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆151Updated last month
- ☆37Updated 4 months ago
- ☆25Updated 2 years ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆40Updated 2 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆55Updated 9 months ago
- Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.☆89Updated 3 years ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆82Updated 5 months ago
- ☆41Updated 10 months ago
- Official code for Interspeech 2023 paper "Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clusterin…☆56Updated 2 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆50Updated last year
- A sequence-to-sequence voice conversion toolkit.☆102Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆80Updated 7 months ago