xinghua-qu / AudioQRLinks
☆11Updated 2 years ago
Alternatives and similar repositories for AudioQR
Users that are interested in AudioQR are comparing it to the libraries listed below
Sorting:
- VoiceLDM: Text-to-Speech with Environmental Context☆188Updated last year
- Official Repository for "SingFake: Singing Voice Deepfake Detection"☆63Updated last year
- Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking☆43Updated last year
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Updated 2 months ago
- Python code for handling the Clotho dataset.☆85Updated 5 years ago
- Official Implementation of EnCLAP (ICASSP 2024)☆94Updated last year
- [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and s…☆148Updated 6 months ago
- Expressive Anechoic Recordings of Speech (EARS)☆200Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆53Updated last year
- An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.☆193Updated 5 months ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆76Updated 2 years ago
- PAM is a no-reference audio quality metric for audio generation tasks☆76Updated last year
- ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation☆36Updated last year
- ☆70Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Updated 9 months ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆48Updated 8 months ago
- SAMO: SPEAKER ATTRACTOR MULTI-CENTER ONE-CLASS LEARNING FOR VOICE ANTI-SPOOFING☆41Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆39Updated last year
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆110Updated last year
- ☆35Updated last year
- End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions☆94Updated 2 years ago
- Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.☆87Updated 2 years ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆192Updated last year
- Source code for the paper 'Audio Captioning Transformer'☆57Updated 3 years ago
- Code for paper Learning Audio-Visual Dereverberation☆30Updated 3 years ago
- ☆41Updated 8 months ago
- This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…☆105Updated 2 years ago
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.☆16Updated 2 months ago
- A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)☆58Updated last year