NKU-HLT / RAMP_MOS
Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆32 Mar 23, 2025 Updated 10 months ago
Alternatives and similar repositories for RAMP_MOS
Users interested in RAMP_MOS are comparing it to the repositories listed below:
- ☆12 Apr 18, 2025 Updated 9 months ago
- ☆40 Apr 2, 2025 Updated 10 months ago
- Paper List ☆18 Jul 2, 2025 Updated 7 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels ☆42 Mar 20, 2024 Updated last year
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation ☆12 Nov 28, 2024 Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Aug 1, 2025 Updated 6 months ago
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion ☆18 Nov 20, 2023 Updated 2 years ago
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting ☆36 Nov 14, 2023 Updated 2 years ago
- ☆13 Mar 11, 2025 Updated 11 months ago
- [ACL 2023] PromptRank: Unsupervised Keyphrase Extraction Using Prompt ☆52 May 16, 2023 Updated 2 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open … ☆22 Feb 7, 2026 Updated last week
- PAM is a no-reference audio quality metric for audio generation tasks ☆77 Jul 19, 2024 Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF. ☆24 Aug 1, 2025 Updated 6 months ago
- ☆14 Feb 19, 2025 Updated 11 months ago
- ☆32 Nov 24, 2024 Updated last year
- ☆109 Jun 14, 2023 Updated 2 years ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation ☆21 Aug 13, 2024 Updated last year
- Supervoice diffusion enhance ☆28 Jul 15, 2024 Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments. ☆27 Jan 21, 2025 Updated last year
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization ☆11 Dec 21, 2024 Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding" ☆11 Apr 10, 2025 Updated 10 months ago
- ☆11 Jun 14, 2024 Updated last year
- The project for speech translation ☆12 Sep 28, 2023 Updated 2 years ago
- ☆13 Oct 25, 2024 Updated last year
- ☆32 Nov 18, 2025 Updated 2 months ago
- ☆13 Sep 25, 2024 Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆132 Oct 2, 2025 Updated 4 months ago
- Pybind11 bindings for Kaldi ☆15 Feb 1, 2026 Updated 2 weeks ago
- Multi-lingual AudioCaps ☆12 Nov 20, 2023 Updated 2 years ago
- ☆13 Oct 11, 2024 Updated last year
- The official implementation of the DIFFA series for dLLM-based large audio language model ☆59 Feb 2, 2026 Updated 2 weeks ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆78 Jun 8, 2025 Updated 8 months ago
- StyleTTS 2 Optimized Training Fork ☆33 Feb 2, 2025 Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac… ☆80 Sep 29, 2025 Updated 4 months ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset ☆15 Apr 7, 2025 Updated 10 months ago
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Oct 14, 2025 Updated 4 months ago
- Simple tool for speech dataset augmentation for modeling various prosodies. ☆14 Jan 14, 2021 Updated 5 years ago
- StyleTTS2 + Vocos as a Decoder ☆13 Mar 24, 2025 Updated 10 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 Oct 2, 2024 Updated last year