Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆32Mar 23, 2025Updated last year
Alternatives and similar repositories for RAMP_MOS
Users that are interested in RAMP_MOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Apr 18, 2025Updated last year
- ☆44Apr 2, 2025Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆42Mar 20, 2024Updated 2 years ago
- Paper List☆18Jul 2, 2025Updated 9 months ago
- [ICANN 2023] Anomaly-Based Insider Threat Detection via Hierarchical Information Fusion☆18Nov 20, 2023Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [NAACL 2024] Better Zero-Shot Reasoning with Role-Play Prompting☆36Nov 14, 2023Updated 2 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- [ACL 2023] PromptRank: Unsupervised Keyphrase Extraction Using Prompt☆51May 16, 2023Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 8 months ago
- [ICASSP 2025]☆15Feb 19, 2025Updated last year
- [AAAI 2026 & ACL 2026] The official implementation of the DIFFA series for dLLM-based large audio language model☆76Apr 7, 2026Updated last week
- PAM is a no-reference audio quality metric for audio generation tasks☆76Jul 19, 2024Updated last year
- ☆32Nov 24, 2024Updated last year
- ☆109Jun 14, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆21Aug 13, 2024Updated last year
- ☆12Mar 11, 2025Updated last year
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- [InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency☆62Oct 23, 2024Updated last year
- a Neural Vocoder supporting Ring Attention, Conformer and NSF.☆25Aug 1, 2025Updated 8 months ago
- Multi-lingual AudioCaps☆12Nov 20, 2023Updated 2 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆62Feb 28, 2026Updated last month
- UT-Sarulab MOS prediction system using SSL models☆298Apr 11, 2024Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors☆25Jul 30, 2025Updated 8 months ago
- ☆11Oct 31, 2024Updated last year
- It's a repository for implementations of neural speech editing algorithms.☆205Jan 9, 2024Updated 2 years ago
- Speech Human Evaluation Estimation Toolkit (SHEET)☆134Mar 31, 2026Updated 2 weeks ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Apr 13, 2026Updated last week
- ☆62May 31, 2024Updated last year
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆108Aug 1, 2025Updated 8 months ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆56Jun 25, 2024Updated last year
- Dataset [ACL 2026]☆32Jul 31, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆11Jun 7, 2022Updated 3 years ago
- ☆11Jun 14, 2024Updated last year
- UTokyo-SaruLab MOS Prediction System☆309Apr 2, 2026Updated 2 weeks ago
- ☆15Apr 4, 2025Updated last year
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆165Updated this week
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆82Feb 20, 2026Updated 2 months ago