NKU-HLT / RAMP_MOS
Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆17Updated last month
Alternatives and similar repositories for RAMP_MOS:
Users that are interested in RAMP_MOS are comparing it to the libraries listed below
- ☆9Updated last month
- ☆149Updated 6 months ago
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆32Updated 10 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆239Updated 2 weeks ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆167Updated 6 months ago
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆49Updated 7 months ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆112Updated last month
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆47Updated last month
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆184Updated 9 months ago
- It's a repository for implementations of neural speech editing algorithms.☆193Updated last year
- A Survey of Spoken Dialogue Models (60 pages)☆251Updated 2 months ago
- llama-omni训练代码复现☆41Updated last week
- The open source code for LLM-Codec☆125Updated 5 months ago
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆140Updated last year
- This is an evolving repo for the paper "Towards Controllable Speech Synthesis in the Era of Large Language Models: A Survey".☆112Updated 2 weeks ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆144Updated 3 weeks ago
- unofficial implementation of the High Fidelity Neural Audio Compression☆141Updated 5 months ago
- ☆43Updated last year
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆138Updated last month
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆151Updated 2 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆126Updated 3 months ago
- UTokyo-SaruLab MOS Prediction System☆129Updated last month
- ☆24Updated 4 months ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆82Updated 3 weeks ago
- BLSP: Bootstrapping Langauge-Speech Pre-training via Behavior Alignment of Continuation Writing☆49Updated 10 months ago
- EMO-SUPERB submission☆42Updated 4 months ago
- Reference-aware automatic speech evaluation toolkit☆140Updated last month
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆29Updated last year
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆95Updated last week
- The official repository of Dynamic-SUPERB.☆169Updated 3 weeks ago