NKU-HLT / RAMP_MOS
Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆11Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for RAMP_MOS
- ☆138Updated 4 months ago
- unofficial implementation of the High Fidelity Neural Audio Compression☆133Updated 2 months ago
- Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.☆198Updated 9 months ago
- Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)☆277Updated this week
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆113Updated 6 months ago
- Audio Codec Speech processing Universal PERformance Benchmark☆216Updated last week
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆166Updated 6 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆116Updated last week
- Paper, Code and Resources for Speech Language Model and End2End Speech Dialogue System.☆61Updated this week
- Audio Large Language Models☆127Updated 2 weeks ago
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆157Updated 4 months ago
- 语音方向实验室/公司/资源/实习等,欢迎推荐或自荐☆517Updated 2 weeks ago
- UT-Sarulab MOS prediction system using SSL models☆185Updated 7 months ago
- ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore t…☆395Updated this week
- It's a repository for implementations of neural speech editing algorithms.☆191Updated 10 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆119Updated 3 weeks ago
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…☆369Updated 9 months ago
- Training code for FAcodec presented in NaturalSpeech3☆178Updated 2 months ago
- ☆17Updated 8 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆472Updated 5 months ago
- Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆102Updated last month
- Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)☆136Updated last year
- [ICASSP 2024] KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels☆28Updated 7 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆96Updated 3 weeks ago
- The official repository of Dynamic-SUPERB.☆160Updated this week
- The open source code for LLM-Codec☆114Updated 2 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆131Updated last month
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆42Updated 4 months ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆141Updated 4 months ago
- ☆136Updated last year