NKU-HLT / MusicEval-baselineLinks
☆12Updated 9 months ago
Alternatives and similar repositories for MusicEval-baseline
Users that are interested in MusicEval-baseline are comparing it to the libraries listed below
Sorting:
- Retrieval-Augmented MOS Prediction with Prior Knowledge Integration☆32Updated 10 months ago
- A Benchmark for Evaluating Turn-Taking and Overlap Handling in Full-Duplex Spoken Dialogue Models☆118Updated 4 months ago
- ☆78Updated 5 months ago
- Xmart青年论坛仓库,存放历史学生论坛和前沿讲座的视频回放和讲义,获取最新Xmart预告欢迎关注公众号【XLANCE Lab】☆39Updated last month
- ☆37Updated 4 years ago
- ☆108Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Updated 2 months ago
- A benchmark for evaluating audio encoders on various audio tasks.☆42Updated last month
- ☆62Updated last year
- The open source code for LLM-Codec☆146Updated last year
- Audio-FLAN☆160Updated 4 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆41Updated 10 months ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆33Updated 2 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Updated 2 years ago
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆194Updated last year
- A CSRankings-like index for speech researchers☆35Updated last year
- The official source code of UniAudio☆95Updated last year
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling"☆83Updated 4 months ago
- WavReward: Spoken Dialogue Models With Generalist Reward Evaluators☆54Updated 8 months ago
- ☆32Updated last year
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆76Updated 3 weeks ago
- ☆66Updated 2 years ago
- ARCH: Audio Representations benCHmark☆53Updated last year
- Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization☆190Updated last year
- ☆10Updated 3 years ago
- ☆79Updated 7 months ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 4 years ago
- ☆22Updated 2 months ago
- Template for creating audio encoders compatible with X-ARES☆17Updated last month
- Dataset☆28Updated 6 months ago