QuwanAI / MoodBenchLinks
基于PQAEF (https://github.com/QuwanAI/PQAEF) 框架设计的情感陪伴对话系统测评基准
☆41Updated 5 months ago
Alternatives and similar repositories for MoodBench
Users that are interested in MoodBench are comparing it to the libraries listed below
Sorting:
- Peking University & Quwan Ability Evaluation Framework ;☆48Updated 5 months ago
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆219Updated last year
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆844Updated this week
- ☆484Updated 9 months ago
- ☆473Updated 8 months ago
- An Open-Sourced LLM-empowered Foundation TTS System☆895Updated 4 months ago
- 使用vllm加速cosyvoice2的推理☆478Updated 9 months ago
- Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…☆1,317Updated last week
- ☆343Updated 9 months ago
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆965Updated 4 months ago
- Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…☆1,330Updated 4 months ago
- ☆242Updated 11 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆925Updated last year
- MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting…☆1,091Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆651Updated 2 weeks ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,802Updated last year
- ☆341Updated 3 months ago
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆918Updated last month
- ☆580Updated 3 weeks ago
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆362Updated 8 months ago
- A framework for efficient model inference with omni-modality models☆2,491Updated this week
- ☆204Updated last year
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆789Updated last week
- 🤗 R1-AQA Model: mispeech/r1-aqa☆315Updated 10 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆811Updated last year
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…☆862Updated last week
- Memory-Guided Diffusion for Expressive Talking Video Generation☆1,076Updated 6 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆299Updated 3 months ago
- Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code☆1,089Updated last year
- OpenMusic: SOTA Text-to-music (TTM) Generation☆635Updated 7 months ago