LinkSoul-AI / LLaSM
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
☆519Updated last year
Related projects: ⓘ
- The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,390Updated 2 months ago
- ☆463Updated 3 months ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,069Updated last month
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆429Updated 7 months ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆649Updated 5 months ago
- 支持中英文双语视觉-文本对话的开源可商用多模态模型。☆349Updated 11 months ago
- ☆258Updated last month
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆483Updated 8 months ago
- SpeechGPT Series: Speech Large Language Models