LinkSoul-AI / LLaSMLinks
第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。
☆556Updated last year
Alternatives and similar repositories for LLaSM
Users that are interested in LLaSM are comparing it to the libraries listed below
Sorting:
- ☆201Updated 9 months ago
- The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,733Updated last year
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆513Updated last year
- SpeechGPT Series: Speech Large Language Models☆1,385Updated 11 months ago
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆330Updated last month
- ☆729Updated last year
- 支持中英文双语视觉-文本对话的开源可商用多模态模型。☆373Updated last year
- ☆361Updated 11 months ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆284Updated last month
- The official repo of Aquila2 series proposed by BAAI, including pretrained & chat large language models.☆444Updated 9 months ago
- XVERSE-13B: A multilingual large language model developed by XVERSE Technology Inc.☆646Updated last year
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆468Updated 6 months ago
- 用于汇总目前的开源中文对话数据集☆161Updated 2 years ago
- The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,800Updated 2 months ago
- ☆233Updated 4 months ago
- Pseudo Streaming SenseVoice with Hotwords☆310Updated 4 months ago
- Yuan 2.0 Large Language Model☆688Updated last year
- MooER: Moore-threads Open Omni model for speech-to-speech intERaction. MooER-omni includes a series of end-to-end speech interaction mode…☆215Updated 6 months ago
- Text Normalization & Inverse Text Normalization☆612Updated this week
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆554Updated 3 weeks ago
- X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages☆312Updated last year
- 使用vllm加速cosyvoice2的推理☆370Updated 2 months ago
- The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"☆263Updated last year
- OrionStar-Yi-34B-Chat 是一款开源中英文Chat模型,由猎户星空基于Yi-34B开源模型、使用15W+高质量语料微调而成。☆259Updated last year
- BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs☆511Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆59Updated 2 months ago
- Repo for adapting Meta LlaMA2 in Chinese! META最新发布的LlaMA2的汉化版! (完全开源可商用)☆743Updated last year
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆266Updated last year
- “百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced l…☆317Updated 7 months ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆563Updated last year