Ninot1Quyi / Qwen2.5-Omni-multimodal-chat
A real-time voice conversation system based on the Tongyi Qianwen Qwen2.5-Omni online API service, supporting real-time voice interaction, dynamic voice activity detection, and streaming audio processing.
☆36 Updated this week
Alternatives and similar repositories for Qwen2.5-Omni-multimodal-chat:
Users that are interested in Qwen2.5-Omni-multimodal-chat are comparing it to the libraries listed below
- ☆193 Updated 6 months ago
- An open-source LLM based on an MoE (mixture-of-experts) architecture. ☆58 Updated 9 months ago
- A tool for converting text corpora into training sets (txt to dataset). ☆91 Updated 11 months ago
- Extension of ChatTTS: 3x faster on Windows, with support for voice cloning and mobile deployment. ☆148 Updated 2 months ago
- flow mirror models from JZX AI Labs ☆44 Updated 6 months ago
- SenseVoice-python: an enterprise-grade, open-source multilingual ASR system from FunASR, running on onnxruntime ☆88 Updated 6 months ago
- Project page for ChatAnyone ☆86 Updated 3 weeks ago
- GLM Series Edge Models ☆134 Updated last month
- A demo built on Megrez-3B-Instruct, integrating a web search tool to enhance the model's question-and-answer capabilities. ☆37 Updated 4 months ago
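- (Work in progress.) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training problem to guide readers through hands-on LLM fine-tuning. ☆33 Updated 8 months ago
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory ☆28 Updated 11 months ago
- Uses langchain for task planning and builds conversation-scenario resources for each subtask. An MCTS task executor lets each subtask draw on the resources in its context and explore through self-reflection to reach its own best answer to the problem. This approach depends on the model's alignment preferences; for each preference we designed an engineering framework that implements a sampling strategy over self-assigned rewards for different answers. ☆29 Updated last week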
- A simple MLLM that surpasses QwenVL-Max using only open-source data, built on a 14B LLM. ☆37 Updated 7 months ago
- A plan to extend ChatHaruhi into a zero-shot role-playing model ☆103 Updated last year
- ☆78 Updated 11 months ago
- An open-source agentic workflow framework/showcase that turns chat text into control actions, powered by the Agently AI application development framework. ☆27 Updated 6 months ago
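- Speech recognition built on FunASR, with a standard version and an ONNX version (recommended). ☆37 Updated 6 months ago
- Documents the complete er-nerf deployment process from scratch. ☆44 Updated 7 months ago
- ☆51 Updated 9 months ago
- 骆驼大乱斗 (Luotuo Brawl): massive game content generated by LLMs ☆19 Updated last year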
- ☆58 Updated 5 months ago
- Real-time faster-whisper with Gradio ☆26 Updated 6 months ago
- ChatTTS HTTP API ☆52 Updated 10 months ago
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play ☆40 Updated 2 weeks ago
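- Python3 package for Chinese/English OCR, with a paddleocr-v4 ONNX model (~14MB). Inference is based on the ppocr-v4-onnx model, enabling accurate, millisecond-level OCR prediction on CPU; Chinese/English OCR in general scenarios reaches open-source SO… ☆73 Updated 2 months ago
- ☆67 Updated last year
- The first Chinese version of the llama2 13b model (base + Chinese dialogue SFT, enabling fluent multi-turn natural-language human-machine interaction) ☆90 Updated last year
- Streaming ASR and TTS based on FastAPI + sherpa-onnx ☆95 Updated 6 months ago
- A multi-modal RAG project with a dataset from Honor of Kings, one of the most popular smartphone games in China ☆65 Updated 7 months ago
- Research on accelerating GOT-OCR for production deployment, not limited to any one language ☆60 Updated 5 months ago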