基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, supporting real-time voice interaction, dynamic voice activity detection, and streaming audio processing.
☆90May 11, 2025Updated last year
Alternatives and similar repositories for Qwen2.5-Omni-multimodal-chat
Users that are interested in Qwen2.5-Omni-multimodal-chat are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Building a multi-agent RAG system with advanced RAG methods☆13Jan 12, 2025Updated last year
- 使用fastrtc框架调用qwen-2.5-omni-realtime实现实时语音、视频等☆14Jun 27, 2025Updated 11 months ago
- 一个拥有长期记忆, 表情动作, 语音对话/打断/声纹识别, FunctionCall, 多模型支持的AI Waifu客户端.☆29Apr 23, 2025Updated last year
- ☆10Feb 17, 2023Updated 3 years ago
- ☆26Mar 11, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- In-car multi-channel speech transcription system of AISHELL-5.☆44Jun 9, 2025Updated last year
- 树莓派qwen-omni语音助手免TTS/STT☆17Apr 4, 2025Updated last year
- ☆19Jan 18, 2019Updated 7 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- ☆21Mar 7, 2025Updated last year
- 简单实现VAD+声纹锁+SenseVoice完成类语音实时转录的小项目☆42Sep 23, 2024Updated last year
- ☆12Dec 1, 2025Updated 6 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆48Sep 18, 2025Updated 8 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆14Jan 25, 2024Updated 2 years ago
- bm25 is a scoring function that helps with information retrieval☆14Sep 17, 2020Updated 5 years ago
- 针对口语进行时间抽取并标准化☆13Mar 2, 2020Updated 6 years ago
- 词、句拼音转汉字、拼音分割、拼音补全、pygame输入中文☆15Mar 21, 2020Updated 6 years ago
- provide SPHERE-formatted output as well as RIFF, AU, AIFF and raw☆14Dec 18, 2021Updated 4 years ago
- Conversational Multimodal Emotion Recognition☆12Dec 7, 2020Updated 5 years ago
- crawl the public files of different governments through python 3.☆15Aug 29, 2019Updated 6 years ago
- ☆14May 28, 2025Updated last year
- 基于BERT和指针网络构建实体抽取任务☆14Aug 2, 2020Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- A multi-agent business consultant app on streamlit implemented using crewAI☆18Jul 5, 2024Updated last year
- This repo would give multi-task keypoint detect code based yolov8. The landmarks or keypoints with different classes and numbers can be …☆12Feb 28, 2023Updated 3 years ago
- ☆24Jul 10, 2025Updated 11 months ago
- A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.☆10Nov 6, 2021Updated 4 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- ☆13Apr 12, 2024Updated 2 years ago
- ☆29Oct 1, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- 【Demo】对新闻标题使用TF-IDF向量化和cosine相似度计算完成相似标题推荐☆14Mar 2, 2020Updated 6 years ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆4,020Jun 12, 2025Updated last year
- 语音增强TFCN论文复现☆42Feb 8, 2022Updated 4 years ago
- QuantClaw is a plug-and-play task-type routing quantization plugin for OpenClaw.☆116Apr 27, 2026Updated last month
- Pyhon X 金融分析 X Azure☆13Oct 21, 2023Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Jun 23, 2022Updated 3 years ago