Ninot1Quyi/Qwen2.5-Omni-multimodal-chat

Ninot1Quyi / Qwen2.5-Omni-multimodal-chat

A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni. It uses the online API service and supports real-time voice interaction, dynamic voice activity detection, and streaming audio processing.

☆91

Alternatives and similar repositories for Qwen2.5-Omni-multimodal-chat

Users who are interested in Qwen2.5-Omni-multimodal-chat compare it to the repositories listed below.

yarikama / Agentic-Advanced-RAG
Building a multi-agent RAG system with advanced RAG methods
☆13 · Jan 12, 2025 · Updated last year
AllenTom / SunoGenerator
A client for using Suno's AI music generator
☆19 · Apr 17, 2024 · Updated 2 years ago
fengnian123 / qwen-2.5-omni-realtime-chat
Uses the fastrtc framework to call qwen-2.5-omni-realtime for real-time voice, video, and more
☆14 · Jun 27, 2025 · Updated last year
aniketp02 / wav2lip_144x144
☆10 · Feb 17, 2023 · Updated 3 years ago
lifeiteng / VoiceBox
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
☆29 · Aug 4, 2023 · Updated 2 years ago
fyabc / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆49 · Sep 18, 2025 · Updated 10 months ago
jonigata / lsmithbridge
stable-diffusion-webui extension that bypasses to lsmith
☆11 · Apr 28, 2023 · Updated 3 years ago
mbrostami / ComfyUI-TITrain
ComfyUI Textual Inversion Training nodes using input images from workflow
☆13 · Jul 21, 2025 · Updated last year
NoahBishop / index-tts
☆12 · Dec 1, 2025 · Updated 7 months ago
yxduir / LLM-SRT
☆28 · Mar 11, 2026 · Updated 4 months ago
zxs731 / raspbarry_qwen2.5_omni
Raspberry Pi qwen-omni voice assistant that needs no TTS/STT
☆17 · Apr 4, 2025 · Updated last year
cyysky / MAI-UI-Navigation-Agent
MAI UI Navigation Agent
☆16 · Dec 29, 2025 · Updated 6 months ago
qiuchili / diasenti
Conversational Multimodal Emotion Recognition
☆12 · Dec 7, 2020 · Updated 5 years ago
matthewoestreich / psFreshservice
Freshservice PowerShell Module
☆14 · Oct 4, 2019 · Updated 6 years ago
RemSynch / SenseVoice-Real-Time
A small project that combines VAD, a voiceprint lock, and SenseVoice for near-real-time speech transcription
☆42 · Sep 23, 2024 · Updated last year
adrianlzr / okta-scim-python-flask
SCIM Cloud Server - POC of Integrating a SCIM Server with Okta. Should not be used in production
☆10 · Dec 8, 2022 · Updated 3 years ago
patil-suraj / simple-diffusion
An implementation of simple diffusion in PyTorch (and JAX)
☆34 · Jan 28, 2023 · Updated 3 years ago
abb128 / turndetection
☆21 · Mar 7, 2025 · Updated last year
hanantabak2 / crewai_ai_business_consultant_on_streamlit
A multi-agent business consultant app on Streamlit implemented using crewAI
☆18 · Jul 5, 2024 · Updated 2 years ago
Horizon2333 / videoqa_dataset_visualization
Load and visualize different datasets in video question answering
☆10 · May 11, 2021 · Updated 5 years ago
wangchengzhong / GALDSE
☆15 · Mar 11, 2025 · Updated last year
alexa / xlgen-eacl-2023
☆13 · Apr 12, 2024 · Updated 2 years ago
alpoktem / Prosograph
A visualizer for prosodically annotated speech corpora
☆12 · Oct 27, 2021 · Updated 4 years ago
light1726 / SpeechTripleNet
The implementation of the paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"
☆33 · Nov 23, 2023 · Updated 2 years ago
chinayuren2022-2025 / kimi_stock_advisor
AI Quant Monitor is a lightweight intelligent monitoring terminal built for A-share/ETF investors. It combines EasyQuotation millisecond-level quotes with the deep analysis capabilities of the Kimi large model. The system cleans data in real time through a local SQLite engine and back-derives rate-of-rise and volume-ratio indicators to precisely capture "rocket launch" and "high-platform jump…
☆19 · Jul 6, 2026 · Updated 2 weeks ago
go-board / std
An enhanced version of the standard library based on the new generics feature.
☆19 · Mar 21, 2024 · Updated 2 years ago
Audio-AGI / dcase2024_task9_baseline
Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"
☆26 · Mar 27, 2024 · Updated 2 years ago
YuxiangChai / AMEX-codebase
☆33 · Sep 27, 2024 · Updated last year
QwenLM / Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…
☆4,047 · Jun 12, 2025 · Updated last year
shang0712 / HierTTS
☆47 · Apr 16, 2023 · Updated 3 years ago
DaiYvhang / AISHELL-5
In-car multi-channel speech transcription system of AISHELL-5.
☆48 · Jun 9, 2025 · Updated last year
ziplab / LongVLM
☆108 · Jul 30, 2024 · Updated last year
SaudM / GPT-LMU-APP
Build AI applications in one click with GPT-3.5, GPT-4, and Claude, and easily create AI apps such as AI customer service, AI assistants, AI lecturers, and AI lawyers. Supports multi-user and multi-model management.
☆11 · Aug 22, 2023 · Updated 2 years ago
ShiningLab / POS-Tagger-for-Punctuation-Restoration
This repository is for the paper Incorporating External POS Tagger for Punctuation Restoration. Proc. Interspeech 2021, 1987-1991, doi: 1…
☆11 · May 24, 2026 · Updated 2 months ago
bytedance / X-Dyna
[CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animation
☆268 · Jan 30, 2025 · Updated last year
chensonglu / LPD-end-to-end
☆14 · Apr 7, 2020 · Updated 6 years ago
jesonxiang / cpp_extension_pybind11
A demo project demonstrating the performance improvement from a C++ extension wrapped with pybind11.
☆10 · Nov 16, 2021 · Updated 4 years ago
fudan-zvg / Reason2Drive
[ECCV 2024] Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving
☆101 · Jan 1, 2024 · Updated 2 years ago
eyelash500 / 2020_ironman_python
Python X Financial Analysis X Azure
☆13 · Oct 21, 2023 · Updated 2 years ago