aliyun / alibabacloud-bailian-speech-demoView external linksLinks
Sample Repository for the AlibabaCloud Bailian Speech SDK
☆374Dec 19, 2025Updated last month
Alternatives and similar repositories for alibabacloud-bailian-speech-demo
Users that are interested in alibabacloud-bailian-speech-demo are comparing it to the libraries listed below
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated 9 months ago
- silero-vad pytorch implement☆34Nov 23, 2024Updated last year
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…☆1,115Mar 1, 2025Updated 11 months ago
- 百聆 是一个类似GPT-4o的语音对 话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,时延低至800ms,Mac等低配置也可运行,支持打断☆1,606Jul 31, 2025Updated 6 months ago
- 小智的视觉对话☆32Apr 25, 2025Updated 9 months ago
- ☆15Jul 4, 2024Updated last year
- RTC AIGC Demo☆246Nov 19, 2025Updated 2 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆14,891Feb 4, 2026Updated 2 weeks ago
- 这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。☆26Mar 14, 2025Updated 11 months ago
- ☆11Mar 13, 2023Updated 2 years ago
- A Chrome DevTools Extension for OpenSumi.☆14Apr 22, 2024Updated last year
- pytorch+bert实现的意图识别与槽位填充☆11May 30, 2023Updated 2 years ago
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆12Dec 26, 2024Updated last year
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆15Aug 29, 2024Updated last year
- ☆10May 27, 2025Updated 8 months ago
- Vue移动商城项目,练习Vue时的demo。☆10Jan 6, 2023Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆128Apr 26, 2023Updated 2 years ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆32Aug 29, 2024Updated last year
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,766Updated this week
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Sep 20, 2024Updated last year
- 一个用于CosyVoice的api接口项目☆335Aug 31, 2025Updated 5 months ago
- ☆22Jul 30, 2025Updated 6 months ago
- Characterize Anything: A Wondrous Chemical Reaction between vision models and AI Characters☆16May 17, 2023Updated 2 years ago
- ☆33Feb 28, 2025Updated 11 months ago
- Multilingual Voice Understanding Model☆7,497Dec 30, 2025Updated last month
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆58Feb 2, 2025Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆19,578Feb 11, 2026Updated last week
- AI 技术分享频道相关文件☆98Updated this week
- golang use ffmpeg to mix the video☆11May 28, 2023Updated 2 years ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆20Oct 28, 2025Updated 3 months ago
- CPU inference version of VisemeNet-tensorflow☆14Nov 6, 2019Updated 6 years ago
- LiveKit + Next.js AI voice agent interface☆16Feb 21, 2025Updated 11 months ago
- chai2010 的博客☆13Jan 24, 2026Updated 3 weeks ago
- OCRFusion is an integrated solution that combines multiple open-source OCR (Optical Character Recognition) models, layout analysis, and t…☆16Jul 30, 2024Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆429Mar 13, 2025Updated 11 months ago
- Python Wrapper of Silero VAD☆64May 8, 2025Updated 9 months ago
- ☆69Jul 17, 2024Updated last year
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆48Mar 19, 2025Updated 10 months ago
- Add real-time Speech-to-Text to your LiveKit application with AssemblyAI☆18Jun 5, 2025Updated 8 months ago