Sample Repository for the AlibabaCloud Bailian Speech SDK
☆411Dec 19, 2025Updated 5 months ago
Alternatives and similar repositories for alibabacloud-bailian-speech-demo
Users that are interested in alibabacloud-bailian-speech-demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 内容审核及速率限制服务☆26May 18, 2025Updated last year
- “alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力,包括语音识别、语音合成、文件转写等。”☆81Aug 22, 2025Updated 9 months ago
- RTC AIGC Demo☆280Mar 25, 2026Updated 2 months ago
- 百聆 是一个类似GPT-4o的语音对话机器人,通过ASR+LLM+TTS实现,集成DeepSeek R1等优秀大模型,接入openClaw,真正的个人语音助手,时延低至800ms,Mac等低配置也可运行,支持打断☆1,715Apr 6, 2026Updated 2 months ago
- ☆15Jul 4, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- silero-vad pytorch implement☆36Nov 23, 2024Updated last year
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…☆1,232Jun 3, 2026Updated last week
- 安卓手机部署DeepSeek-R1 蒸馏的1.5B模型☆24Feb 4, 2025Updated last year
- 小智的视觉对话☆33Apr 25, 2025Updated last year
- Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-…☆17,657Updated this week
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- CPU inference version of VisemeNet-tensorflow☆14Nov 6, 2019Updated 6 years ago
- ESP32 component helps connect WiFi☆90Jun 8, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆14Aug 29, 2024Updated last year
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,905Feb 25, 2026Updated 3 months ago
- [ACL 2025 Main] A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆55Mar 19, 2025Updated last year
- ☆23Oct 30, 2024Updated last year
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆21,535May 25, 2026Updated 3 weeks ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆32Aug 29, 2024Updated last year
- Multilingual speech understanding: ASR + emotion recognition + audio event detection. 50+ languages, 15x faster than Whisper, non-autoreg…☆8,497Updated this week
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆64Feb 2, 2025Updated last year
- ☆24Feb 23, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 通用简单工具项目☆22Oct 6, 2024Updated last year
- Compute WER and SER for speech recognition evaluation☆26Jun 6, 2026Updated last week
- 这是一个用于连接小智AI服务的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。☆27Mar 14, 2025Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (STT + TTS), and OpenAI (LLM)☆22Jun 3, 2026Updated last week
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆136Apr 26, 2023Updated 3 years ago
- ☆11Mar 13, 2023Updated 3 years ago
- Utilizes ONNX Runtime for audio denoising.☆132Jun 6, 2026Updated last week
- The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.☆1,902Jul 5, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors☆28Jul 30, 2025Updated 10 months ago
- golang use ffmpeg to mix the video☆11May 28, 2023Updated 3 years ago
- ☆15Sep 19, 2024Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,985Dec 8, 2025Updated 6 months ago
- 实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning,…☆1,257Dec 18, 2025Updated 5 months ago
- The open-source foundation for production Voice AI. Build, scale, and own your AI calling infrastructure. Bridge legacy SIP to modern LLM…☆40May 12, 2026Updated last month
- ☆69Jul 17, 2024Updated last year