aliyun / alibabacloud-nls-python-sdkLinks
“alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力,包括语音识别、语音合成、文件转写等。”
☆75Updated 4 months ago
Alternatives and similar repositories for alibabacloud-nls-python-sdk
Users that are interested in alibabacloud-nls-python-sdk are comparing it to the libraries listed below
Sorting:
- ☆65Updated last week
- 实时语音识别API WebSocket☆156Updated last year
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆47Updated last year
- 语音识别API,分实时语音和长语音离线上传识别,支持中英文等多达100个国家的语言实时转写和同声传译☆82Updated last year
- 超快的中文普通话TTS☆122Updated 4 years ago
- 中文标点符号模型,可以给文本添加标点符号。☆147Updated last year
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆92Updated last month
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- 重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。☆50Updated 2 years ago
- 基于Gradio开发的ChatGPT聊天应用,可以文字 或 语音对话,发送的音频通过OpenAI的STT转文本后,再通过ChatGPT生成回复,回复的内容通过OpenAI TTS合成后返回并自动播放,实现语音聊天功能。☆35Updated last year
- ChatTTS HTTP API☆54Updated last year
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆105Updated last year
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆348Updated 3 weeks ago
- 获取bilibili直播弹幕,使用WebSocket协议☆37Updated last year
- 基于通义千问 Qwen2.5-Omni 的实时语音对话系统,使用在线API服务,支持实时语音交互、动态语音活动检测和流式音频处理。A real-time voice conversation system based on Qwen2.5-Omni Online-API, …☆83Updated 7 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 3 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆526Updated 2 years ago
- 张艺谋(国师)一键声音克隆和恶搞文本生成项目☆17Updated 2 years ago
- 跨语种语音克隆,中文版Webui☆62Updated 2 years ago
- 基于SenseVoice的funasr版本进行的api发布,可以无缝对接oneapi☆89Updated last year
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆58Updated 11 months ago
- 自己手写的百度搜索接口的封装,pip安装,支持命令行执行。Baidu Search unofficial API for Python with no external dependencies☆157Updated last year
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Updated last year
- 使用 FastAPI、Streamlit本地部署ChatTTS文本转语音模型,并通过 Docker Compose 进行容器化部署。☆27Updated last year
- 通过此代码可以免训练模型并通过轻量级服务器定制数字人形象☆106Updated last year
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆175Updated last year
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆595Updated last year
- ☆25Updated 4 years ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆313Updated 2 weeks ago
- 文本语料转训练集工具,txt转dataset☆93Updated last year