aliyun / alibabacloud-nls-python-sdk
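“alibabacloud-nls-python-sdk provides access to Alibaba Cloud Intelligent Speech services, including speech recognition, speech synthesis, and file transcription.”
☆67 Updated last month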
Alternatives and similar repositories for alibabacloud-nls-python-sdk
Users who are interested in alibabacloud-nls-python-sdk are comparing it to the libraries listed below.
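- ☆61 Updated 2 weeks ago
- Real-time speech recognition API over WebSocket ☆150 Updated last year
- Speech recognition based on FunASR, with a standard version and an ONNX version (recommended). ☆45 Updated last year
- Ultra-fast Mandarin Chinese TTS ☆121 Updated 4 years ago
- A Chinese punctuation model that adds punctuation marks to text. ☆144 Updated 9 months ago
- A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with quantized models included and GPU support. Supports speech recognition from files at a URL. ☆101 Updated last year
- Speech recognition API with real-time recognition and offline upload recognition for long audio; supports real-time transcription and simultaneous interpretation in Chinese, English, and the languages of up to 100 countries ☆80 Updated 9 months ago
- Reborn as an AI worker. In my previous life I was a nobody, coming and going in a hurry, never knowing where I would be born. But fate gave me a rare chance to be reborn as an AI worker. ☆49 Updated 2 years ago
- A ChatGPT chat application built with Gradio that supports text or voice conversation: submitted audio is transcribed by OpenAI's STT, ChatGPT generates a reply, and the reply is synthesized with OpenAI TTS, returned, and played automatically, enabling voice chat. ☆35 Updated last year
- With this code you can customize a digital human avatar on a lightweight server, with no model training required ☆105 Updated last year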
- Sample Repository for the AlibabaCloud Bailian Speech SDK ☆288 Updated last week
- ASR 2Pass onnxruntime and websocket server, based on FunASR (https://github.com/alibaba-damo-academy/FunASR). ☆80 Updated last month
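- Zhang Yimou (“Guoshi”) one-click voice cloning and parody text generation project ☆17 Updated 2 years ago
- Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition and multi-speaker speech synthesis, with multilingual support and high accuracy ☆504 Updated 10 months ago
- Uses the Wav2Lip model from the PaddleGAN toolkit to give the person in a photo a voice and matching lip movements ☆26 Updated 4 years ago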
- KAN-TTS is a speech-synthesis training framework. Please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-… ☆522 Updated last year
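- Fetches bilibili live-stream danmaku (bullet comments) over the WebSocket protocol ☆37 Updated last year
- Continues training on the Biaobei (Data Baker) dataset and improves the original FastSpeech2 model by adding prosody representation and prosody prediction modules, making Chinese pronunciation more lively and rhythmic ☆271 Updated 2 years ago
- This project uses several advanced speaker recognition models, including EcapaTdnn, ResNetSE, ERes2Net, and CAM++, and supports several data preprocessing methods, including MelSpectrogram, Spectrogram, MFCC, and Fbank ☆289 Updated 4 months ago
- SenseVoice-python: An enterprise-grade open-source multilingual ASR system from FunASR, running on onnxruntime ☆103 Updated this week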
- ChatTTS HTTP API ☆54 Updated last year
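- 10000 chatTTS voices! A chatTTS voice library, so you never have to fret over the voice lottery again. This is my first project; I stayed up late slowly generating 10,000 voices and uploading them to GitHub, so a little encouragement would be appreciated! Main domain: https://www.TTSlist.com Backup: http://ttslist.aiqb… ☆196 Updated last year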
- Pseudo Streaming SenseVoice with Hotwords ☆358 Updated 6 months ago
- ☆203 Updated last year
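- End-to-end voice wake-up toolkit, from model training to model inference. ☆137 Updated 2 months ago
- A real-time voice conversation system based on Tongyi Qianwen Qwen2.5-Omni, using the online API service, with support for real-time voice interaction, dynamic voice activity detection, and streaming audio processing. ☆74 Updated 5 months ago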
- ☆138 Updated 2 years ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library… ☆51 Updated 8 months ago
- Documentation for Bert-VITS2 ☆22 Updated last year