TencentCloud / tencentcloud-speech-sdk-python
☆47Updated last month
Alternatives and similar repositories for tencentcloud-speech-sdk-python:
Users that are interested in tencentcloud-speech-sdk-python are comparing it to the libraries listed below
- 中文标点符号模型,可以给文本添加标点符号。☆140Updated 4 months ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆39Updated 6 months ago
- “alibabacloud-nls-python-sdk提供使用阿里云智能语音服务的能力,包括语音识别、语音合成、文件转写等。”☆53Updated 4 months ago
- TTS appalication based on modelscope KAN-TTS☆43Updated last year
- ☆52Updated 9 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆66Updated 3 weeks ago
- 超快的中文普通话TTS☆120Updated 4 years ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆112Updated 7 months ago
- ☆112Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆90Updated 7 months ago
- Pseudo Streaming SenseVoice with Hotwords☆245Updated last month
- 使用vllm加速cosyvoice2的推理☆232Updated last week
- flow mirror models from JZX AI Labs☆45Updated 6 months ago
- chinese sentence punctuation prediction,中文句子标点符号预测。☆27Updated 2 years ago
- ASRT语音识别系统的Python版SDK☆51Updated 2 years ago
- 用 OCR 提取视频硬字幕☆71Updated 2 months ago
- Documentation for Bert-VITS2☆22Updated last year
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆261Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Updated 10 months ago
- ☆29Updated 5 years ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆161Updated this week
- ☆195Updated 7 months ago
- 能说话的ChatPaper☆11Updated last year
- Dynamic Voice Actor Assignment and Emotional Narration for Realistic Story Play☆40Updated 3 weeks ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆17Updated last week
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆22Updated 11 months ago
- 演示 vllm 对中文大语言模型的神奇效果☆31Updated last year
- 基于 g2pW 提升 pypinyin 的准确性☆87Updated last year
- 基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…☆54Updated last year
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆261Updated 3 weeks ago