lansinuote / Chinese_Speech_to_TextLinks
☆17Updated last year
Alternatives and similar repositories for Chinese_Speech_to_Text
Users that are interested in Chinese_Speech_to_Text are comparing it to the libraries listed below
Sorting:
- ☆17Updated 4 months ago
- This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not exclud…☆1,047Updated last month
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆1,080Updated last month
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆471Updated 8 months ago
- ☆729Updated last year
- Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。☆684Updated last month
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆284Updated last month
- An minimal Seq2Seq example of Automatic Speech Recognition (ASR) based on Transformer☆67Updated last year
- 基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型☆860Updated last month
- Pseudo Streaming SenseVoice with Hotwords☆310Updated 4 months ago
- 使用vllm加速cosyvoice2的推理☆369Updated 2 months ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆563Updated last year
- chinese speech pretrained models☆1,143Updated 10 months ago
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…☆837Updated 4 months ago
- A Bob plugin that calls self-deployed Cosyvoice service to achieve TTS.☆39Updated 11 months ago
- 机器学习实战案例,涉及机器学习、深度学习等各个方向。每个案例代码量在百行左右。☆210Updated last month
- BERT-based intent and slots detector for chatbots.☆195Updated 4 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆513Updated last year
- 基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。☆736Updated 6 months ago
- 从小说中提取对话数据集☆214Updated last year
- 学习ChatGLM3模型和LangChain框架的架构与核心功能,并基于LangChain+ChatGLM3实现本地知识库问答。☆38Updated last year
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆273Updated last month
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆400Updated 6 months ago
- 本项目是基于Pytorch的语音合成项目,使用的是VITS,VITS是一种语音合成方法,这种时端到端的模型使用起来非常简单,不需要文本对齐等太复杂的流程,直接一键训练和生成,大大降低了学习门槛。☆49Updated 10 months ago
- Phi2-Chinese-0.2B 从0开始训练自己的Phi2中文小模型,支持接入langchain加载本地知识库做检索增强生成RAG。Training your own Phi2 small chat model from scratch.☆555Updated last year
- 大语言模型微调,Qwen2VL、Qwen2、GLM4指令微调☆447Updated last month
- Hugging Face Audio Course中文版,帮助学习者快速入门音频模态☆36Updated last year
- bert-base-chinese example☆908Updated last year
- Step-by-step Jupyter notebook tutorials for ChatTTS☆164Updated last year
- 中文标点符号模型,可以给文本添加标点符号。☆142Updated 6 months ago