shuaijiang / Whisper-FinetuneLinks
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
☆296Updated 3 months ago
Alternatives and similar repositories for Whisper-Finetune
Users that are interested in Whisper-Finetune are comparing it to the libraries listed below
Sorting:
- Pseudo Streaming SenseVoice with Hotwords☆343Updated 5 months ago
- ☆755Updated last year
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆519Updated last year
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆1,117Updated 2 months ago
- 使用vllm加速cosyvoice2的推理☆406Updated 4 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆607Updated last month
- Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…☆1,285Updated 5 months ago
- Port of Funasr's Sense-voice model in C/C++☆424Updated 2 months ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆576Updated last year
- ☆367Updated last year
- 基于标贝 数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆270Updated last year
- ☆201Updated 11 months ago
- Text Normalization & Inverse Text Normalization☆645Updated last month
- 中文标点符号模型,可以给文本添加标点符号。☆143Updated 8 months ago
- Added vLLM support to IndexTTS for faster inference.☆449Updated 3 weeks ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆496Updated 10 months ago
- 基于 faster-whisper 的伪实时语音转写服务☆226Updated 4 months ago
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆517Updated 5 months ago
- TTS appalication based on modelscope KAN-TTS☆43Updated last year
- ☆132Updated 2 years ago
- 端到端语音唤醒工具箱,从模型训练到 模型推理。☆127Updated 3 weeks ago
- 第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。☆555Updated last year
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆535Updated 2 years ago
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆265Updated this week
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆43Updated 10 months ago
- A 10000+ hours dataset for Chinese speech recognition☆559Updated 2 years ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆413Updated 8 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆76Updated this week
- 基于SparkTTS、OrpheusTTS等模型,提供高质量中文语音合成与声音克隆服务。☆513Updated 3 months ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆64Updated 2 weeks ago