shuaijiang / Whisper-Finetune
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment
☆239Updated 2 months ago
Alternatives and similar repositories for Whisper-Finetune:
Users that are interested in Whisper-Finetune are comparing it to the libraries listed below
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆966Updated last month
- Pseudo Streaming SenseVoice with Hotwords☆202Updated last week
- Port of Funasr's Sense-voice model in C/C++☆272Updated this week
- ☆644Updated 9 months ago
- 基于 faster-whisper 的伪实时语音转写服务☆202Updated 5 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆351Updated 4 months ago
- KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…☆499Updated last year
- ☆189Updated 5 months ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆257Updated last year
- ☆337Updated 7 months ago
- Text Normalization & Inverse Text Normalization☆545Updated 3 months ago
- 端到端语音唤醒工具箱,从模型训练到模型推理。☆105Updated 6 months ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆381Updated 9 months ago
- 中文标点符号模型,可以给文本添加标点符号。☆137Updated 2 months ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆337Updated 2 months ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆529Updated 9 months ago
- ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).☆62Updated 2 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆170Updated 7 months ago
- ☆101Updated last year
- TTS appalication based on modelscope KAN-TTS☆43Updated 10 months ago
- 一个用于CosyVoice的api接口项目☆223Updated last month
- SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.☆477Updated last month
- Sample Repository for the AlibabaCloud Bailian Speech SDK☆102Updated 2 weeks ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆31Updated 4 months ago
- chinese speech pretrained models☆1,077Updated 6 months ago
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆500Updated last year
- An Open-Sourced LLM-empowered Foundation TTS System☆630Updated 4 months ago
- 本项目使用了EcapaTdnn、ResNetSE、ERes2Net、CAM++等多种先进的声纹识别模型,同时本项目也支持了MelSpectrogram、Spectrogram、MFCC、Fbank等多种数据预处理方法☆258Updated this week
- a gradio webui for faster whisper☆252Updated last year