MyloBishop / transperLinks
A simple implementation of real-time output device audio transcription and translation using "faster_whisper" and "pyaudiowpatch".
☆22Updated 2 years ago
Alternatives and similar repositories for transper
Users that are interested in transper are comparing it to the libraries listed below
Sorting:
- 基于 faster-whisper 的伪实时语音转写服务☆232Updated 7 months ago
- 实时STT,连接OpenAI接口/智谱AI(流式LLM)和GPT-SOVITS/Edge-TTS,通过网页的方式,进行跨网络的服务调用,实现实时对话的效果☆429Updated 11 months ago
- API and websocket server for sensevoice. It has inherited some enhanced features, such as VAD detection, real-time streaming recognition,…☆532Updated last year
- 这是一款基于FunASR实现的说话人分离的GUI程序☆146Updated last week
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆59Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆406Updated 9 months ago
- Step-by-step Jupyter notebook tutorials for ChatTTS☆172Updated last year
- This is a web-based intelligent dialogue program built using ASR, LLM, and TTS.☆24Updated last year
- SummerTTS 是一个基于C++的独立编译的中文和英文语音合成项目,可以本地运行不需要网络,而且没有额外的依赖,一键编译完成即可用于中文和英文的语音合成。SummerTTS is a standalone Chinese and English speech synt…☆515Updated 5 months ago
- 📣 商用级开源语音自动识别程序库,开箱即用,全平台支持,中英文混合识别。A Cross-platform implementation of ASR inference. It's based on ONNXRuntime and FunASR. We provide …☆594Updated last year
- Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高☆518Updated last month
- ☆372Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆12Updated last year
- This is a speech interaction system built on an open-source model, integrating ASR, LLM, and TTS in sequence. The ASR model is SenceVoice…☆1,058Updated 9 months ago
- ☆17Updated last year
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆1,165Updated this week
- ☆49Updated 2 years ago
- Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training wit…☆312Updated 3 weeks ago
- 一个用于CosyVoice的api接口项目☆325Updated 3 months ago
- a gradio webui for faster whisper☆275Updated 2 years ago
- Fast-TTS 是一个基于异步框架的文本到语音转换(TTS)生成器项目。该项目利用了异步编程技术来高效处理请求和响应,实现了快速、秒级的流式生成长文本语音播放服务。Fast-TTS 可以快速地将长文本转换为语音流,并实时播放,适用于多种应用场景,如语音合成、智能助手、内容…☆53Updated last year
- ☆814Updated last year
- wukong-robot项目是由github网友wzpan等开发并维护的一个开源中文 语音对话机器人项目,能够让感兴趣的开发者快速打造个性化的智能音箱。 模块化。功能插件、语音识别、语音合成、对话机器人都做到了高度模块化,第三方插件单独维护,方便继承和开发自己的插件 - 中文…☆55Updated 4 years ago
- 基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。☆753Updated this week
- 小智同学测试工具(websocket)☆47Updated 10 months ago
- 这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树 莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…☆543Updated 2 years ago
- 基于FunASR官方Demo修改的WS服务端,配合FastAPI提供HTTP服务,可以在浏览器中进行实时ASR测试☆43Updated 4 months ago
- 异步语音对话组件。☆30Updated 9 months ago
- 重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。☆49Updated 2 years ago