heyudage / VoiceTypingLinks
通过语音(说话)即可完成实时文本输入。通过PaddleSpeech项目二次开发 完成,支持离线脱网环境部署,支持GPU推理,目前客户端仅支持Windows。
☆25Updated 2 years ago
Alternatives and similar repositories for VoiceTyping
Users that are interested in VoiceTyping are comparing it to the libraries listed below
Sorting:
- SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…☆99Updated 10 months ago
- 重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。☆49Updated 2 years ago
- chinese real time voice cloning☆38Updated 5 years ago
- 使用onnxruntime部署实时视频帧插值,包含C++和Python两个版本的程序☆27Updated last year
- QGUI - 0.1MB超轻量Python GUI框架,用模板来快捷制作深度学习模型推理界面☆127Updated 2 years ago
- some ncnn demos of FunASR☆27Updated last year
- Qwen-TTS offers a robust voice synthesis service using FastAPI, supporting bilingual and dialect options. Explore seamless audio generati…☆80Updated this week
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆34Updated 10 months ago
- 离线语音合成☆34Updated 2 years ago
- 使用onnxruntime部署facefusion换脸,包含C++和Python两个版本的程序☆120Updated last year
- 声纹识别☆23Updated last year
- EIVideo- 交互式智能视频标注工具,几次鼠标点击即可解放双手,让视频标注更加轻松☆31Updated 3 years ago
- 使用OpenCV+onnxruntime部署中文clip做以文搜图,给出一句话来描述想要的图片,就能从图库中搜出来符合要求的图片。包含C++和Python两个版本的程序☆79Updated last year
- 一个多语言支持、易使用的 OCR 项目。An easy-to-use OCR project with multilingual support.☆124Updated 3 years ago
- 自用,语音到文本用的sencevoice,llm部分基于ollama的API调用,文本到语音用的cosyvoice,实时语音输入参考的https://github.com/ABexit/ASR-LLM-TTS。☆10Updated 10 months ago
- 使用ONNXRuntime部署人脸动漫化——AnimeGAN,包含C++和Python两个版本的代码实现☆44Updated 3 years ago
- 一个模块化,全过程可离线,低占用率的对话机器人/智能音箱☆113Updated 7 months ago
- ChatTTS HTTP API☆54Updated last year
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- paddlespeech用于语音合成的简单操作界面☆25Updated 2 years ago
- This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficiently and accurately build multi-character voice library…☆55Updated 8 months ago
- 替换照片中人物背景☆22Updated 3 years ago
- 大模型驱动的虚拟主播☆12Updated last year
- 小智同学测试工具(websocket)☆45Updated 8 months ago
- 使用ONNXRuntime部署U-2-Net生成人脸素描画,包含C++和Python两个版本的程序☆37Updated 3 years ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆41Updated 11 months ago
- qwen2 and llama3 cpp implementation☆47Updated last year
- 🔥 🔥 🔥 This is the implementation of vehicle license plate recognition powered by PaddleOCR-2.4☆23Updated 3 years ago
- 语音技术:文字转语音☆46Updated 2 years ago
- A real-time swarf detection and analysis system based on YOLO and Qwen-vl-max, providing efficient video stream processing and intelligen…☆35Updated 2 months ago