Recognizes Chinese speech in audio or video and exports it as SRT subtitles, based on the Paraformer model from the ModelScope (魔塔) community
☆114Jul 10, 2024Updated last year
Alternatives and similar repositories for zh_recogn
Users that are interested in zh_recogn are comparing it to the libraries listed below.
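- Online video processing tool based on ffmpeg.wasm☆55May 15, 2024Updated last year
- One-click voice cloning and parody text generation project featuring Zhang Yimou (nicknamed "Guoshi")☆17Jun 15, 2023Updated 2 years ago
- Voice conversation project that integrates GPT-SoVITS into the RWKV_Role_Playing project☆31Apr 8, 2024Updated 2 years ago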
- Voice Recognition to Text Tool / A local, offline audio/video-to-subtitle tool that outputs JSON, SRT subtitles, and plain text☆4,507Jan 22, 2026Updated 3 months ago
- API tool for local offline text translation supporting multiple languages☆482Nov 4, 2024Updated last year
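- One-click music transcription☆18Nov 15, 2023Updated 2 years ago
- Implements ChatGPT's streaming response protocol, Server-sent events, using the asynchronous non-blocking framework Tornado 6.0 on Python 3.10 and the front-end framework Vue.js 3☆23Mar 7, 2023Updated 3 years ago
- Standalone integrated WebUI for Bert-vits2 transcription and annotation, integrating Alibaba FunAsr, 必剪 (Bcut) Asr, and the Whisper large model☆182Jul 10, 2024Updated last year
- Scripts for automated dataset creation☆71Mar 26, 2023Updated 3 years ago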
- ChatTTS 2K Speaker Stability Score 🥇 & Categorized by Gender and Age 👧 & Audio Preview 🔈☆722Jul 2, 2024Updated last year
- Adds streaming inference and a streaming API to the Bert-vits2-Extra project☆16Apr 12, 2024Updated 2 years ago
- EZ Translate: A chrome extension using AI LLMs (e.g., Gemini, Gemma, QWEN) for on-page, screenshot and popup translations.☆45Dec 8, 2025Updated 4 months ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio☆8,948Aug 29, 2025Updated 8 months ago
- Create a completely free translation API service on Cloudflare based on m2m100☆52Nov 8, 2024Updated last year
- an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems …☆1,968Nov 26, 2024Updated last year
- Tool for converting text corpora into training sets (txt to dataset)☆92May 1, 2024Updated 2 years ago
- A minimalist audio/video format conversion tool☆21Mar 12, 2024Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Uses GitHub Actions to fetch the weather and generate an image for third-party sharing☆15Mar 17, 2026Updated last month
- ☆30Aug 12, 2023Updated 2 years ago
- HOOKUI, No Code, dynamic analysis☆75Oct 31, 2025Updated 6 months ago
- Converts subtitles by emulating Jianying (剪映)☆45Jan 9, 2023Updated 3 years ago
- A simple local web interface that uses ChatTTS to synthesize text into speech, with support for exposing an external API☆7,542Dec 5, 2025Updated 5 months ago
- Cross-platform AI client for converting video to illustrated text (win, mac, linux)☆342Oct 11, 2024Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- WebUI and OpenAI-compatible API for kokoro TTS☆40Feb 4, 2025Updated last year
- WebUI for Whisper API☆36Sep 14, 2024Updated last year
- Audio loudness unification and volume normalization☆13May 3, 2024Updated 2 years ago
- Audio tools for Python☆16Dec 5, 2025Updated 5 months ago
- SenseVoice-python: An enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆112Oct 6, 2025Updated 7 months ago
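- Transparent lock screen: keeps screen content visible while locked! Prevents accidental operations and protects privacy. Suitable for display, entertainment, and security scenarios.☆347Dec 26, 2025Updated 4 months ago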
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
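- Extracts hard-coded video subtitles using a deep learning framework; a Docker container removes the need to install deep learning libraries, and a local API separates the interface from back-end recognition☆23Dec 20, 2021Updated 4 years ago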
- A fork of Rope with webcam support☆13Mar 13, 2024Updated 2 years ago
- llama_index_examples_python3.10: ChatGPT-based vector index optimization for vertical-domain corpora☆36Apr 10, 2023Updated 3 years ago
- Generates a timbre (voice) file from audio☆36Aug 6, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 11 months ago
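- Synthesizes speech with CHATTTS, uses FASTAPI as the API server, and includes a management system built on GFAST, providing voice management and a WebUI☆35Jun 14, 2024Updated last year
- An opinionated video processing tool that pursues AI effects☆55Mar 9, 2022Updated 4 years ago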