将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型
☆113Jul 10, 2024Updated last year
Alternatives and similar repositories for zh_recogn
Users that are interested in zh_recogn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 张艺谋(国师)一键声音克隆和恶搞文本生成项目☆17Jun 15, 2023Updated 2 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- 基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目☆31Apr 8, 2024Updated 2 years ago
- Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式☆4,545Jan 22, 2026Updated 4 months ago
- Api tool for local offline text translation supporting multiple languages/支持多语言的本地离线文字翻译api☆483Nov 4, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 一键扒谱☆18Nov 15, 2023Updated 2 years ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆182Jul 10, 2024Updated last year
- 数据集自动化制作脚本☆71Mar 26, 2023Updated 3 years ago
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆723Jul 2, 2024Updated last year
- 基于Bert-vits2-Extra项目添加的流式推理和流式接口api功能☆16Apr 12, 2024Updated 2 years ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆8,955Aug 29, 2025Updated 8 months ago
- 在cloudflare上基于m2m100创建完全免费的翻译API服务☆54Nov 8, 2024Updated last year
- an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems …☆1,974Nov 26, 2024Updated last year
- 文本语料转训练集工具,txt转dataset☆92May 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 一个极简的音视频格式转换工具☆21Mar 12, 2024Updated 2 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- 利用Github Action的能力获取天气并生成图片,用于第三方分享☆15Mar 17, 2026Updated 2 months ago
- Translate the video from one language to another and embed dubbing & subtitles.☆17,521Updated this week
- ☆30Aug 12, 2023Updated 2 years ago
- 模拟剪映转换字幕☆45Jan 9, 2023Updated 3 years ago
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,567Dec 5, 2025Updated 5 months ago
- 视频转图文 AI跨平台客户端(win mac linux)☆345Oct 11, 2024Updated last year
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- IMAGdressing在Windows环境下运行的webui界面☆22Jul 25, 2024Updated last year
- 用于kokoro TTS的webui界面和兼容openai api☆40Feb 4, 2025Updated last year
- 音频响度统一,音量归一化处理☆13May 3, 2024Updated 2 years ago
- Vue.js3+Tornado6 前后端分离异步非阻塞教育平台项目☆10Dec 7, 2022Updated 3 years ago
- Python的音频工具☆16Dec 5, 2025Updated 5 months ago
- 可循环值守和多人录制的直播录制软件,支持抖音、TikTok、快手、虎牙、斗鱼、B站、小红书、网易cc、pandaTV等平台直播录制,抓取多平台直播源地址,抖音无水印解析,快手无水印解析☆19Feb 8, 2024Updated 2 years ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆111Oct 6, 2025Updated 7 months ago
- 透明锁屏-锁屏时保持屏幕内容可见!防止误操作,保护隐私。适用于展示、娱乐和安全场景。☆363May 17, 2026Updated last week
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 启发自Nuxt3的前端模板库☆12Mar 16, 2022Updated 4 years ago
- 使用深度学习框架提取视频硬字幕;docker容器免安装深度学习库,使用本地api接口使得界面和后端识别分离;☆23Dec 20, 2021Updated 4 years ago
- A fork of Rope with webcam support☆13Mar 13, 2024Updated 2 years ago
- 跨语种语音克隆,中文版Webui☆62Jan 4, 2024Updated 2 years ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 11 months ago
- 使用CHATTTS合成语音,使用FASTAPI作为API服务端,基于GFAST制作了管理系统,提供了音色管理和webui界面☆35Jun 14, 2024Updated last year
- 一个有想法的视频处理工具,追求 AI 效果☆55Mar 9, 2022Updated 4 years ago