jianchang512 / zh_recognView external linksLinks
将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型
☆115Jul 10, 2024Updated last year
Alternatives and similar repositories for zh_recogn
Users that are interested in zh_recogn are comparing it to the libraries listed below
Sorting:
- 张艺谋(国师)一键声音克隆和恶搞文本生成项目☆17Jun 15, 2023Updated 2 years ago
- 基于 RWKV_Role_Playing 项目接入GPT-SoVITS语音对话项目☆30Apr 8, 2024Updated last year
- EZ Translate: A chrome extension using AI LLMs (e.g., Gemini, Gemma, QWEN) for on-page, screenshot and popup translations.☆41Dec 8, 2025Updated 2 months ago
- 数据集自动化制作脚本☆72Mar 26, 2023Updated 2 years ago
- ☆11Feb 20, 2025Updated 11 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆184Jul 10, 2024Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式☆4,269Jan 22, 2026Updated 3 weeks ago
- ☆12Apr 1, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 8 months ago
- ChatTTS 2000条音色稳定性打分🥇+区分男女年龄👧+在线试听🔈 ChatTTS 2K Speaker Stability Score & Categorized by Gender and Age & Audio Preview☆705Jul 2, 2024Updated last year
- ☆30Aug 12, 2023Updated 2 years ago
- 第三方Doc2X桌面应用,支持Linux(X11,Wayland)/Windows☆39Aug 9, 2024Updated last year
- 透明锁屏-锁屏时保持屏幕内容可见!防止误操作,保护隐私。适用于展示、娱乐和安全场景。☆196Dec 26, 2025Updated last month
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Oct 6, 2025Updated 4 months ago
- 文本语料转训练集工具,txt转dataset☆93May 1, 2024Updated last year
- Python的音频工具☆16Dec 5, 2025Updated 2 months ago
- 一个极简的音视频格式转换工具☆19Mar 12, 2024Updated last year
- webrtc sharing file☆13Dec 30, 2024Updated last year
- 基于Bert-vits2-Extra项目添加的流式推理和流式接口api功能☆15Apr 12, 2024Updated last year
- 用于kokoro TTS的webui界面和兼容openai api☆40Feb 4, 2025Updated last year
- 视频转图文 AI跨平台客户端(win mac linux)☆334Oct 11, 2024Updated last year
- 一个有想法的视频处理工具,追求 AI 效果☆55Mar 9, 2022Updated 3 years ago
- Api tool for local offline text translation supporting multiple languages/支持多语言的本地离线文字翻译api☆481Nov 4, 2024Updated last year
- ☆39Oct 1, 2023Updated 2 years ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆8,908Aug 29, 2025Updated 5 months ago
- an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems …☆1,790Nov 26, 2024Updated last year
- CloudFlare free temp domain email 免费 临时 域名邮箱☆78Updated this week
- 在cloudflare上基于m2m100创建完全免费的翻译API服务☆50Nov 8, 2024Updated last year
- Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert☆21Jun 14, 2024Updated last year
- 一键扒谱☆18Nov 15, 2023Updated 2 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆23Nov 12, 2025Updated 3 months ago
- Wechat robot for Java 使用Java开发的微信机器人程序☆18Sep 12, 2016Updated 9 years ago
- Official forum for aiverything, please file your issue here.☆22Jan 15, 2026Updated last month
- Translate the video from one language to another and embed dubbing & subtitles.☆16,150Updated this week
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,504Dec 5, 2025Updated 2 months ago
- Bilingual Singing Voice Synthesis☆18Mar 25, 2024Updated last year
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- 一个聚合AI相关节目的播客rss☆20Feb 8, 2026Updated last week