Const-me / WhisperLinks
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
☆9,849Updated last year
Alternatives and similar repositories for Whisper
Users that are interested in Whisper are comparing it to the libraries listed below
Sorting:
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆15,551Updated last week
- faster_whisper GUI with PySide6☆2,748Updated 10 months ago
- Faster Whisper transcription with CTranslate2☆18,757Updated last week
- Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式☆3,940Updated 2 months ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆2,593Updated last week
- ↔️ Translate subtitle using LLM☆1,655Updated last month
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,982Updated 9 months ago
- 🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。☆6,052Updated 7 months ago
- Translate the video from one language to another and add dubbing.视频翻译/语音转录/字幕配音工具☆15,023Updated last week
- 字幕机翻,翻译字幕文件 .srt .ass .vtt,和同类产品相比,特点是可以自己填写 API key,这样价格最低。最新版本 5.3.6(发布时间 2025 年 8 月 22 号)☆2,571Updated last month
- 视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos a…☆7,984Updated 2 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆22,232Updated 7 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆9,249Updated 2 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,637Updated last year
- vits2 backbone with multilingual-bert☆8,605Updated this week
- Port of OpenAI's Whisper model in C/C++☆44,056Updated this week
- Edited from Const-me/Whisper.☆139Updated 3 weeks ago
- CapsWriter 的离线版,一个好用的 PC 端的语音输入工具☆4,307Updated last year
- 一个可以录制 Microsoft Edge 浏览器的语音合成(TTS)语音并输出为 .wav 音频的(windows平台)工具。☆1,358Updated 11 months ago
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆8,804Updated 2 months ago
- [ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting☆6,337Updated 8 months ago
- Open source real-time translation app for Android that runs locally☆9,312Updated this week
- [NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆17,595Updated last year
- 基于AI的图片/视频硬字幕去除、文本水印去除,无损分辨率生成去字幕、去水印后的图片/视频文件。无需申请第三方API,本地实现。AI-based tool for removing hard-coded subtitles and text-like watermarks f…☆8,283Updated 4 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,716Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆18,436Updated last week
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,388Updated 2 months ago
- Windows desktop front end for Spleeter - AI source separation☆2,588Updated 2 years ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆1,129Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆90,019Updated last month