Const-me / WhisperLinks
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
☆9,409Updated 10 months ago
Alternatives and similar repositories for Whisper
Users that are interested in Whisper are comparing it to the libraries listed below
Sorting:
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.☆14,629Updated last week
- Faster Whisper transcription with CTranslate2☆16,585Updated 2 weeks ago
- faster_whisper GUI with PySide6☆2,480Updated 6 months ago
- 🎤 微软语音合成工具,使用 Electron + Vue + ElementPlus + Vite 构建。☆6,004Updated 2 months ago
- Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。☆13,080Updated this week
- Port of OpenAI's Whisper model in C/C++☆40,830Updated this week
- Batch speech to text using OpenAI's whisper.☆296Updated 2 months ago
- 视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos a…☆7,417Updated last month
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆2,177Updated 2 months ago
- 用文本编辑器剪视频☆7,257Updated 8 months ago
- A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large …☆5,881Updated last month
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆8,596Updated 6 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆16,311Updated last week
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,602Updated last year
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,002Updated last week
- vits2 backbone with multilingual-bert☆8,468Updated last week
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆11,011Updated 3 weeks ago
- 🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。☆49,105Updated last week
- ↔️ Translate subtitle using ChatGPT☆1,640Updated last year
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆7,134Updated 2 months ago
- A generative speech model for daily dialogue.☆36,799Updated 3 weeks ago
- SOTA Open Source TTS☆21,914Updated last week
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆4,675Updated 3 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion☆4,930Updated 5 months ago
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.☆9,495Updated last month
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,560Updated 7 months ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆8,045Updated 10 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆8,464Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervision☆83,419Updated last month
- Powerful Free DeepL API, No Token Required☆7,731Updated last month