Const-me / Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
☆8,479 Updated 3 months ago
Related projects
Alternatives and complementary repositories for Whisper
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. ☆12,570 Updated this week
- faster_whisper GUI with PySide6 ☆1,648 Updated 2 months ago
- Faster Whisper transcription with CTranslate2 ☆12,540 Updated this week
- Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation. ☆10,766 Updated this week
- 🌈 A cross-platform application for selection-based text translation and OCR. ☆10,528 Updated this week
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆6,876 Updated 11 months ago
- Port of OpenAI's Whisper model in C/C++ ☆35,738 Updated this week
- Browser extension and cross-platform desktop application for selection-based translation, built on the ChatGPT API. ☆23,919 Updated this week
- ↔️ Translate subtitle using ChatGPT ☆1,606 Updated 7 months ago
- vits2 backbone with multilingual-bert ☆8,011 Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision ☆71,523 Updated last week
- Whisper based Japanese subtitle generator ☆1,596 Updated 3 weeks ago
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆11,992 Updated 4 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆12,524 Updated 3 months ago
- SoftVC VITS Singing Voice Conversion ☆25,916 Updated last year
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆6,334 Updated this week
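- Extracts hard-coded subtitles from video and generates srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitle (hardsub) from videos a… ☆6,135 Updated 3 weeks ago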
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆4,753 Updated 4 months ago
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity… ☆7,005 Updated this week
- Batch speech to text using OpenAI's whisper. ☆262 Updated this week
- 🔊 Text-Prompted Generative Audio Model ☆36,141 Updated 3 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,444 Updated 7 months ago
- Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux) ☆9,254 Updated 9 months ago
- Powerful Free DeepL API, No Token Required ☆6,654 Updated 2 weeks ago
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten… ☆11,165 Updated this week
- Offline version of CapsWriter, a handy voice input tool for PC. ☆2,962 Updated 4 months ago
- A generative speech model for daily dialogue. ☆32,442 Updated 2 weeks ago
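- OCR software, free, open-source, and offline. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Built-in multi-language libraries. ☆27,368 Updated last month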
- A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large … ☆5,299 Updated last month
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆35,856 Updated 2 weeks ago