Const-me / Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
☆8,126 Updated last month
Related projects:
- Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. ☆11,953 Updated this week
- Translate a video from one language to another and add dubbing. ☆9,955 Updated this week
- 1 min of voice data can also be used to train a good TTS model! (few-shot voice cloning) ☆32,567 Updated this week
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆17,474 Updated 3 months ago
- SoftVC VITS Singing Voice Conversion ☆25,358 Updated 10 months ago
- 🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite. ☆5,714 Updated 9 months ago
- Easily train a good VC model with voice data <= 10 mins! ☆22,944 Updated 2 weeks ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) ☆20,787 Updated this week
- An RWKV management and startup tool, fully automated, only 8MB. It also provides an interface compatible with the OpenAI API. RWKV is a large … ☆5,055 Updated this week
- Faster Whisper transcription with CTranslate2 ☆11,378 Updated 3 weeks ago
- 2^x Image Super-Resolution ☆5,580 Updated this week
- Image inpainting tool powered by SOTA AI models. Remove any unwanted objects, defects, or people from your pictures, or erase and replace (powere… ☆18,792 Updated 2 weeks ago
- vits2 backbone with multilingual-bert ☆7,793 Updated this week
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆11,637 Updated 2 months ago
- A generative speech model for daily dialogue. ☆30,703 Updated 2 weeks ago
- Integrating ChatGPT into your browser deeply, everything you need is here ☆9,879 Updated last month
- Convert AI papers to GUI. Make it easy and convenient for everyone to use cutting-edge artificial intelligence technology. ☆10,166 Updated last month
- Select-to-translate browser extension and cross-platform desktop application for translation based on the ChatGPT API. ☆23,604 Updated last week
- so-vits-svc fork with realtime support, improved interface and more features. ☆8,674 Updated this week
- Powerful Free DeepL API, No Token Required ☆6,022 Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆5,259 Updated 2 months ago
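- Edit videos with a text editor. ☆6,572 Updated 5 months ago
- Hard-coded video subtitle extraction that generates SRT files. No third-party API required; text recognition runs locally. A deep-learning-based framework for video subtitle extraction, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitle (hardsub) from videos a… ☆5,696 Updated last month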
- All-in-one chatbot client ☆9,938 Updated 2 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆6,675 Updated 9 months ago
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆4,692 Updated 2 months ago
- 🌈 Cross-platform software for select-to-translate text translation and recognition (OCR). ☆9,832 Updated this week
- [NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer ☆14,998 Updated last month
- A sound cloning tool with a web interface, using your voice or any sound to record audio ☆7,202 Updated 3 weeks ago
- faster_whisper GUI with PySide6 ☆1,386 Updated this week