chidiwilliams / buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆14,629Updated last week
Alternatives and similar repositories for buzz
Users who are interested in buzz are comparing it to the repositories listed below.
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model☆9,409Updated 10 months ago
- Faster Whisper transcription with CTranslate2☆16,585Updated 2 weeks ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆16,311Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision☆83,419Updated last month
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten…☆12,002Updated last week
- Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.☆13,080Updated this week
- Easily train a good VC model with voice data <= 10 mins!☆30,118Updated 6 months ago
- 🔊 Text-Prompted Generative Audio Model☆38,031Updated 10 months ago
- Port of OpenAI's Whisper model in C/C++☆40,830Updated this week
- ☆34,443Updated last year
- High-Resolution Image Synthesis with Latent Diffusion Models☆41,141Updated 8 months ago
- A generative speech model for daily dialogue.☆36,799Updated 3 weeks ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.☆4,602Updated last year
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere…☆21,547Updated last month
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix-level subtitle cutting…☆13,233Updated last month
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,685Updated 9 months ago
- 🎤 Microsoft speech synthesis tool, built with Electron + Vue + ElementPlus + Vite.☆6,004Updated 2 months ago
- Edit videos with a text editor☆7,257Updated 8 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆14,672Updated last week
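- Extracts hard-coded subtitles from videos and generates SRT files. No third-party API required; text recognition runs locally. A deep-learning-based video subtitle extraction framework, including subtitle region detection and subtitle content extraction. A GUI tool for extracting hard-coded subtitle (hardsub) from videos a…☆7,417Updated last month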
- A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity…☆11,011Updated 3 weeks ago
- All-in-one chatbot client☆10,332Updated 3 months ago
- faster_whisper GUI with PySide6☆2,480Updated 6 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,496Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks.☆20,962Updated 3 months ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆35,368Updated this week
- SOTA Open Source TTS☆21,914Updated last week
- Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API.☆24,479Updated 7 months ago
- Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.☆31,335Updated 10 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11,560Updated 7 months ago