chidiwilliams / buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆14,007 Updated this week
Alternatives and similar repositories for buzz:
Users who are interested in buzz are comparing it to the repositories listed below:
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model ☆9,086 Updated 7 months ago
- Faster Whisper transcription with CTranslate2 ☆14,975 Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ☆14,656 Updated this week
- Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation. ☆12,307 Updated this week
- 🤱🏻 Turn any webpage into a desktop app with Rust. Easily build lightweight multi-platform desktop apps with Rust. ☆36,925 Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ☆7,755 Updated last month
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere… ☆20,716 Updated last week
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆38,868 Updated 7 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆78,799 Updated 2 months ago
- Browser extension and cross-platform desktop application for select-to-translate translation based on the ChatGPT API. ☆24,282 Updated 4 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ☆4,572 Updated 11 months ago
- SOTA Open Source TTS ☆20,268 Updated last week
- faster_whisper GUI with PySide6 ☆2,221 Updated 3 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆43,162 Updated this week
- Keyviz is a free and open-source tool to visualize your keystrokes ⌨️ and 🖱️ mouse actions in real-time. ☆7,157 Updated 3 months ago
- 🔊 Text-Prompted Generative Audio Model ☆37,279 Updated 7 months ago
- Powerful Free DeepL API, No Token Required ☆7,391 Updated this week
- A sound cloning tool with a web interface, using your voice or any sound to record audio ☆8,347 Updated 3 months ago
- Convert AI papers to GUI, making it easy and convenient for everyone to use cutting-edge artificial intelligence technology. ☆10,497 Updated 6 months ago
- A generative speech model for daily dialogue. ☆35,409 Updated 2 weeks ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine ☆7,769 Updated 7 months ago
- Port of OpenAI's Whisper model in C/C++ ☆38,708 Updated this week
- 🔮 ChatGPT Desktop Application (Mac, Windows and Linux) ☆53,648 Updated 6 months ago
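- OCR software, free, open source, and offline. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Built-in multilingual libraries. ☆31,299 Updated this week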
- Foundational Models for State-of-the-Art Speech and Text Translation ☆11,434 Updated 4 months ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ☆8,856 Updated 3 weeks ago
- Voice Recognition to Text Tool / An offline, locally run tool that converts audio and video to subtitles, with output in JSON, SRT subtitle, and plain text formats ☆3,181 Updated 3 months ago
- 🚀 Power Your World with AI - Explore, Extend, Empower. ☆7,298 Updated last week
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...) ☆33,694 Updated last week
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix-level subtitle cut… ☆12,031 Updated this week