chidiwilliams / buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆13,231Updated this week
Alternatives and similar repositories for buzz:
Users that are interested in buzz are comparing it to the libraries listed below
- High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model☆8,759Updated 5 months ago
- Faster Whisper transcription with CTranslate2☆13,490Updated 2 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆38,875Updated 2 weeks ago
- All-in-one chatbot client☆10,103Updated 3 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆74,399Updated last week
- Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。☆11,522Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆13,382Updated this week
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,536Updated 4 months ago
- User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)☆24,786Updated this week
- A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large …☆5,492Updated last month
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech☆7,042Updated last year
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆6,814Updated 3 weeks ago
- 🤱🏻 Turn any webpage into a desktop app with Rust. 🤱🏻 利用 Rust 轻松构建轻量级多端桌面应用☆34,163Updated this week
- Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powere…☆20,198Updated last month
- 基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.☆24,095Updated 2 months ago
- Port of OpenAI's Whisper model in C/C++☆36,923Updated this week
- Desktop application of new Bing's AI-powered chat (Windows, macOS and Linux)☆9,226Updated 11 months ago
- AI agent stdlib that works with any LLM and TypeScript AI SDK.☆16,782Updated last month
- faster_whisper GUI with PySide6☆1,930Updated last month
- BibiGPT v1 · one-Click AI Summary for Audio/Video & Chat with Learning Content: Bilibili | YouTube | Tweet丨TikTok丨Dropbox丨Google Drive丨Lo…☆5,395Updated 11 months ago
- [NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆16,273Updated 3 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆30,505Updated last week
- SoftVC VITS Singing Voice Conversion☆26,318Updated last year
- Integrating ChatGPT into your browser deeply, everything you need is here☆10,237Updated last month
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆7,876Updated last month
- 视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos a…☆6,453Updated 3 weeks ago
- A generative speech model for daily dialogue.☆33,664Updated this week
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆7,594Updated 5 months ago
- GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.☆15,343Updated last month
- 🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.☆11,151Updated 2 weeks ago