dohyeondk / sub-toolsLinks
A robust Python toolkit for converting video/audio content into accurate, multilingual subtitles using WhisperX for transcription and Google's Gemini API for proofreading and translation.
☆25Updated last month
Alternatives and similar repositories for sub-tools
Users that are interested in sub-tools are comparing it to the libraries listed below
Sorting:
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 11 months ago
- Golang web client for Ollama, fast and easy to use.☆31Updated 6 months ago
- PasteLog is a simple, fast, and powerful pastebin. It allows you to publish Rich Text logs/Notes, and access them with a unique link.☆19Updated 2 months ago
- Empower Your Productivity with Local AI Assistants☆39Updated 3 months ago
- ☆23Updated 2 months ago
- 100% private AI transcription with an intuitive template system for maximum flexibility☆71Updated 6 months ago
- 一个精选的Veo3 AI视频生成提示词集合,包含各种创意场景和风格的视频提示词。Awesome Veo3 Prompts - A collection of 31 creative video generation prompts for Veo3 AI, featurin…☆41Updated 5 months ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.☆65Updated 3 months ago
- A bot that checks your grammar and phrasing using LLM of choice☆32Updated 11 months ago
- A minimal Android demo app for Kokoro-TTS☆40Updated 11 months ago
- converts EPUBs to MP3s/M4Bs using kokoro tts☆42Updated 3 weeks ago
- web based editor for subtitles and transcripts☆143Updated last year
- Extract2MD is a powerful and versatile AI-enabled client-side JavaScript library for extracting text from PDF files and converting it int…☆101Updated 8 months ago
- A modular, privacy-minded translation extension for browsers☆27Updated 5 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆42Updated last week
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17Updated 8 months ago
- 🎙️ Fast CLI tool to transcribe audio/video files to SRT format using OpenAI Whisper API☆20Updated last month
- This is a demo application showing how a dynamic video can be previewed in the browser using the Creatomate Preview SDK.☆16Updated last year
- Extract any sound with text prompts. Memory-optimized SAM-Audio with modern UI.☆266Updated 3 weeks ago
- Create an AI generated video slideshow from an audiobook. Audio Book Slides☆37Updated 8 months ago
- Batch convert video to text using openai's whisper or the local coreML via whisper.cpp on your MacBook☆77Updated 5 months ago
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆22Updated last year
- kokoro text to speech using javascript☆63Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing options☆161Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆48Updated last year
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 8 months ago
- ☆19Updated 2 months ago
- A multi engine TTS & LLM edge computing playground with audio book features and more!☆36Updated last week
- 🗣️🔊 Your Text-to-Speech Services, All-in-One.☆68Updated 3 weeks ago