dohyeondk / sub-toolsLinks
A robust Python toolkit for converting video/audio content into accurate, multilingual subtitles using WhisperX for transcription and Google's Gemini API for proofreading and translation.
☆26Updated 2 months ago
Alternatives and similar repositories for sub-tools
Users that are interested in sub-tools are comparing it to the libraries listed below
Sorting:
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated last year
- Golang web client for Ollama, fast and easy to use.☆31Updated 6 months ago
- ☆23Updated 3 months ago
- A multi engine TTS & LLM edge computing playground with audio book features and more!☆39Updated last week
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 9 months ago
- 100% private AI transcription with an intuitive template system for maximum flexibility☆71Updated 6 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- A modular, privacy-minded translation extension for browsers☆27Updated 5 months ago
- ☆20Updated last year
- ☆45Updated 5 months ago
- Monika is an AI assistant that combines speech-to-text, natural language processing, and text-to-speech capabilities for seamless interac…☆25Updated 10 months ago
- cut pdf into pieces☆29Updated last month
- HanaVerse is a interactive web UI for chatting with ollama with a lively 2D anime character Hana. Star it on GitHub!☆55Updated 8 months ago
- 一个精选的Veo3 AI视频生成提示词集合,包含各种创意场景和风格的视频提示词。Awesome Veo3 Prompts - A collection of 31 creative video generation prompts for Veo3 AI, featurin…☆48Updated 6 months ago
- I’m trying to create something similar to Grammarly. Hail to open source!☆15Updated 8 months ago
- Create 3D files in the CLI with Small Language Model☆43Updated 3 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated last year
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17Updated 8 months ago
- A zero-dependency prompt manager/catalog/library in a single HTML file. Everything is stored locally in your browser. Meow. 😼☆62Updated 5 months ago
- 🍌Banana Postor|香蕉打印店:开源 信息/海报生成器,自动总结和配图,多种构图、配色可选,中英双语☆24Updated 4 months ago
- A minimal GUI application that generates transcriptions for audio and videos using Whisper neural network.☆16Updated 2 years ago
- ☆29Updated last year
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆44Updated 3 weeks ago
- converts EPUBs to MP3s/M4Bs using kokoro tts☆45Updated last month
- A bot that checks your grammar and phrasing using LLM of choice☆32Updated last year
- This is a demo application showing how a dynamic video can be previewed in the browser using the Creatomate Preview SDK.☆17Updated last year
- OpenAI API and Whisper based Video Translation☆74Updated last year
- Empower Your Productivity with Local AI Assistants☆39Updated 3 months ago
- AI tool for auto-research, TTS, and Graphical assembly into a completed Podcast☆84Updated 6 months ago
- ☆33Updated 7 months ago