iangitonga / capgenx
A minimal GUI application that generates transcriptions for audio and video using the Whisper neural network.
☆17Updated 2 years ago
Alternatives and similar repositories for capgenx
Users who are interested in capgenx are comparing it to the repositories listed below.
- Golang web client for Ollama, fast and easy to use.☆29Updated 3 months ago
- A text-to-speech tool based on PYTTSX☆10Updated 2 years ago
- Simple agent framework using Ollama tool calling☆10Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 9 months ago
- Context-aware LLM Translator (CALT)☆43Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- An extension to use Kokoro TTS in text generation webui☆22Updated 6 months ago
- Codai is an AI programming tool that boosts coding efficiency and empowers non-programmers. Its future plans include introducing a local …☆24Updated 2 months ago
- GitHub Linker Extension which shows repo-related info, like blog, video, similar projects, etc.☆22Updated 8 months ago
- Web-based editor for subtitles and transcripts☆141Updated last year
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Updated 5 months ago
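- An open-source, modern-design LLMs/AI chat framework. Supports multiple AI providers (OpenAI/Claude 3/Gemini/Ollama/Bedrock/Azure/Mistral/Conspirity), multimodal (Vision/TTS), and a plugin system. One-click free deployment of your private ChatGP…☆23Updated this week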
- A bot that checks your grammar and phrasing using the LLM of your choice☆32Updated 9 months ago
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.☆62Updated last month
- Chat with your pdf using your local LLM, OLLAMA client.☆40Updated last year
- A cross-platform and customizable VLC video player that can generate subtitles using the WhisperX model☆14Updated 2 years ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 10 months ago
- JotItNow is an AI Voice Notes App☆21Updated 8 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆13Updated last year
- 100% private AI transcription with an intuitive template system for maximum flexibility☆70Updated 3 months ago
- API and web UI for converting audio and video in Eastern languages to subtitles, based on the Dolphin model☆19Updated 7 months ago
- The GUI for "paddlepaddle" OCR☆47Updated 2 weeks ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆15Updated 9 months ago
- RSS/Atom feed reader for your desktop.☆41Updated last week
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI☆58Updated 11 months ago
- ☆14Updated 7 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆16Updated 5 months ago
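- Simulates browser script operations, using nodejs to batch-read and manipulate netdisk file information. This repository is part of the `Baidu Netdisk batch duplicate-file cleanup project`.☆11Updated 2 years ago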
- ☆21Updated 7 months ago
- A real-time offline transcriber with a GUI, based on OpenAI Whisper☆16Updated last year