iangitonga / capgenxLinks
A minimal GUI application that generates transcriptions for audio and videos using Whisper neural network.
☆16Updated 2 years ago
Alternatives and similar repositories for capgenx
Users that are interested in capgenx are comparing it to the libraries listed below
Sorting:
- Golang web client for Ollama, fast and easy to use.☆31Updated 6 months ago
- Simple agent framework using Ollama tool calling☆10Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated last year
- A robust Python toolkit for converting video/audio content into accurate, multilingual subtitles using WhisperX for transcription and Goo…☆26Updated 2 months ago
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆44Updated 3 weeks ago
- ☆23Updated 3 months ago
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆21Updated last year
- cursor.so api☆18Updated 2 years ago
- Context-aware LLM Translator (CALT)☆48Updated last year
- 🛡️ AI-powered system security auditor. A cross-platform TUI/CLI tool to analyze processes, network, and packages on Linux & Windows usin…☆40Updated 2 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Updated 3 weeks ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆28Updated 9 months ago
- Spec-driven thinking, nano-sized docs. Lightweight task specification for AI-assisted development.☆34Updated 2 weeks ago
- An interface for llama.cpp, ChatGPT, Gemini, and Claude☆27Updated last week
- Easy to use and open-source unknown stealer☆22Updated 2 years ago
- JotItNow is a AI Voice Notes App☆24Updated 11 months ago
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆31Updated last year
- Snag web pages like a polite robot with a browser☆25Updated this week
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- Query GPT in any input area.☆48Updated last year
- Web UI for working with large language models☆38Updated last year
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl…☆14Updated last year
- A bot that checks your grammar and phrasing using LLM of choice☆32Updated last year
- GitHub Linker Extension which show repo related info, like blog, video, similar project etc.☆21Updated 11 months ago
- An extension to use Kokoro TTS in text generation webui☆21Updated 9 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17Updated 8 months ago
- Web Interface for Vision Language Models Including InternVLM2☆25Updated last year
- 基于Dolphin模型的东方语言音视频转字幕api及webui☆19Updated 10 months ago
- Mic-controlled mouse clicks☆17Updated 4 months ago
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆36Updated last year