badgids / transcription-app
A transcription application that listens to audio input from the microphone, transcribes it into text using OpenAI's Whisper, and simulates typing the transcription in real time wherever your cursor is on the screen. It can also do real-time translation.
☆20 Updated last year
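The description above amounts to a three-stage loop: capture audio, transcribe it, type the result. The following is only a minimal sketch of that loop, not the repository's actual code. The function name `stream_transcription` and its parameters are hypothetical, and the audio capture, Whisper call, and keystroke simulation are passed in as plain callables so the control flow can be run without a microphone or model.

```python
from typing import Callable, Iterable, List


def stream_transcription(
    audio_chunks: Iterable[bytes],
    transcribe: Callable[[bytes], str],
    type_text: Callable[[str], None],
) -> List[str]:
    """Transcribe each audio chunk and pass non-empty text to a typing callback.

    All three arguments are illustrative stand-ins (assumptions):
      audio_chunks - successive microphone recordings
      transcribe   - a speech-to-text function, e.g. a wrapper around a Whisper model
      type_text    - a function that simulates keystrokes at the cursor
    """
    typed: List[str] = []
    for chunk in audio_chunks:
        text = transcribe(chunk).strip()
        if not text:
            continue  # nothing recognized in this chunk (e.g. silence)
        type_text(text + " ")  # trailing space separates consecutive chunks
        typed.append(text)
    return typed


if __name__ == "__main__":
    # Stand-in components: bytes are decoded instead of transcribed,
    # and "typing" just appends to a list.
    keystrokes: List[str] = []
    chunks = [b"hello", b"", b"world"]
    result = stream_transcription(chunks, lambda c: c.decode(), keystrokes.append)
    print(result)
    print(keystrokes)
```

In a real setup, `transcribe` might wrap a loaded Whisper model and `type_text` a keyboard-automation library; translation would then be a matter of how the speech-to-text callable is configured rather than a change to the loop.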
Alternatives and similar repositories for transcription-app:
Users who are interested in transcription-app are comparing it to the repositories listed below
- Okra, your all in one personal AI assistant ☆14 Updated 9 months ago
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex… ☆56 Updated 2 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆53 Updated 3 months ago
- Seamless Voice Interactions with LLMs ☆12 Updated last year
- Using LLMs and rules for a local personal agent ☆17 Updated 2 months ago
- time based thinking and structure like OpenAI's o1 preview. ☆10 Updated 6 months ago
- AI Search engine ☆12 Updated last month
- ☆17 Updated 3 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 Updated 9 months ago
- ☆46 Updated 4 months ago
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning. ☆16 Updated 4 months ago
- Gradio based tool to run opensource LLM models directly from Huggingface ☆91 Updated 9 months ago
- Discord chatbot interface to train an LLM on user message history ☆27 Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆29 Updated 6 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management. ☆22 Updated 7 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆27 Updated 6 months ago
- ☆13 Updated last month
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great. ☆28 Updated 3 weeks ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you? ☆21 Updated 9 months ago
- ☆37 Updated last year
- LLM backed Fantasy Tribe Game ☆18 Updated 4 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making! ☆37 Updated 5 months ago
- Automated LLM novelist ☆44 Updated 11 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆21 Updated this week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools. ☆21 Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆60 Updated this week
- ☆28 Updated 5 months ago
- ☆14 Updated last year
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati… ☆25 Updated 2 months ago
- Agentic RAG to help you build a startup🚀 ☆16 Updated 3 weeks ago