badgids / transcription-app
a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulates typing the transcription in real-time wherever your cursor is on the screen. It can also do realtime translation.
☆20Updated last year
Alternatives and similar repositories for transcription-app:
Users that are interested in transcription-app are comparing it to the libraries listed below
- Desktop application for Linux and Windows that utilizes distil-whisper models from HuggingFace, to enable real-time offline speech-to-tex…☆57Updated 2 months ago
- An AI Discord bot that connects to a koboldcpp instance by API calls. Have a more intelligent Clyde Bot of your own making!☆37Updated 5 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API…☆27Updated 6 months ago
- AI Search engine☆12Updated last month
- time based thinking and structure like OpenAI's o1 preview.☆10Updated 6 months ago
- Using LLMs and rules for a local personal agent☆17Updated 2 months ago
- Okra, your all in one personal AI assistant☆14Updated 9 months ago
- ☆46Updated 4 months ago
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 9 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆33Updated 6 months ago
- Create storybooks using CrewAI, Groq, and Ollama☆20Updated last year
- AURORA (Artificial Unified Responsive Optimized Reasoning Agent) uses lobes and web research for RAG based memory and learning.☆16Updated 4 months ago
- ☆17Updated 3 months ago
- ☆41Updated 11 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 3 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆32Updated 8 months ago
- Local & private voice controlled notepad using whisper.cpp☆24Updated last year
- Seamless Voice Interactions with LLMs☆12Updated last year
- ☆12Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆46Updated 2 months ago
- Modern AI chatbot supporting multiple LLMs. Switch between Gemini, Mistral, Llama, Claude and ChatGPT.☆54Updated 3 weeks ago
- Simple LLM interface based on terminal.☆11Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆59Updated 5 months ago
- Streamlit Web UI for AGiXT☆26Updated last month
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆21Updated last month
- Anthropic Computer Use with Modal Sandboxes☆31Updated 5 months ago
- 🤖 AI driven interactive and autonomous i18n extractor and translator for projects with i18n internationalization modules 🌍☆23Updated 10 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆36Updated this week
- ☆37Updated last year