themanyone / caption_anything
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation again.
☆16Updated 11 months ago
Alternatives and similar repositories for caption_anything:
Users that are interested in caption_anything are comparing it to the libraries listed below
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 7 months ago
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆43Updated last week
- Okra, your all in one personal AI assistant☆14Updated 8 months ago
- Speaker diarization service☆21Updated last month
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆49Updated 2 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆44Updated last month
- IRIS: Intelligent Residential Integration System - a mind for your home!☆61Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆31Updated 7 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆21Updated this week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆40Updated 2 months ago
- Prompt Jinja2 templates for LLMs☆29Updated last month
- ☆16Updated 2 months ago
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆197Updated 3 months ago
- OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name, description, model of assistant and …☆18Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆34Updated this week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 5 months ago
- Local & private voice controlled notepad using whisper.cpp☆23Updated last year
- Text generation in Python, as easy as possible☆54Updated 2 weeks ago
- ☆16Updated last year
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆19Updated last year
- AI Voice Assistant: Talk to an AI agent that helps you with event scheduling, contact management, accessing your knowledge base, and web …☆29Updated 2 months ago
- WIP exploration using Twilio Media Streams and Generative AI☆39Updated last year
- a transcription application that listens to audio input from the microphone using OpenAI's Whisper, transcribes it into text, and simulat…☆18Updated 11 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆21Updated last year
- Locally running LLM with internet access☆93Updated 4 months ago
- A simple speech-to-text and text-to-speech AI chatbot that can be run fully offline.☆44Updated last year
- 🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.☆27Updated last year
- A VoiceAsistant with WhisperAI speech recognition☆29Updated 2 months ago