themanyone / caption_anythingLinks
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation again.
☆23Updated 3 months ago
Alternatives and similar repositories for caption_anything
Users that are interested in caption_anything are comparing it to the libraries listed below
Sorting:
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆278Updated 2 weeks ago
- Coqui AI TTS plugin☆85Updated 5 months ago
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆127Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆245Updated 4 months ago
- web based editor for subtitles and transcripts☆142Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Transcription with speaker diarization pipeline☆97Updated 2 years ago
- A simple, accessible and offline real-time transcription app for Android.☆13Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆72Updated 5 months ago
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 …☆159Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆162Updated last week
- Offline voice input panel & keyboard with punctuation for Android.☆108Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- ☆77Updated 2 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- whisper.cpp bindings for python☆108Updated 2 years ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated 4 months ago
- ☆38Updated 2 years ago
- ☆33Updated last month
- Pybind11 bindings for Whisper.cpp☆62Updated 3 weeks ago
- IRIS: Demonstrator for use of LLMs in python (outdated)☆63Updated 9 months ago
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆62Updated 7 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆72Updated last week
- OpenAI-Assistant API integration with Speech Recognition and Eleven Labs TTS. User can choose name, description, model of assistant and …☆18Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year