themanyone / caption_anythingLinks
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation again.
☆21Updated 2 months ago
Alternatives and similar repositories for caption_anything
Users that are interested in caption_anything are comparing it to the libraries listed below
Sorting:
- streaming speech to text server using Whisper☆98Updated 2 years ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆276Updated 2 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆159Updated 2 months ago
- web based editor for subtitles and transcripts☆141Updated last year
- Whisper from OpenAi and diarization with Pyannote☆50Updated last year
- Simulates talk with an AI that can express emotions☆82Updated 5 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆73Updated 4 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆24Updated last year
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆115Updated last year
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆21Updated last year
- Speaker diarization service☆24Updated 5 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Whatsapp Web Speech To Text☆54Updated 2 years ago
- A voice to text keyboard based on OpenAI Whisper Model.☆49Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆126Updated last year
- Coqui AI TTS plugin☆87Updated 4 months ago
- whisper.cpp bindings for python☆107Updated 2 years ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 11 months ago
- Offline voice input panel & keyboard with punctuation for Android.☆108Updated last year
- IRIS: Demonstrator for use of LLMs in python (outdated)☆63Updated 8 months ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆61Updated 6 months ago
- ☆32Updated last week
- faster-whisper as serverless endpoint☆125Updated last week
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year