themanyone / caption_anythingLinks
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation again.
☆23Updated 5 months ago
Alternatives and similar repositories for caption_anything
Users that are interested in caption_anything are comparing it to the libraries listed below
Sorting:
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆287Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆68Updated 2 years ago
- streaming speech to text server using Whisper☆101Updated 2 years ago
- Record audio or transcribe files using ctranslate2 and whisper!☆170Updated this week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆121Updated 2 years ago
- ☆100Updated last year
- Speaker diarization service☆26Updated last week
- Live-Transcription (STT) with Whisper PoC☆202Updated last year
- IRIS: Demonstrator for use of LLMs in python (outdated)☆63Updated 10 months ago
- A framework for creating voice based agents. Integrations LLMs with speech recognition and text-to-speech☆34Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆72Updated last year
- web based editor for subtitles and transcripts☆143Updated last year
- Offline voice input panel & keyboard with punctuation for Android.☆108Updated last year
- A curated list of awesome OpenAI's Whisper☆102Updated 2 years ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆216Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- Simulates talk with an AI that can express emotions☆82Updated 7 months ago
- Multi speaker audio transcription☆44Updated 3 years ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.☆355Updated 6 months ago
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech☆130Updated 2 years ago
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆24Updated 2 years ago
- A simple, accessible and offline real-time transcription app for Android.☆14Updated last year
- Coqui AI TTS plugin☆85Updated 7 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- WIP exploration using Twilio Media Streams and Generative AI☆40Updated 2 years ago
- Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for speech-to-text and offers a privacy-focused, acce…☆134Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆187Updated 2 years ago
- WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.☆13Updated last year