themanyone / caption_anythingLinks
Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation again.
☆20Updated last month
Alternatives and similar repositories for caption_anything
Users that are interested in caption_anything are comparing it to the libraries listed below
Sorting:
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆267Updated 3 weeks ago
- streaming speech to text server using Whisper☆95Updated 2 years ago
- web based editor for subtitles and transcripts☆142Updated last year
- Whisper from OpenAi and diarization with Pyannote☆48Updated last year
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆24Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆37Updated 9 months ago
- On-device noise suppression powered by deep learning☆74Updated 2 months ago
- A curated list of awesome OpenAI's Whisper☆98Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆65Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated last year
- Speaker diarization service☆24Updated 3 months ago
- A lightweight Python library for running TTS models with a unified API.☆20Updated 7 months ago
- Coqui AI TTS plugin☆87Updated 3 months ago
- A simple, accessible and offline real-time transcription app for Android.☆12Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆152Updated 3 weeks ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆217Updated last month
- ☆35Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆159Updated last year
- Whatsapp Web Speech To Text☆54Updated 2 years ago
- Transcription with speaker diarization pipeline☆94Updated 2 years ago
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- ☆28Updated last month
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated 11 months ago
- whisper.cpp bindings for python☆106Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated 2 months ago
- ☆156Updated last year