Haschtl / transcripy
Multi speaker audio transcription
☆29Updated 2 years ago
Alternatives and similar repositories for transcripy:
Users that are interested in transcripy are comparing it to the libraries listed below
- An open-source, browser-based transcript viewer and manager. Upload, transcribe, and chat with meeting recordings using AI. Features meet…☆29Updated last week
- Local & private voice controlled notepad using whisper.cpp☆22Updated last year
- Speak (speech-to-text) to LLMs (Ollama) in any language - Streamlit app☆42Updated 11 months ago
- A voice to text keyboard based on OpenAI Whisper Model.☆50Updated last year
- llmon-py is a multimodal webui for Llama 3-8B.☆16Updated 6 months ago
- Caption, translate, and optionally record in real time "what you hear" from speakers and microphone. Never miss part of the conversation …☆16Updated 10 months ago
- 🔊😊 A fastapi voice-assistant framework to quickly prototype LLM-powered voice assistants in <5 minutes.☆27Updated last year
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆37Updated 2 weeks ago
- demo app: how to use LLM as a general purpose classifier☆92Updated last year
- A VoiceAssistant with WhisperAI speech recognition☆29Updated 2 months ago
- agi from function calls, if you want in vscode☆18Updated last year
- Self-hosted AI voice agent☆80Updated 5 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... Fast!!☆30Updated 2 weeks ago
- Second attempt at AI webcam, this time with OpenAI API☆38Updated last year
- Real time audio to audio translation over sockets. With virtual microphones, you can use this in any video conferencing software you'd li…☆26Updated 5 months ago
- Imagine translating your speech or anybody's speech to any language you want within minutes. check this out...☆39Updated 5 months ago
- Prompt Development Environment for GPT☆13Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.☆189Updated 3 months ago
- A langchain app to visualise a debate using Tree-of-Thought reasoning☆58Updated 11 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation☆11Updated last year
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆65Updated 3 weeks ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆43Updated 2 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆83Updated 2 weeks ago
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆46Updated last year
- Accelerate Whisper tasks, such as transcription, by multiprocessing through parallelization☆25Updated 2 years ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆108Updated last year
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images☆28Updated last year
- Porting BabyAGI to Oobabooga.☆33Updated last year
- Speaker diarization service☆20Updated last month
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆53Updated last week