JSchmie / ScrAIbeLinks
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆55Updated 8 months ago
Alternatives and similar repositories for ScrAIbe
Users that are interested in ScrAIbe are comparing it to the libraries listed below
Sorting:
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆234Updated last month
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆85Updated 7 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆118Updated 5 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆150Updated 2 weeks ago
- API server for Instant voice cloning by MyShell.☆103Updated 11 months ago
- Simulates talk with an AI that can express emotions☆78Updated 3 months ago
- FastAPI service on top of WhisperX☆129Updated last week
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆118Updated last week
- Streaming and Fine-tuning for Chatterbox TTS☆182Updated 3 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆63Updated last week
- WebUI for ScAIbe☆45Updated 3 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆237Updated 5 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 11 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆405Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction☆100Updated 3 months ago
- ☆82Updated 6 months ago
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks…☆199Updated 2 weeks ago
- Self-hosted AI medical scribe.☆50Updated last week
- Run Orpheus 3B Locally With LM Studio☆31Updated 6 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆52Updated 6 months ago
- ☆99Updated last year
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆37Updated last week
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆323Updated 3 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 9 months ago
- 🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs ✨☆93Updated 2 months ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆62Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆157Updated last year
- Examples of using the llasa-tts models locally☆180Updated 5 months ago
- Explore, Install, Innovate — in 1 Click.☆83Updated this week