JSchmie / ScrAIbe
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆54 Updated 7 months ago
Alternatives and similar repositories for ScrAIbe
Users who are interested in ScrAIbe are comparing it to the libraries listed below.
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆226 Updated 2 weeks ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆146 Updated 2 weeks ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. ☆83 Updated 6 months ago
- Simulates talk with an AI that can express emotions ☆78 Updated 2 months ago
- WebUI for ScrAIbe ☆45 Updated 3 months ago
- API server for Instant voice cloning by MyShell. ☆99 Updated 11 months ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ☆344 Updated last week
- ☆99 Updated last year
- Streaming and Fine-tuning for Chatterbox TTS ☆164 Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing options ☆158 Updated last year
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast! ☆58 Updated 3 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆56 Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction ☆99 Updated 2 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di… ☆281 Updated last month
- EPUB, PDF, DOCX, MD, and TXT file text to speech document reader. Read documents in realtime with high-quality TTS; or extract audiobooks… ☆189 Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆369 Updated last year
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT ☆390 Updated 3 weeks ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities. ☆114 Updated 3 weeks ago
- FastAPI service on top of WhisperX ☆123 Updated this week
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆111 Updated 5 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated 10 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆54 Updated 8 months ago
- web based editor for subtitles and transcripts ☆140 Updated last year
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM. ☆262 Updated 2 months ago
- A simple GUI to use Whisper. ☆210 Updated last month
- Self-hosted AI medical scribe. ☆49 Updated last week
- An API to transcribe audio with OpenAI's Whisper Large v3! ☆298 Updated 9 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor… ☆317 Updated 2 months ago
- Run Orpheus 3B Locally With LM Studio ☆31 Updated 5 months ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash ☆37 Updated last month