JSchmie / ScrAIbeLinks
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆62Updated 10 months ago
Alternatives and similar repositories for ScrAIbe
Users that are interested in ScrAIbe are comparing it to the libraries listed below
Sorting:
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆244Updated 3 months ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆71Updated last week
- WebUI for ScAIbe☆49Updated 6 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆105Updated 5 months ago
- ☆100Updated last year
- Simulates talk with an AI that can express emotions☆82Updated 5 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated last year
- API server for Instant voice cloning by MyShell.☆106Updated last year
- FastAPI service on top of WhisperX☆156Updated last week
- ez audio transcription tool with flexible processing and post-processing options☆160Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆141Updated 8 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆161Updated 2 weeks ago
- Transcribe audio and video files with speaker diarization and logically grouped timestamps using Gemini Flash☆42Updated last week
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆230Updated 9 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆89Updated 10 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆57Updated 2 weeks ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆218Updated last year
- Streaming and Fine-tuning for Chatterbox TTS☆229Updated 5 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆52Updated 9 months ago
- Examples of using the llasa-tts models locally☆181Updated 7 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- streaming speech to text server using Whisper☆98Updated 2 years ago
- web based editor for subtitles and transcripts☆141Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆383Updated last year
- ☆71Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆99Updated last year
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆125Updated 3 months ago