JSchmie / ScrAIbeLinks
Tool for automatic transcription and speaker diarization based on whisper and pyannote.
☆63Updated last year
Alternatives and similar repositories for ScrAIbe
Users that are interested in ScrAIbe are comparing it to the libraries listed below
Sorting:
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆250Updated 5 months ago
- WebUI for ScAIbe☆50Updated 8 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- API server for Instant voice cloning by MyShell.☆107Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆159Updated 3 weeks ago
- Record audio or transcribe files using ctranslate2 and whisper!☆170Updated this week
- FastAPI service on top of WhisperX☆170Updated this week
- ☆100Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Updated 2 years ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆416Updated last week
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆90Updated last year
- 💬 Fast, cross-platform CLI and GUI for batch transcription, translation, speaker annotation and subtitle generation using OpenAI’s Whisp…☆86Updated 3 weeks ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆58Updated last year
- Cog implementation of transcribing + diarization pipeline with Whisper & Pyannote☆232Updated 11 months ago
- Simulates talk with an AI that can express emotions☆82Updated 7 months ago
- web based editor for subtitles and transcripts☆143Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- streaming speech to text server using Whisper☆101Updated 2 years ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆128Updated 5 months ago
- Run different pipelines of WhisperX - Transcription, Diarization, VAD, Alignment completely OFFLINE.☆45Updated 10 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!☆110Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆267Updated 7 months ago
- Examples of using the llasa-tts models locally☆182Updated 9 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆285Updated 9 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆52Updated 10 months ago
- Transcription and diarization (speaker identification)☆34Updated 2 years ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆540Updated last year