JuergenFleiss / aTrain
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
☆430 Updated this week
Alternatives and similar repositories for aTrain:
Users who are interested in aTrain are comparing it to the repositories listed below
- Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification) ☆638 Updated this week
- ez audio transcription tool with flexible processing and post-processing options ☆142 Updated last year
- Whisper command-line client compatible with the original OpenAI client, based on CTranslate2. ☆974 Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆577 Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️ ☆335 Updated 8 months ago
- A simple GUI to use Whisper. ☆131 Updated 6 months ago
- Open source inference code for Rev's model ☆372 Updated 3 weeks ago
- Modern GUI application that transcribes and translates audio files using OpenAI Whisper. ☆138 Updated 6 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆185 Updated this week
- ☆558 Updated 9 months ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and … ☆219 Updated 2 weeks ago
- ☆1,349 Updated this week
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python. ☆1,625 Updated 2 weeks ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ☆304 Updated 3 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆86 Updated this week
- Live-Transcription (STT) with Whisper PoC ☆173 Updated 7 months ago
- turnkey self-hosted offline transcription and diarization service with LLM summary ☆805 Updated 4 months ago
- A Python package to build AI-powered real-time audio applications ☆1,181 Updated this week
- A nearly-live implementation of OpenAI's Whisper. ☆2,423 Updated last week
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS ☆801 Updated 4 months ago
- open source audio and video transcription software ☆344 Updated last week
- Tool for automatic transcription and speaker diarization based on Whisper and pyannote. ☆42 Updated 3 weeks ago
- Local SRT/LLM/TTS Voicechat ☆613 Updated 4 months ago
- The subtitles and translations are generated in real-time and displayed as pop-ups. ☆142 Updated last year
- Whisper with Medusa heads ☆822 Updated this week
- web based editor for subtitles and transcripts ☆118 Updated 6 months ago
- A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and … ☆188 Updated 4 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper ☆4,129 Updated last month
- Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM. ☆197 Updated 3 months ago
- FastAPI service on top of WhisperX ☆67 Updated 2 weeks ago