JuergenFleiss / aTrain
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
☆483Updated last week
Alternatives and similar repositories for aTrain:
Users that are interested in aTrain are comparing it to the libraries listed below
- Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)☆740Updated this week
- ez audio transcription tool with flexible processing and post-processing options☆149Updated last year
- open source audio and video transcription software☆387Updated 2 weeks ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆667Updated 3 months ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆998Updated 2 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆344Updated 10 months ago
- ☆1,675Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆105Updated last month
- A simple GUI to use Whisper.☆145Updated 3 weeks ago
- A nearly-live implementation of OpenAI's Whisper.☆2,688Updated this week
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆255Updated this week
- web based editor for subtitles and transcripts☆128Updated 7 months ago
- Open source inference code for Rev's model☆395Updated last month
- Modern GUI application that transcribes and translate audio files using OpenAI Whisper.☆146Updated 8 months ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆1,925Updated this week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆203Updated this week
- Real-time, Fully Local Speech-to-Text and Speaker Diarization. FastAPI Server & Web Interface☆186Updated this week
- turnkey self-hosted offline transcription and diarization service with llm summary☆834Updated 6 months ago
- Live-Transcription (STT) with Whisper PoC☆175Updated 9 months ago
- ☆577Updated 11 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆742Updated 2 months ago
- A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. …☆565Updated last year
- A Fast TTS Engine☆483Updated 2 months ago
- Whisper with Medusa heads☆830Updated last month
- Local SRT/LLM/TTS Voicechat☆658Updated 6 months ago
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆4,364Updated last month
- A desktop application that transcribes audio from files, microphone input or YouTube videos with the option to translate the content and …☆196Updated 5 months ago
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆846Updated 6 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆200Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆155Updated last year