JuergenFleiss / aTrain
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.
☆299Updated last week
Related projects:
- Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification)☆398Updated this week
- ☆384Updated this week
- A simple GUI to use Whisper.☆84Updated last month
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆163Updated last week
- Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from te…☆277Updated this week
- ez audio transcription tool with flexible processing and post-processing options☆122Updated 7 months ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆308Updated 3 months ago
- web-based editor for subtitles and transcripts☆102Updated last month
- Modern GUI application that transcribes and translates audio files using OpenAI Whisper.☆108Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆111Updated last year
- Speech Diarization for scrum automation☆94Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆134Updated 3 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆147Updated 3 weeks ago
- Fast! Offline, privacy-focused, hands-free voice typing, 2-way AI voice chat, AI images, webcam, recorder, voice control, in under 4 GiB …☆157Updated this week
- Live-Transcription (STT) with Whisper PoC☆140Updated 3 months ago
- a gradio webui for faster whisper☆217Updated last year
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆382Updated 2 weeks ago
- 💬📝 A small dictation app using OpenAI's Whisper speech recognition model.☆304Updated 3 weeks ago
- Whisper with Medusa heads☆774Updated last week
- An API to transcribe audio with OpenAI's Whisper Large v3!☆166Updated 3 weeks ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆262Updated 7 months ago
- Have a natural voice conversation with an LLM☆189Updated this week
- open source audio and video transcription software☆263Updated 3 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines☆276Updated 3 weeks ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆866Updated last month
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆171Updated 3 months ago
- ScribeWizard: Generate organized notes from audio using Groq, Whisper, and Llama3☆412Updated 3 weeks ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆677Updated last month
- Local SRT/LLM/TTS Voicechat☆471Updated last month
- ☆1,079Updated 2 months ago