geekodour/wscribe

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/geekodour/wscribe)

geekodour / wscribe

ez audio transcription tool with flexible processing and post-processing options

☆171

Alternatives and similar repositories for wscribe

Users that are interested in wscribe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

geekodour / wscribe-editor
View on GitHub
web based editor for subtitles and transcripts
☆147Aug 16, 2024Updated last year
hedrergudene / asr-sd-pipeline
View on GitHub
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆102May 7, 2024Updated 2 years ago
hyperaudio / hyperaudio-lite
View on GitHub
Hyperaudio Lite - a Super-lightweight Interactive Transcript Player
☆168Jul 4, 2026Updated 3 weeks ago
hyperaudio / ha-converter
View on GitHub
Hyperaudio Converter - converts from JSON/SRT to HTML Based Interactive Transcript
☆14Dec 16, 2020Updated 5 years ago
Fcabla / whisper_subtitler
View on GitHub
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…
☆19Mar 10, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
gaspardpetit / verbatim
View on GitHub
High accuracy code-switching whisper / qwen3 transcription
☆39Jun 17, 2026Updated last month
Softcatala / whisper-ctranslate2
View on GitHub
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,333Feb 14, 2026Updated 5 months ago
BBC-Esq / Faster-Whisper-Transcriber
View on GitHub
Record audio or transcribe files using ctranslate2 and whisper!
☆210Updated this week
MahmoudAshraf97 / whisper-diarization
View on GitHub
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,608Feb 23, 2026Updated 5 months ago
denfed / wave-spec-fusion
View on GitHub
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…
☆16Aug 9, 2021Updated 4 years ago
aTrainTranscription / aTrain
View on GitHub
A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…
☆1,187Jul 16, 2026Updated last week
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
EtienneAb3d / WhisperHallu
View on GitHub
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆350Nov 12, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
JaesungHuh / SimpleDiarization
View on GitHub
Simple diarization model
☆53Jun 13, 2025Updated last year
ElvisClaros / GOT-OCR2.0
View on GitHub
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆23Sep 26, 2024Updated last year
JFalnes / Skribify
View on GitHub
Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …
☆12Apr 29, 2025Updated last year
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
ob1y2k / publitio_android_sdk
View on GitHub
Simple Android SDK for Publitio
☆10Jan 16, 2021Updated 5 years ago
EtienneAb3d / WhisperTimeSync
View on GitHub
Synchronize Whisper's timestamps over an existing accurate transcription
☆165May 28, 2024Updated 2 years ago
LilDevsy0117 / Ultra-Sortformer
View on GitHub
Ultra-Sortformer for Scalable Speaker Diarization
☆27Apr 9, 2026Updated 3 months ago
Wordcab / wordcab-transcribe
View on GitHub
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆219Oct 30, 2024Updated last year
hyperaudio / hyperaudio
View on GitHub
☆14Mar 31, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
maum-ai / sane-tts
View on GitHub
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11Jun 30, 2023Updated 3 years ago
kuielab / voice_datasets
View on GitHub
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
☆20Apr 1, 2021Updated 5 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
RandomInternetPreson / text-generation-webui-barktts
View on GitHub
A simple extension that uses Bark Text-to-Speech for audio output
☆10Nov 20, 2023Updated 2 years ago
NavodPeiris / speechlib
View on GitHub
Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…
☆266Apr 19, 2026Updated 3 months ago
jbeliao / SLAM
View on GitHub
☆16Sep 12, 2019Updated 6 years ago
cnbeining / Whisper_Notebook
View on GitHub
A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.
☆33Feb 4, 2024Updated 2 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
zh-plus / openlrc
View on GitHub
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT，Claude等)来转录、翻译你的音频为字幕文件。
☆669May 25, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Keyaku / bouncy
View on GitHub
Game for Godot demonstrating OpenCV calls through GDNative
☆20May 16, 2021Updated 5 years ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
mt-upc / ZeroSwot
View on GitHub
Pushing the Limits of Zero-shot End-to-End Speech Translation
☆25Dec 12, 2024Updated last year
JonathanFly / faster-whisper-livestream-translator
View on GitHub
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆82Apr 26, 2023Updated 3 years ago
haoxiangsnr / spiking-fullsubnet
View on GitHub
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆142Jan 28, 2026Updated 5 months ago
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆23Oct 10, 2025Updated 9 months ago
gweltou / anaouder-cli
View on GitHub
Anaouder mouezh e Brezhoneg gant Vosk
☆15Nov 24, 2025Updated 8 months ago