ez audio transcription tool with flexible processing and post-processing options
☆168Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for wscribe
Users that are interested in wscribe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- web based editor for subtitles and transcripts☆147Aug 16, 2024Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆102May 7, 2024Updated 2 years ago
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆166May 26, 2026Updated 3 weeks ago
- Hyperaudio Converter - converts from JSON/SRT to HTML Based Interactive Transcript☆14Dec 16, 2020Updated 5 years ago
- Record audio or transcribe files using ctranslate2 and whisper!☆200Updated this week
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 3 years ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆1,317Feb 14, 2026Updated 4 months ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆1,155Updated this week
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆350Nov 12, 2024Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,563Feb 23, 2026Updated 3 months ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆59Updated this week
- ☆16Jun 6, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Synchronize Whisper's timestamps over an existing accurate transcription☆164May 28, 2024Updated 2 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆23Sep 26, 2024Updated last year
- Parakeet 0.6b V2 + Pyannote diarization behind a Whisper API☆77Feb 21, 2026Updated 3 months ago
- Simple diarization model☆53Jun 13, 2025Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated last year
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆3,069Nov 7, 2025Updated 7 months ago
- MCP server for transcript processing — formatting, contextual repair & smart summarization with deep-thinking LLMs☆19Apr 7, 2026Updated 2 months ago
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆220Oct 30, 2024Updated last year
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆19Apr 1, 2021Updated 5 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 3 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- ☆14Mar 31, 2023Updated 3 years ago
- A simple extension that uses Bark Text-to-Speech for audio output☆10Nov 20, 2023Updated 2 years ago
- Faster Whisper transcription with CTranslate2☆23,584Nov 19, 2025Updated 6 months ago
- A Colab Notebook for OpenAI Whisper and DeepL API, aiming to create human-comparable results of translation and transcription.☆33Feb 4, 2024Updated 2 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。☆661May 25, 2026Updated 3 weeks ago
- Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…☆265Apr 19, 2026Updated last month
- ☆16Sep 12, 2019Updated 6 years ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆22,462Jun 3, 2026Updated last week
- Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.☆138Jan 28, 2026Updated 4 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆23Oct 10, 2025Updated 8 months ago