geekodour / wscribeView external linksLinks
ez audio transcription tool with flexible processing and post-processing options
☆162Feb 1, 2024Updated 2 years ago
Alternatives and similar repositories for wscribe
Users that are interested in wscribe are comparing it to the libraries listed below
Sorting:
- web based editor for subtitles and transcripts☆143Aug 16, 2024Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100May 7, 2024Updated last year
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆163Nov 19, 2024Updated last year
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- ☆14Aug 16, 2023Updated 2 years ago
- SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech☆11Jun 30, 2023Updated 2 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Hyperaudio Converter - converts from JSON/SRT to HTML Based Interactive Transcript☆14Dec 16, 2020Updated 5 years ago
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆1,207Updated this week
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 8 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Nov 12, 2024Updated last year
- Dippy Synthetic Speech Subnet☆17Sep 11, 2025Updated 5 months ago
- ☆16Sep 12, 2019Updated 6 years ago
- A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning mod…☆1,040Updated this week
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆161May 28, 2024Updated last year
- Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper☆5,355Nov 26, 2025Updated 2 months ago
- [NeurIPS 2023 - ML for Audio Workshop (Oral)] Zero-shot audio captioning with audio-language model guidance and audio context keywords☆18Nov 30, 2024Updated last year
- Speech enhancement in noisy and reverberant environments using deep neural networks☆22Oct 10, 2025Updated 4 months ago
- Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.☆2,852Nov 7, 2025Updated 3 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Oct 31, 2023Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Record audio or transcribe files using ctranslate2 and whisper!☆170Feb 6, 2026Updated last week
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).☆20Apr 1, 2021Updated 4 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆19Mar 10, 2023Updated 2 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Speech-to-text transcription VST3/ARA plugin☆53Feb 2, 2026Updated last week
- ivrit.ai codebase☆45Oct 24, 2025Updated 3 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆27Aug 1, 2023Updated 2 years ago
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- A Network Service Messaging framework for use on top of the W3C Network Service Discovery specification☆11Apr 29, 2014Updated 11 years ago
- Skribify is a powerful transcription and summarization tool that leverages the power of OpenAI's GPT-4 and WhisperAI to generate concise …☆12Apr 29, 2025Updated 9 months ago
- Simple Android SDK for Publitio☆10Jan 16, 2021Updated 5 years ago
- Platform for creating audio-first AI assistants that can work offline using a flexible plugin architecture☆13Jun 29, 2025Updated 7 months ago
- ATSC 3.0 to MPEG-2 TS Converter☆21Sep 11, 2025Updated 5 months ago
- Reimplementation of Miipher☆29Aug 16, 2023Updated 2 years ago