BBC-Esq / WhisperS2T-transcriberLinks
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆52Updated 5 months ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
Sorting:
- A UI for the Piper TTS☆97Updated 11 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆26Updated 2 months ago
- ☆98Updated last year
- Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS.☆28Updated 2 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆29Updated 2 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆109Updated 4 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆155Updated last year
- Examples of using the llasa-tts models locally☆179Updated 4 months ago
- ☆101Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆225Updated last week
- Tool for automatic transcription and speaker diarization based on whisper and pyannote.☆53Updated 7 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning☆146Updated last year
- ☆67Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated 3 weeks ago
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆76Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Streaming and Fine-tuning for Chatterbox TTS☆157Updated 2 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆37Updated 3 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆101Updated 5 months ago
- Audio datasets, easier.☆84Updated 2 years ago
- SoTA open-source TTS☆79Updated 2 months ago
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- A random walk voice style cloning application for Kokoro text to speech☆117Updated 2 months ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆138Updated last week
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities.☆56Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆63Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆50Updated 8 months ago
- ☆478Updated 2 months ago