BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files
☆61 Updated 2 weeks ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
- Audio datasets, easier. ☆85 Updated 2 years ago
- ☆99 Updated last year
- Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS. ☆29 Updated 4 months ago
- Faster Tortoise inference than Tortoise Fast Fork ☆127 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆157 Updated last year
- Lightweight Gradio-based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆103 Updated 6 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆28 Updated 4 months ago
- ☆66 Updated 6 months ago
- Examples of using the llasa-tts models locally ☆180 Updated 5 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆158 Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆126 Updated 6 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆29 Updated 4 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ☆29 Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆143 Updated last year
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆206 Updated 2 weeks ago
- SoTA open-source TTS ☆100 Updated 3 weeks ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 Updated 3 weeks ago
- ☆101 Updated last year
- A UI for the Piper TTS ☆101 Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆66 Updated 10 months ago
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆37 Updated 4 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆237 Updated last month
- Streaming and Fine-tuning for Chatterbox TTS ☆193 Updated 3 months ago
- A fast MP3 decoder for python, using minimp3 ☆28 Updated 3 years ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆46 Updated last year
- SoTA open-source TTS ☆96 Updated 4 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC ☆78 Updated 2 months ago
- A Gradio UI for XTTSv2 and RVC. ☆158 Updated last year
- ☆74 Updated last year