BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files
☆63 Updated last month
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
- Faster Tortoise inference than Tortoise Fast Fork ☆127 Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆28 Updated last week
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) ☆79 Updated last year
- ☆99 Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription ☆159 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆160 Updated last year
- ☆101 Updated last year
- A UI for the Piper TTS ☆103 Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆29 Updated 4 months ago
- Audio datasets, easier. ☆85 Updated 2 years ago
- Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS. ☆28 Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆43 Updated last month
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆106 Updated 7 months ago
- Examples of using the llasa-tts models locally ☆181 Updated 6 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆66 Updated 11 months ago
- AI powered speech denoising and enhancement. Adapted for Windows and optimized ☆91 Updated last year
- zero-shot realtime TTS system, fully offline, free and open source ☆47 Updated 6 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆204 Updated 4 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on quality ☆24 Updated last year
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆128 Updated 2 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- SoTA open-source TTS ☆114 Updated 2 weeks ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation) ☆290 Updated this week
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- AudioSR-Colab-Fork ☆49 Updated 3 weeks ago
- ☆69 Updated 7 months ago
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆238 Updated last month
- ☆50 Updated 2 weeks ago
- A curated list of awesome OpenAI's Whisper ☆98 Updated 2 years ago