BBC-Esq / WhisperS2T-transcriberLinks
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆44Updated 2 months ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
Sorting:
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆38Updated this week
- A UI for the Piper TTS☆94Updated 9 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆120Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆122Updated 3 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 10 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆90Updated 2 months ago
- ☆95Updated last year
- ☆83Updated 11 months ago
- Simulates talk with an AI that can express emotions☆70Updated 10 months ago
- Audio datasets, easier.☆84Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆25Updated this week
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- Collection of the best Applio plugins.☆29Updated 8 months ago
- ez audio transcription tool with flexible processing and post-processing options☆150Updated last year
- ☆99Updated 9 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆64Updated 2 months ago
- A multi-voice TTS system trained with an emphasis on quality☆26Updated 2 years ago
- ☆67Updated 2 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆12Updated 8 months ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆60Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆61Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆96Updated 2 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆36Updated last week
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.☆47Updated 4 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated 10 months ago
- Diffusion_TTS extension for booga☆68Updated 11 months ago