BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files
☆50 · Updated 3 months ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users who are interested in WhisperS2T-transcriber are comparing it to the repositories listed below.
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper. ☆128 · Updated last week
- A UI for the Piper TTS ☆93 · Updated 9 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆26 · Updated 2 weeks ago
- ☆97 · Updated last year
- ☆67 · Updated 3 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and… ☆99 · Updated 2 months ago
- An extension to use Kokoro TTS in text generation webui ☆20 · Updated last month
- XTTSv2 Extension for oobabooga text-generation-webui ☆154 · Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 · Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 · Updated 6 months ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆56 · Updated 10 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆62 · Updated 3 months ago
- A simple wrapper around "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" that provides an OpenAI-compatibl… ☆13 · Updated 4 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆109 · Updated last week
- Slightly improved official version for finetune xtts ☆73 · Updated 9 months ago
- Piper Tray is a lightweight system tray utility written in C# for use with Piper TTS. ☆23 · Updated 3 weeks ago
- A Gradio UI for XTTSv2 and RVC. ☆158 · Updated last year
- A random walk voice style cloning application for Kokoro text to speech ☆99 · Updated last week
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆98 · Updated 3 months ago
- Automated speech dataset creator ☆152 · Updated 2 weeks ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆120 · Updated last year
- Collection of the best Applio plugins. ☆29 · Updated 9 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆39 · Updated last week
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆65 · Updated last week
- ez audio transcription tool with flexible processing and post-processing options ☆152 · Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆25 · Updated 3 weeks ago
- A multi-voice TTS system trained with an emphasis on quality ☆26 · Updated 2 years ago
- A Gradio UI for XTTSv2 and RVC. ☆68 · Updated 9 months ago
- Writing Extension for Text Generation WebUI ☆60 · Updated 5 months ago
- Polyglot is a fast, elegant, and free translation tool using AI. ☆62 · Updated 9 months ago