BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆28Updated 4 months ago
Alternatives and similar repositories for WhisperS2T-transcriber:
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
- Real-time end-to-end singing voice convertion☆19Updated 2 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆16Updated last month
- G2P☆35Updated this week
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆18Updated last month
- Convert your PDFs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and efficient…☆37Updated 2 weeks ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆48Updated last month
- ☆25Updated 9 months ago
- Collection of the best Applio plugins.☆26Updated 4 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆14Updated 3 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆41Updated 6 months ago
- A multi-voice TTS system trained with an emphasis on quality☆26Updated 2 years ago
- A UI for the Piper TTS☆78Updated 4 months ago
- Search Character Hub from within SillyTavern☆26Updated 6 months ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆45Updated 6 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆52Updated this week
- ☆22Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆80Updated this week
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆19Updated 3 months ago
- ☆10Updated 8 months ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs API's☆14Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 5 months ago
- 💬 "Realtime" voice transcription and cloning using ElevenLabs's API.☆53Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- ☆27Updated last year
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text)☆67Updated last year
- zero-shot realtime TTS system, fully offline, free and open source☆24Updated 2 weeks ago
- Audio datasets, easier.☆83Updated last year