BBC-Esq / WhisperS2T-transcriber
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files
☆70 · Updated 3 months ago
Alternatives and similar repositories for WhisperS2T-transcriber
Users that are interested in WhisperS2T-transcriber are comparing it to the libraries listed below
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 · Updated last year
- Examples of using the llasa-tts models locally ☆182 · Updated 8 months ago
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install) ☆29 · Updated 2 weeks ago
- ☆100 · Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 · Updated last year
- ☆100 · Updated last year
- A UI for the Piper TTS ☆106 · Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆30 · Updated 6 months ago
- ✨ A real-time voice changer application using WebSockets and ONNX/TensorFlow/PyTorch ☆49 · Updated 11 months ago
- Audio datasets, easier. ☆86 · Updated 2 years ago
- Modern Desktop Application offering a suite of tools for audio/video text recognition and a variety of other useful utilities. ☆58 · Updated last year
- SoTA open-source TTS ☆137 · Updated 2 weeks ago
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆46 · Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆46 · Updated 3 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription ☆160 · Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆66 · Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆161 · Updated last year
- Using RVC via console or python scripts ☆140 · Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆160 · Updated last year
- Streaming and Fine-tuning for Chatterbox TTS ☆252 · Updated 6 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆69 · Updated 5 months ago
- ☆510 · Updated 3 weeks ago
- Karaoke Player / Editor with automatic clip creation from any song file using vocals and lyrics extraction (Speech-to-Text) ☆84 · Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆119 · Updated 2 years ago
- A multi-voice TTS system trained with an emphasis on quality ☆26 · Updated 2 years ago
- Tool for automatic transcription and speaker diarization based on whisper and pyannote. ☆62 · Updated 11 months ago
- Advanced RVC Inference for quicker and effortless model downloads ☆64 · Updated this week
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆105 · Updated last month
- RVC realtime voice changer - standalone/lightweight ☆89 · Updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆246 · Updated 4 months ago