DakeQQ / Transcribe-and-Translate-SubtitlesLinks
Transcribe subtitles and translate them offline with ease.
☆28Updated last week
Alternatives and similar repositories for Transcribe-and-Translate-Subtitles
Users that are interested in Transcribe-and-Translate-Subtitles are comparing it to the libraries listed below
Sorting:
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆39Updated last week
- ☆22Updated this week
- Utilizes ONNX Runtime to transcribe audio into text.☆35Updated last week
- StyleTTS 2 Optimized Training Fork☆31Updated 4 months ago
- High quality text-to-speech based on StyleTTS 2.☆51Updated last week
- ez audio transcription tool with flexible processing and post-processing options☆152Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆97Updated last week
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- A curated list of awesome voice activity detection☆57Updated 7 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- ☆139Updated last year
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 2 months ago
- Utilizes ONNX Runtime for speech activity detection.☆25Updated last week
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆24Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated 11 months ago
- ONNX Inference of Pyannote Segmentation☆91Updated 6 months ago
- A lightweight end-to-end text-to-speech model☆114Updated 4 months ago
- ☆235Updated last week
- Cantonese Text to Speech with VITS implementation☆30Updated 2 years ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- Real-time Speech-Text Foundation Model Toolkit (wip)☆237Updated 3 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆26Updated last month
- G2P☆262Updated last month
- Resources that make every language unique☆13Updated 7 months ago
- Running the F5-TTS by ONNX Runtime☆156Updated 2 weeks ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year
- ONNX implementation of Whisper. PyTorch free.☆100Updated 7 months ago
- Synchronize Whisper's timestamps over an existing accurate transcription☆152Updated last year