EtienneAb3d / SRT-Sync
Synchronize SRT timestamps over an existing accurate transcription
☆41Updated last year
Alternatives and similar repositories for SRT-Sync
Users who are interested in SRT-Sync are comparing it to the libraries listed below.
- Synchronize Whisper's timestamps over an existing accurate transcription☆160Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆162Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Updated last year
- web based editor for subtitles and transcripts☆143Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆249Updated 5 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆888Updated 7 months ago
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…☆425Updated 5 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- Text to speech alignment using CTC forced alignment☆421Updated 2 months ago
- Running the F5-TTS by ONNX Runtime☆191Updated 3 weeks ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆54Updated 3 years ago
- ☆100Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- ☆54Updated 2 weeks ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆66Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆128Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆180Updated 2 years ago
- Extract hardcoded subtitles from videos using machine learning☆217Updated 5 months ago
- Open source inference code for Rev's model☆435Updated 9 months ago
- Ultimate Vocal Remover CLI☆157Updated 11 months ago
- A testing repo to share code and thoughts on diarisation☆57Updated last year
- ☆192Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆136Updated this week
- Whisper command line client compatible with original OpenAI client based on CTranslate2.☆1,191Updated last month
- Fine Tune the Style-TTS2 Voice Model☆266Updated 7 months ago
- epub2tts-edge uses Microsoft Edge cloud-based TTS to create a full featured audiobook m4b from an epub or text file☆202Updated last year
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆48Updated 4 months ago
- G2P☆396Updated 5 months ago