EtienneAb3d / SRT-Sync
Synchronize SRT timestamps over an existing accurate transcription
☆30Updated 6 months ago
Alternatives and similar repositories for SRT-Sync
Users that are interested in SRT-Sync are comparing it to the libraries listed below
Sorting:
- Synchronize Whisper's timestamps over an existing accurate transcription☆148Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing options☆149Updated last year
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆325Updated 6 months ago
- web based editor for subtitles and transcripts☆130Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆698Updated 4 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆119Updated 5 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆49Updated 2 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆111Updated last week
- Text to speech alignment using CTC forced alignment☆281Updated last month
- Cross-platform speech toolset, used from the command-line or as a Node.js library. Includes a variety of engines for speech synthesis, sp…☆362Updated last week
- Extract hardcoded subtitles from videos using machine learning☆175Updated last week
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆213Updated last month
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆218Updated 3 months ago
- Running the F5-TTS by ONNX Runtime☆147Updated this week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆243Updated 11 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- A testing repo to share code and thoughts on diarisation☆55Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆94Updated last year
- Simple Diarization model☆47Updated last year
- Split long audio files based on subtitle-info in SRT File (Transcript saved in CSV)☆20Updated 5 years ago
- TorToiSe fine-tuning with DLAS☆220Updated 9 months ago
- 📈 A forced aligner intended for synchronization of narrated text☆93Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆379Updated last year
- Open source inference code for Rev's model☆402Updated 3 weeks ago
- streaming speech to text server using Whisper☆92Updated last year
- A lightweight end-to-end text-to-speech model☆113Updated 2 months ago
- Ultimate Vocal Remover CLI☆140Updated 3 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆210Updated 6 months ago