evermoving / SystemCaptioner
Generates and shows real-time captions by listening to your Windows PC's audio. Includes a standalone .exe option.
☆40Updated 5 months ago
Alternatives and similar repositories for SystemCaptioner
Users who are interested in SystemCaptioner are comparing it to the repositories listed below.
- Synchronize Whisper's timestamps over an existing accurate transcription☆160Updated last year
- [WIP] Scripts for fine-tuning Whisper☆222Updated 2 years ago
- Text to speech alignment using CTC forced alignment☆430Updated 2 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆348Updated last year
- Running the F5-TTS by ONNX Runtime☆191Updated last month
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- ☆192Updated last year
- Fine Tune the Style-TTS2 Voice Model☆266Updated 7 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- Voice Conversion With Just Nearest Neighbors☆511Updated 3 weeks ago
- Ultimate Vocal Remover CLI☆157Updated last year
- A python package for deep multilingual punctuation prediction.☆156Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆265Updated last year
- Voice gender classifier using ECAPA-TDNN☆64Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆80Updated 2 years ago
- Ultimate Vocal Remover Inference CLI☆108Updated last year
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆182Updated last year
- ☆263Updated 2 years ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆889Updated 8 months ago
- Application of MB-iSTFT-VITS components to vits2_pytorch☆132Updated last month
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆180Updated 2 years ago
- Live-Transcription (STT) with Whisper PoC☆202Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- A testing repo to share code and thoughts on diarisation☆57Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆250Updated 5 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆216Updated last year
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆413Updated last year
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.☆86Updated last year
- ☆55Updated 2 weeks ago