evermoving / SystemCaptioner
Generates and shows real-time captions by listening to your Windows PC's audio. With standalone .exe option.
โ27Updated 4 months ago
Alternatives and similar repositories for SystemCaptioner:
Users that are interested in SystemCaptioner are comparing it to the libraries listed below
- Text to speech alignment using CTC forced alignmentโ270Updated last month
- ONNX Inference of Pyannote Segmentationโ85Updated 4 months ago
- ๐ Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. ๐ง๐ฅ๐ Advanced audio processing.โ242Updated 10 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsโ322Updated 5 months ago
- โ216Updated last month
- Running the F5-TTS by ONNX Runtimeโ146Updated 2 weeks ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitlesโ78Updated last year
- [WIP] Scripts for fine-tuning Whisperโ219Updated last year
- Ultimate Vocal Remover Inference CLIโ66Updated 2 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionโ680Updated 4 months ago
- ไธไธช็ฎๅ็้ณ้ข้ๅชๅทฅๅ ท,ๆ้ซweb UI็้ขๅapiๆฅๅฃโ25Updated 5 months ago
- Open source inference code for Rev's modelโ399Updated this week
- Synchronize Whisper's timestamps over an existing accurate transcriptionโ147Updated 10 months ago
- SOFA: Singing-Oriented Forced Alignerโ163Updated last week
- ez audio transcription tool with flexible processing and post-processing optionsโ149Updated last year
- A enterprise-grade Voice Activity Detector from modelscope and funasr.โ96Updated 2 years ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineโ400Updated 7 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withโฆโ205Updated 2 weeks ago
- โ130Updated 4 months ago
- Utilizes ONNX Runtime to transcribe audio into text.โ24Updated this week
- Fine-Tune Whisper with Transformers and PEFTโ55Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech denโฆโ72Updated 8 months ago
- โ13Updated last week
- ultimate vocal remover application run on linux ubuntu1604โ52Updated 2 years ago
- Python Wrapper of Silero VADโ51Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionโ174Updated 6 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.โ135Updated last year
- This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation usingโฆโ92Updated 4 months ago
- Predicts the level of noise and reverberation on your audiofilesโ148Updated 11 months ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions forโฆโ158Updated 11 months ago