PinkFloyded / video-ocrLinks
☆52Updated 3 years ago
Alternatives and similar repositories for video-ocr
Users that are interested in video-ocr are comparing it to the libraries listed below
Sorting:
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- web based editor for subtitles and transcripts☆140Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 8 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆121Updated this week
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆79Updated 2 years ago
- Python API & command-line tool to easily transcribe speech-based video files into clean text☆216Updated 9 months ago
- Meta's "No Language Left Behind" models served as web app and REST API☆234Updated 2 months ago
- Offline srt producer gui with whisper.cpp☆26Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆141Updated last week
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- Auto transcribe tool based on whisper☆227Updated 2 years ago
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆336Updated 9 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- whisper.cpp bindings for python☆101Updated 2 years ago
- openvino version of openai/whisper☆172Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆155Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆173Updated 2 years ago
- ☆17Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆226Updated last week
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆62Updated 2 months ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆220Updated 9 months ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- A testing repo to share code and thoughts on diarisation☆56Updated last year
- Synchronize SRT timestamps over an existing accurate transcription☆34Updated 9 months ago
- ☆38Updated 2 years ago
- BlinkDL's RWKV-v4 running in the browser☆47Updated 2 years ago