PinkFloyded / video-ocr

☆49

Alternatives and similar repositories for video-ocr

Users who are interested in video-ocr are comparing it to the libraries listed below.

geekodour / wscribe
ez audio transcription tool with flexible processing and post-processing options
☆155 Updated last year
geekodour / wscribe-editor
web based editor for subtitles and transcripts
☆137 Updated 11 months ago
tomchang25 / whisper-auto-transcribe
Auto transcribe tool based on whisper
☆226 Updated 2 years ago
LibreTranslate / Locomotive
Toolkit for training/converting LibreTranslate compatible language models 🚂
☆55 Updated last month
JonathanFly / faster-whisper-livestream-translator
faster-whisper livestream translation, OBS noise reduction, dual language subtitles
☆78 Updated 2 years ago
amrrs / openai-whisper-webapp
Code for OpenAI Whisper Web App Demo
☆93 Updated 2 years ago
devmaxxing / videocr-PaddleOCR
Extract hardcoded subtitles from videos using machine learning
☆191 Updated last month
appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
☆117 Updated 2 years ago
winstxnhdw / nllb-api
A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.
☆118 Updated this week
carloscdias / whisper-cpp-python
whisper.cpp bindings for python
☆98 Updated last year
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆53 Updated 7 months ago
altryne / whisper-me-this
Automatically generate and overlay subtitles for any video using OpenAi Whisper
☆19 Updated 2 years ago
fcakyon / pywhisper
openai/whisper + extra features
☆89 Updated 2 years ago
thammegowda / nllb-serve
Meta's "No Language Left Behind" models served as web app and REST API
☆225 Updated 2 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆96 Updated last year
pszemraj / vid2cleantxt
Python API & command-line tool to easily transcribe speech-based video files into clean text
☆217 Updated 8 months ago
ancs21 / awesome-openai-whisper
A curated list of awesome OpenAI's Whisper
☆101 Updated last year
asukaminato0721 / autosrt
Offline srt producer gui with whisper.cpp
☆26 Updated last year
BBC-Esq / Faster-Whisper-Transcriber
Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.
☆133 Updated this week
aflorithmic / viseme-to-video
Creates video from TTS output and viseme images.
☆12 Updated 3 years ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆222 Updated 3 months ago
awexandrr / audioWhisper
Listen to any audio stream on your machine and print out the transcribed or translated audio.
☆119 Updated last year
fleek / VADtransciber
☆38 Updated 2 years ago
Sirozha1337 / faster-auto-subtitle
Automatically generate, translate and overlay subtitles for any video.
☆31 Updated 2 weeks ago
argosopentech / translate-html
Translate HTML using Argos Translate
☆52 Updated 2 years ago
Fcabla / whisper_subtitler
Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…
☆18 Updated 2 years ago
Topping1 / whispercppGUI
GUI for whispercpp, a high performance C++ port of OpenAI's whisper
☆82 Updated 4 months ago
nalbion / whisper-server
streaming speech to text server using Whisper
☆93 Updated 2 years ago
HallowSiddharth / VoiceCraftAI
VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.
☆66 Updated 9 months ago
the-crypt-keeper / ggml-downloader
Simple, Fast, Parallel Huggingface GGML model downloader written in python
☆24 Updated 2 years ago