PinkFloyded / video-ocrLinks
☆49Updated 3 years ago
Alternatives and similar repositories for video-ocr
Users that are interested in video-ocr are comparing it to the libraries listed below
Sorting:
- ez audio transcription tool with flexible processing and post-processing options☆155Updated last year
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- Auto transcribe tool based on whisper☆226Updated 2 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆55Updated last month
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆78Updated 2 years ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Extract hardcoded subtitles from videos using machine learning☆191Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆118Updated this week
- whisper.cpp bindings for python☆98Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 7 months ago
- Automatically generate and overlay subtitles for any video using OpenAi Whisper☆19Updated 2 years ago
- openai/whisper + extra features☆89Updated 2 years ago
- Meta's "No Language Left Behind" models served as web app and REST API☆225Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Python API & command-line tool to easily transcribe speech-based video files into clean text☆217Updated 8 months ago
- A curated list of awesome OpenAI's Whisper☆101Updated last year
- Offline srt producer gui with whisper.cpp☆26Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆133Updated this week
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆222Updated 3 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆119Updated last year
- ☆38Updated 2 years ago
- Automatically generate, translate and overlay subtitles for any video.☆31Updated 2 weeks ago
- Translate HTML using Argos Translate☆52Updated 2 years ago
- Generate transcriptions and subtitles using OpenAI whisper as a base model, stable-ts/whisperx as a timestamp stabilizer using ASR models…☆18Updated 2 years ago
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆82Updated 4 months ago
- streaming speech to text server using Whisper☆93Updated 2 years ago
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆66Updated 9 months ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated 2 years ago