PinkFloyded / video-ocrLinks
☆48Updated 3 years ago
Alternatives and similar repositories for video-ocr
Users that are interested in video-ocr are comparing it to the libraries listed below
Sorting:
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- Creates video from TTS output and viseme images.☆12Updated 2 years ago
- web based editor for subtitles and transcripts☆133Updated 9 months ago
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- ChatGPT Anywhere is a browser extension skeleton for seamless ChatGPT integration, interacting directly with the ChatGPT's browser API an…☆38Updated last year
- Meta's "No Language Left Behind" models served as web app and REST API☆212Updated last week
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆78Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options☆151Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Incredibly descriptive audiovisual summaries for videos☆41Updated 10 months ago
- Offline srt producer gui with whisper.cpp☆25Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆112Updated this week
- openvino version of openai/whisper☆166Updated last year
- Easy-Translate is a script for translating large text files with a SINGLE COMMAND. Easy-Translate is designed to be as easy as possible f…☆217Updated 6 months ago
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆90Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆27Updated 10 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆50Updated 11 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- ☆70Updated last month
- Large-Language-Model to Machine Interface project.☆19Updated last year
- ☆43Updated 4 months ago
- Mistral-7B finetuned for function calling☆16Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- ☆37Updated 2 years ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆80Updated 6 months ago
- whisper.cpp bindings for python☆96Updated last year
- this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews☆47Updated 10 months ago
- 4bit bitsandbytes quants of the best 7B vlms☆30Updated 8 months ago