PinkFloyded / video-ocr
☆47Updated 3 years ago
Alternatives and similar repositories for video-ocr:
Users that are interested in video-ocr are comparing it to the libraries listed below
- ez audio transcription tool with flexible processing and post-processing options☆147Updated last year
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆104Updated last week
- openai/whisper + extra features☆88Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆77Updated last year
- Code for OpenAI Whisper Web App Demo☆94Updated 2 years ago
- web based editor for subtitles and transcripts☆126Updated 7 months ago
- A testing repo to share code and thoughts on diarisation☆53Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 3 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆199Updated last month
- Large-Language-Model to Machine Interface project.☆18Updated last year
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆57Updated this week
- Whisper combined with Silero VAD, for improved long-form transcriptions☆47Updated 2 years ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆100Updated last month
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆36Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆117Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆142Updated 10 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Faster Whisper ASR transcription with CTranslate2☆20Updated 5 months ago
- Synchronize SRT timestamps over an existing accurate transcription☆28Updated 4 months ago
- whisper.cpp bindings for python☆93Updated last year
- Meta's "No Language Left Behind" models served as web app and REST API☆206Updated 7 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆19Updated 5 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- ☆9Updated last month
- Create narrated video story from book chapter using NLP, OpenAI and StableDiffusion.☆13Updated last week
- Wan 2.1 AI Video Generator Web UI☆19Updated 3 weeks ago
- [WIP] AI Try-On plugin for Chrome☆27Updated last year
- Use LLM (ollama, QWEN, ChatGPT) to translate the pdf inplacely☆28Updated 3 weeks ago