PinkFloyded / video-ocr
☆48Updated 3 years ago
Alternatives and similar repositories for video-ocr
Users that are interested in video-ocr are comparing it to the libraries listed below
- Code for OpenAI Whisper Web App Demo☆93Updated 2 years ago
- Offline srt producer gui with whisper.cpp☆26Updated last year
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆152Updated last year
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- openai/whisper + extra features☆89Updated 2 years ago
- whisper.cpp bindings for python☆98Updated last year
- web based editor for subtitles and transcripts☆135Updated 10 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆78Updated 2 years ago
- Creates video from TTS output and viseme images.☆12Updated 3 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆97Updated last week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Synchronize Whisper's timestamps over an existing accurate transcription☆152Updated last year
- Faster Whisper ASR transcription with CTranslate2☆22Updated 8 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- This is an example of search videos using jina☆23Updated 3 years ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code☆44Updated 2 years ago
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 9 months ago
- A performant high-throughput CPU-based API for Meta's No Language Left Behind (NLLB) using CTranslate2, hosted on Hugging Face Spaces.☆115Updated this week
- Whisper combined with Silero VAD, for improved long-form transcriptions☆52Updated 2 years ago
- Adds a web API to RVC to infer via json requests☆26Updated 11 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆119Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- ☆37Updated 2 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆52Updated this week
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆128Updated last week
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆120Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- Coqui AI TTS plugin☆80Updated 3 months ago
- OpenAI Whisper API-style local server, running on FastAPI☆80Updated 6 months ago