dinosauria123 / makepdf
Make a searchable PDF via Google Cloud Vision OCR
☆14Updated 5 years ago
Alternatives and similar repositories for makepdf:
Users who are interested in makepdf are comparing it to the libraries listed below
- Tool to OCR PDFs using Google Cloud Vision☆42Updated 2 years ago
- gcv2hocr converts from Google Cloud Vision OCR output to hocr to make a searchable pdf.☆106Updated 4 years ago
- A Python utility to convert .srt files to a .txt format.☆14Updated 2 years ago
- 💬 Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper on Win, Linux and Mac ... fast!☆42Updated 3 weeks ago
- Image Sorting and Classification via Text Detection and Recognition☆13Updated 5 years ago
- A simple tool to estimate the reading order of comic panels☆16Updated 2 years ago
- Convert ALTO XML to plain text + minimal metadata☆16Updated 6 months ago
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆170Updated 3 weeks ago
- A lightweight transcript editor for editing and correcting STT generated timed transcripts☆45Updated 3 weeks ago
- OCRmyPDF EasyOCR plugin☆84Updated 3 weeks ago
- OCR system for recognizing modern Japanese magazines☆144Updated last year
- A seamless optical character recognition real-time translator application right on your desktop☆16Updated 11 months ago
- Docker integration of Kitodo.Production and OCR-D☆9Updated last year
- Given the URL to a public JSON document in an International Image Interoperability Framework (IIIF) image server, this script will downlo…☆16Updated 2 years ago
- Deep Zoom Image Downloader☆20Updated this week
- Python API & command-line tool to easily transcribe speech-based video files into clean text☆210Updated 5 months ago
- ☆24Updated 2 years ago
- Extract images from the Google arts and culture page to create a supervised classification model and serve it in Python/FastAPI.☆10Updated 2 years ago
- GUI for whispercpp, a high performance C++ port of OpenAI's whisper☆70Updated last month
- Hyperaudio Lite - a Super-lightweight Interactive Transcript Player☆145Updated 5 months ago
- This repository contains code for line detection, character detection, and recognition on 2D cuneiform images☆32Updated 5 years ago
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆107Updated 2 months ago
- This tool is to help those who'd rather manually caption images in batches with ease☆14Updated last year
- An easy-to-use GUI addon for whisper-standalone-win. Designed for those who prefer a simple interface over typing commands and file paths…☆12Updated last year
- An open source online storytelling platform for everyone. Built by Cogapp.☆27Updated 2 months ago
- UI for extracting data from PDF files using watsonx prompts☆12Updated 2 months ago
- Repository hosting the common code for the entity-fishing clients☆10Updated 11 months ago
- Fast PDF generation and compression. Deals with millions of pages daily.☆115Updated 8 months ago
- GUI for users of the coqui-TTS VITS model.☆11Updated 2 years ago
- Command Line Interface for running 🤗 Transformers Image Classification locally☆19Updated 2 weeks ago