cdli-gh / Cuneiform-OCR
This repository contains code for line detection, character detection, and recognition on 2D cuneiform images
☆35Updated 6 years ago
Alternatives and similar repositories for Cuneiform-OCR
Users who are interested in Cuneiform-OCR are comparing it to the libraries listed below
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆72Updated last week
- OCRmyPDF EasyOCR plugin☆93Updated last month
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆633Updated 3 weeks ago
- Tools for manipulating and evaluating the hOCR format for representing multi-lingual OCR results by embedding them into HTML.☆399Updated last year
- Building scantailor and its dependencies☆62Updated 2 years ago
- Translate HTML using Argos Translate☆53Updated 2 years ago
- Training scripts for Argos Translate☆141Updated last week
- A curated list of resources around PDF files☆143Updated last year
- Open source optical mark recognition (OMR) software for creating, tagging and reading bubble sheet forms. For Windows, Mac & Linux.☆28Updated 3 years ago
- Annotate entities directly onto a PDF with automatic OCR for scanned PDFs☆61Updated 2 years ago
- A post-processing tool for scanned sheets of paper.☆84Updated last year
- A free Windows graphical interface to the Tesseract 4.0 OCR engine.☆61Updated 3 years ago
- Toolkit for training/converting LibreTranslate compatible language models 🚂☆63Updated 3 months ago
- Convert a PDF via OCR to a TXT file in UTF-8 encoding☆152Updated 2 years ago
- Document image dewarping library using a cubic sheet model☆175Updated this week
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- A semi-automatic open-source tool for Layout Analysis and Region EXtraction on early printed books.☆192Updated 3 months ago
- Examples and guides for using Nomic Atlas☆38Updated 6 months ago
- OPUS-CAT is a collection of software which makes it possible to use OPUS-MT neural machine translation models in professional translation. OPU…☆80Updated 8 months ago
- Quick access to any large language model from your browser.☆10Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characte…☆220Updated 9 months ago
- Document Layout Analysis☆390Updated this week
- Read-only mirror of https://gitlab.gnome.org/GNOME/ocrfeeder☆89Updated last month
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆52Updated 7 months ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆327Updated 2 years ago
- Speak (speech-to-text) to LLMs (Ollama) in any language - Streamlit app☆47Updated last year
- OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.☆119Updated 2 years ago
- A wrapper for tesseract / abbyyOCR11 ocr4linux finereader cli that can perform batch operations or monitor a directory and launch an OCR …☆66Updated last year
- ☆39Updated 2 years ago
- Apply different text recognition services to images of handwritten documents.☆187Updated 2 years ago