Use any vision LLMs to perform OCR using LangChain
☆18Jul 29, 2025Updated 7 months ago
Alternatives and similar repositories for langchain-ocr
Users that are interested in langchain-ocr are comparing it to the libraries listed below
Sorting:
- ☆12Jul 17, 2025Updated 7 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 2 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Dec 6, 2025Updated 2 months ago
- An extensible viewer for OCR-D mets.xml files☆22May 30, 2024Updated last year
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- You Actually Look Twice At it☆39Jan 21, 2025Updated last year
- Template for AI chatbots & document management using Retrieval-Augmented Generation with vector search and FastAPI.☆59Feb 22, 2026Updated last week
- Page-wise text recognition with lower-supervision line data models☆51Updated this week
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- This is a source code in pure JS to convert widely unsupported WebP format to JPG format (PNG also possible)☆12Apr 30, 2018Updated 7 years ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆18Jun 29, 2024Updated last year
- Miqra According to the Masorah in two JSON formats☆12Updated this week
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆15Aug 22, 2025Updated 6 months ago
- Genetates or validates Hong Kong Identity Card number.☆12May 30, 2023Updated 2 years ago
- PHP Library to use GDAL functions☆13Apr 1, 2020Updated 5 years ago
- version 4.x of the Princeton Geniza Project☆12Feb 24, 2026Updated last week
- Digital texts in Prakrit☆10Sep 14, 2025Updated 5 months ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 4 months ago
- Make downloading scientific data much easier☆11Feb 15, 2026Updated 2 weeks ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated last month
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- No code solution for training tabular models☆34Jan 25, 2026Updated last month
- ☆40Dec 20, 2025Updated 2 months ago
- ☆29Jun 18, 2025Updated 8 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated 10 months ago
- ☆10Jul 18, 2024Updated last year
- ☆13Aug 31, 2022Updated 3 years ago
- A collection of code snippets for geo developers☆11Oct 3, 2020Updated 5 years ago
- A simple demo use Qt & GDAL to visualize Esri shp file☆11Nov 20, 2020Updated 5 years ago
- Divide remote sensing images and their labels into data sets of specified size.☆12Dec 12, 2021Updated 4 years ago
- Standard email templates based on hundreds of real-world emails, including newsletters, on-boarding emails, announcements, events, produc…☆14Sep 15, 2025Updated 5 months ago
- Based on https://www.thinkautonomous.ai/point-clouds☆10Jun 2, 2020Updated 5 years ago
- Tools for debugging, analyzing, and validating 3D Tiles tilesets☆10Feb 1, 2019Updated 7 years ago
- This repository contain the implementation of DANIEL. (A fast Document Attention Network for Information Extraction and Labeling of handw…☆20Jan 12, 2026Updated last month
- XML parser to parse the PASCAL VOC annotiation xml's and convert it to .txt files☆12Jun 9, 2021Updated 4 years ago