Use any vision LLM to perform OCR using LangChain
☆23 Jul 29, 2025 Updated 11 months ago
Alternatives and similar repositories for langchain-ocr
Users who are interested in langchain-ocr are comparing it to the libraries listed below.
- ☆12 Jul 17, 2025 Updated 11 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 Dec 10, 2025 Updated 6 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg ☆12 Aug 2, 2024 Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation… ☆16 Dec 6, 2025 Updated 6 months ago
- tesseractXplore: a Tesseract ease-of-use GUI with full control ☆26 Nov 10, 2021 Updated 4 years ago
- Template for AI chatbots & document management using Retrieval-Augmented Generation with vector search and FastAPI. ☆84 Updated this week
- An extensible viewer for OCR-D mets.xml files ☆23 May 30, 2024 Updated 2 years ago
- Transcriptions of primers (19th century) ☆11 Oct 31, 2025 Updated 8 months ago
- Check your modified Ground Truth files with visual support! ☆10 Jan 31, 2024 Updated 2 years ago
- Crop And Splice Segments (of scanned pages) ☆14 Mar 11, 2019 Updated 7 years ago
- Page-wise text recognition with lower-supervision line data models ☆53 Jun 12, 2026 Updated 3 weeks ago
- NLP-helper for OCR-ed pages in PAGE XML format ☆10 Dec 6, 2024 Updated last year
- You Actually Look Twice At it ☆42 Apr 15, 2026 Updated 2 months ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911) ☆18 Oct 31, 2025 Updated 8 months ago
- No code solution for training tabular models ☆36 Jun 16, 2026 Updated 2 weeks ago
- Python tools for performing various operations on ALTO XML files ☆50 Jun 12, 2026 Updated 3 weeks ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆16 Jun 27, 2023 Updated 3 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19… ☆16 Oct 18, 2024 Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match. ☆16 Jan 25, 2022 Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen" ☆12 Dec 17, 2021 Updated 4 years ago
- ☆29 Jun 18, 2025 Updated last year
- A dagger sdk written in rust for rust ☆30 Jun 29, 2023 Updated 3 years ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV ☆19 Jun 29, 2024 Updated 2 years ago
- Some bits of javascript to transcribe scanned pages using PageXML ☆17 May 27, 2026 Updated last month
- lcx.exe cross-platform version ☆12 Mar 5, 2016 Updated 10 years ago
- version 4.x of the Princeton Geniza Project ☆13 Jun 12, 2026 Updated 3 weeks ago
- Kubernetes operator that syncs cert-manager Secrets to Azure Key Vault. ☆18 May 22, 2026 Updated last month
- Stock Management System, version 1.0. Stacks used: Python / Django / HTML / CSS / jQuery / JavaScript / Bootstrap / MySQL. Comprehen… ☆15 Oct 1, 2023 Updated 2 years ago
- ☆15 Jul 11, 2022 Updated 3 years ago
- A simple todo app, focused on a pleasant, lightweight user experience. Built with TypeScript, Next.js, Prisma and Stitches.js ⚡ ☆13 Nov 12, 2021 Updated 4 years ago
- Miqra According to the Masorah in two JSON formats ☆12 Jun 4, 2026 Updated last month
- Lightweight library for accessing data and configuration ☆13 Apr 16, 2025 Updated last year
- Curated list of useful LLM / Analytics / Datascience resources ☆14 Jun 7, 2023 Updated 3 years ago
- Digital texts in Prakrit ☆11 Sep 14, 2025 Updated 9 months ago
- Standard email templates based on hundreds of real-world emails, including newsletters, on-boarding emails, announcements, events, produc… ☆14 May 23, 2026 Updated last month
- Tools for normalizing the use of some characters and checking file consistencies ☆12 May 30, 2026 Updated last month
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more. ☆25 Jun 17, 2026 Updated 2 weeks ago
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach. ☆10 Sep 22, 2021 Updated 4 years ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests. ☆17 Mar 22, 2026 Updated 3 months ago