Use any vision LLMs to perform OCR using LangChain
☆22Jul 29, 2025Updated 9 months ago
Alternatives and similar repositories for langchain-ocr
Users that are interested in langchain-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 17, 2025Updated 10 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 5 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆15Dec 6, 2025Updated 5 months ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Template for AI chatbots & document management using Retrieval-Augmented Generation with vector search and FastAPI.☆81Updated this week
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 6 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Page-wise text recognition with lower-supervision line data models☆53Updated this week
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- You Actually Look Twice At it☆41Apr 15, 2026Updated last month
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 6 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- No code solution for training tabular models☆35May 12, 2026Updated last week
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆16Jan 25, 2022Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- ☆29Jun 18, 2025Updated 11 months ago
- A static MCP server that provides AI models with persistent tool context, preventing context loss between chats.☆30Apr 10, 2026Updated last month
- A dagger sdk written in rust for rust☆30Jun 29, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆19Jun 29, 2024Updated last year
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- lcx.exe cross-platform version☆12Mar 5, 2016Updated 10 years ago
- version 4.x of the Princeton Geniza Project☆12Updated this week
- Kubernetes operator that syncs cert-manager Secrets to Azure Key Vault.☆17Updated this week
- Stock Management System *version 1.0 Stacks used: Python / Django / Html / CSS / jQuery / JavaScript / Bootstrap / MySQL Comprehen…☆14Oct 1, 2023Updated 2 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- A simple todo app, focused on a pleasant, lightweight user experience. Built with Typescript, Next,js, Prisma and Stitches.js ⚡☆13Nov 12, 2021Updated 4 years ago
- Miqra According to the Masorah in two JSON formats☆12Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Digital texts in Prakrit☆10Sep 14, 2025Updated 8 months ago
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated last year
- Curated list of useful LLM / Analytics / Datascience resources☆14Jun 7, 2023Updated 2 years ago
- Standard email templates based on hundreds of real-world emails, including newsletters, on-boarding emails, announcements, events, produc…☆14Updated this week
- Tools for normalizing the use of some characters and checking file consistencies☆12Jan 12, 2026Updated 4 months ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more.☆23Feb 2, 2026Updated 3 months ago
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.☆10Sep 22, 2021Updated 4 years ago