Use any vision LLMs to perform OCR using LangChain
☆22Jul 29, 2025Updated 10 months ago
Alternatives and similar repositories for langchain-ocr
Users that are interested in langchain-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 17, 2025Updated 10 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 6 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆16Dec 6, 2025Updated 6 months ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Template for AI chatbots & document management using Retrieval-Augmented Generation with vector search and FastAPI.☆82Updated this week
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated 2 years ago
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 7 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Page-wise text recognition with lower-supervision line data models☆53Updated this week
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- You Actually Look Twice At it☆42Apr 15, 2026Updated 2 months ago
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 7 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- No code solution for training tabular models☆36May 26, 2026Updated 2 weeks ago
- Python tools for performing various operations on ALTO XML files☆50Feb 27, 2025Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆15Jun 27, 2023Updated 2 years ago
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆16Jan 25, 2022Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- ☆29Jun 18, 2025Updated 11 months ago
- A dagger sdk written in rust for rust☆30Jun 29, 2023Updated 2 years ago
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆19Jun 29, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Some bits of javascript to transcribe scanned pages using PageXML☆17May 27, 2026Updated 2 weeks ago
- lcx.exe cross-platform version☆12Mar 5, 2016Updated 10 years ago
- version 4.x of the Princeton Geniza Project☆12Jun 5, 2026Updated last week
- Kubernetes operator that syncs cert-manager Secrets to Azure Key Vault.☆17May 22, 2026Updated 3 weeks ago
- Stock Management System *version 1.0 Stacks used: Python / Django / Html / CSS / jQuery / JavaScript / Bootstrap / MySQL Comprehen…☆15Oct 1, 2023Updated 2 years ago
- ☆15Jul 11, 2022Updated 3 years ago
- A simple todo app, focused on a pleasant, lightweight user experience. Built with Typescript, Next,js, Prisma and Stitches.js ⚡☆13Nov 12, 2021Updated 4 years ago
- Miqra According to the Masorah in two JSON formats☆12Jun 4, 2026Updated last week
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Curated list of useful LLM / Analytics / Datascience resources☆14Jun 7, 2023Updated 3 years ago
- Digital texts in Prakrit☆10Sep 14, 2025Updated 9 months ago
- Standard email templates based on hundreds of real-world emails, including newsletters, on-boarding emails, announcements, events, produc…☆14May 23, 2026Updated 3 weeks ago
- Tools for normalizing the use of some characters and checking file consistencies☆12May 30, 2026Updated 2 weeks ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more.☆25Updated this week
- A tool for improving the output of generic Arabic OCR systems using an n-gram based post-correction approach.☆10Sep 22, 2021Updated 4 years ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆16Mar 22, 2026Updated 2 months ago