Use any vision LLMs to perform OCR using LangChain
☆18Jul 29, 2025Updated 7 months ago
Alternatives and similar repositories for langchain-ocr
Users that are interested in langchain-ocr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Jul 17, 2025Updated 8 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Small collection of PAGE XML related scripts used at the ZPD Würzburg☆12Aug 2, 2024Updated last year
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆14Dec 6, 2025Updated 3 months ago
- tesseractXplore a tesseract ease of use gui with full control☆28Nov 10, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Template for AI chatbots & document management using Retrieval-Augmented Generation with vector search and FastAPI.☆66Mar 18, 2026Updated last week
- An extensible viewer for OCR-D mets.xml files☆23May 30, 2024Updated last year
- Transkriptionen von Fibeln (19. Jahrhundert)☆11Oct 31, 2025Updated 4 months ago
- Check your modified Ground Truth files with visual support!☆10Jan 31, 2024Updated 2 years ago
- Crop And Splice Segments (of scanned pages)☆14Mar 11, 2019Updated 7 years ago
- Page-wise text recognition with lower-supervision line data models☆52Mar 11, 2026Updated 2 weeks ago
- You Actually Look Twice At it☆39Jan 21, 2025Updated last year
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- NewsEye / READ OCR training dataset from Austrian Newspapers (1864–1911)☆18Oct 31, 2025Updated 4 months ago
- No code solution for training tabular models☆35Jan 25, 2026Updated 2 months ago
- Python tools for performing various operations on ALTO XML files☆49Feb 27, 2025Updated last year
- Package that compiles the microsoft dxgkrnl driver from WSL Kernel for using partitioned GPUs from hyperV☆18Jun 29, 2024Updated last year
- Reichsanzeiger-NLP: NER/NEL corpus for the German historical newspaper "Deutscher Reichsanzeiger und Preußischer Staatsanzeiger" (1819–19…☆16Oct 18, 2024Updated last year
- Images of example pages from Transkribus model training sets to make it easier to find a match.☆15Jan 25, 2022Updated 4 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- ☆29Jun 18, 2025Updated 9 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A dagger sdk written in rust for rust☆30Jun 29, 2023Updated 2 years ago
- ☆14Jul 11, 2022Updated 3 years ago
- lcx.exe cross-platform version☆12Mar 5, 2016Updated 10 years ago
- version 4.x of the Princeton Geniza Project☆12Feb 24, 2026Updated last month
- Some bits of javascript to transcribe scanned pages using PageXML☆17Mar 18, 2024Updated 2 years ago
- Stock Management System *version 1.0 Stacks used: Python / Django / Html / CSS / jQuery / JavaScript / Bootstrap / MySQL Comprehen…☆14Oct 1, 2023Updated 2 years ago
- Kubernetes operator that syncs cert-manager Secrets to Azure Key Vault.☆17May 31, 2025Updated 9 months ago
- Notes and information for building the WSL-Kernel module and setting up GPU-PV in Linux guests.☆15Updated this week
- Miqra According to the Masorah in two JSON formats☆12Updated this week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Digital texts in Prakrit☆10Sep 14, 2025Updated 6 months ago
- A simple todo app, focused on a pleasant, lightweight user experience. Built with Typescript, Next,js, Prisma and Stitches.js ⚡☆13Nov 12, 2021Updated 4 years ago
- Lightweight library for accessing data and configuration☆13Apr 16, 2025Updated 11 months ago
- Curated list of useful LLM / Analytics / Datascience resources☆14Jun 7, 2023Updated 2 years ago
- Tools for normalizing the use of some characters and checking file consistencies☆11Jan 12, 2026Updated 2 months ago
- Standard email templates based on hundreds of real-world emails, including newsletters, on-boarding emails, announcements, events, produc…☆14Sep 15, 2025Updated 6 months ago
- ☆48Dec 20, 2025Updated 3 months ago