junhoyeo/BetterOCR


junhoyeo / BetterOCR

🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.

☆629

Alternatives and similar repositories for BetterOCR

Users that are interested in BetterOCR are comparing it to the libraries listed below.


Dicklesworthstone / llm_aided_ocr
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
☆2,910 · Mar 22, 2026 · Updated 3 weeks ago
lxe / llavavision
A simple "Be My Eyes" web app with a llama.cpp/llava backend
☆494 · Nov 28, 2023 · Updated 2 years ago
mindee / doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
☆6,023 · Apr 12, 2026 · Updated last week
AdrianKrebs / datalens
An experiment to automate job search with LLMs
☆91 · Sep 1, 2023 · Updated 2 years ago
datalab-to / surya
OCR, layout analysis, reading order, table recognition in 90+ languages
☆19,588 · Apr 10, 2026 · Updated last week
michaelcpuckett / express-worker
Express.js ported to a Service Worker context
☆18 · Mar 6, 2025 · Updated last year
JaidedAI / EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and …
☆29,290 · Dec 5, 2025 · Updated 4 months ago
clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
☆6,833 · Jul 11, 2024 · Updated last year
neuml / txtai
💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows
☆12,395 · Apr 8, 2026 · Updated last week
struct-chat / embedding
Vector Embedding Server in under 100 lines of code
☆22 · Mar 1, 2024 · Updated 2 years ago
automorphic-ai / trex
Enforce structured output from LLMs 100% of the time
☆251 · Jul 20, 2024 · Updated last year
pchunduri6 / rag-demystified
An LLM-powered advanced RAG pipeline built from scratch
☆857 · Jan 26, 2024 · Updated 2 years ago
vladignatyev / bulktag
Bulk image tagging using OpenAI GPT-4 Vision
☆70 · Jul 22, 2024 · Updated last year
seanoliver / audioflare
An all-in-one AI audio playground using Cloudflare AI Workers to transcribe, analyze, summarize, and translate any audio file.
☆479 · Aug 21, 2025 · Updated 7 months ago
lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
☆890 · Dec 10, 2025 · Updated 4 months ago
yaohaizhou / awesome-agi
🤖 A list of latest AGI-related repos, resources and courses including LLMs and AI Agents.
☆13 · Sep 24, 2024 · Updated last year
airbytehq / dagster-langchain
POC integration Airbyte+Dagster+Langchain
☆13 · Jun 1, 2023 · Updated 2 years ago
deepdoctection / deepdoctection
A Repo For Document AI
☆3,161 · Apr 9, 2026 · Updated last week
Ucas-HaoranWei / Vary
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
☆1,893 · Dec 30, 2024 · Updated last year
WhisperSpeech / WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
☆4,590 · Dec 14, 2025 · Updated 4 months ago
reductoai / remembrall
☆169 · Apr 10, 2026 · Updated last week
NeumTry / NeumAI
Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.
☆867 · Jan 15, 2024 · Updated 2 years ago
Fedia / bbb
Browser Bot Bookmarklet
☆33 · Jan 27, 2024 · Updated 2 years ago
SugarAI-HQ / CopilotOne
Add Siri-like native AI agents in your app.
☆55 · Jan 18, 2025 · Updated last year
Nathan-Handy / HandyDash
HandyDash is a cross-platform HTTP, TCP, and IP monitoring tool, intended for desktop use. It is agent free, requires no installation, an…
☆16 · Aug 17, 2024 · Updated last year
adbar / trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…
☆5,703 · Sep 12, 2025 · Updated 7 months ago
Filimoa / open-parse
Improved file parsing for LLMs
☆3,155 · Nov 13, 2024 · Updated last year
datalab-to / marker
Convert PDF to markdown + JSON quickly with high accuracy
☆34,060 · Updated this week
martinlevesque / mini-spend-tracker
☆14 · May 9, 2023 · Updated 2 years ago
jasonjmcghee / plock
From anywhere you can type, query and stream the output of any script (e.g. an LLM)
☆504 · Apr 12, 2024 · Updated 2 years ago
adarsh1021 / notionex
Notionex is a simple Elixir client for the Notion API that also offers rendering of Notion pages into various formats (like HTML)
☆10 · Jun 15, 2025 · Updated 10 months ago
yeungchenwa / OCR-SAM
[Open-Source Project] Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instance…
☆587 · Jan 30, 2024 · Updated 2 years ago
amitlevy / evolutionaryGPT
Evolutionary Search for expert-level performance on any task with environmental feedback
☆14 · Oct 12, 2025 · Updated 6 months ago
prakhar897 / hn-comments-drawer
A js library to incorporate HN comments to any website
☆34 · May 3, 2024 · Updated last year
a-chris / faenz
Faenz is the web analytics for small businesses and side projects.
☆20 · Sep 1, 2024 · Updated last year
jstrieb / paperify
Transform any document, web page, or eBook into a research paper (ChatGPT not required)
☆373 · Sep 6, 2023 · Updated 2 years ago
orcaman / improving_whisper_transcriptions_with_gpt4o
☆12 · Oct 5, 2024 · Updated last year
facebookresearch / nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
☆9,915 · Feb 21, 2025 · Updated last year
Ucas-HaoranWei / GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
☆8,109 · Feb 10, 2025 · Updated last year