junhoyeo / BetterOCRLinks
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β583Updated 2 months ago
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
Sorting:
- Provides OCR (Optical Character Recognition) services through web applicationsβ688Updated last year
- Extract structured text from pdfs quicklyβ585Updated 2 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β867Updated 11 months ago
- Improved file parsing for LLMβsβ3,042Updated 9 months ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,742Updated 6 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β860Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ491Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,263Updated 5 months ago
- UniTable: Towards a Unified Table Foundation Modelβ502Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantizationβ448Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servicβ¦β670Updated this week
- A Repo For Document AIβ2,934Updated this week
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β863Updated 5 months ago
- Lightweight, performant, deep table extractionβ503Updated 3 weeks ago
- clean & curate your data with LLMs.β490Updated last year
- Open-source platform for extracting structured data from documents using AI.β1,406Updated 3 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β665Updated 3 months ago
- Multimodal RAG to search and interact locally with technical documents of any kindβ248Updated 3 weeks ago
- TF-ID: Table/Figure IDentifier for academic papersβ238Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β898Updated last year
- DataDM is your private data assistant. Slide into your data's DMsβ386Updated 10 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.β279Updated last week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit