junhoyeo / BetterOCR
Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with LLM.
☆596 · Updated 5 months ago
Alternatives and similar repositories for BetterOCR
Users who are interested in BetterOCR are comparing it to the libraries listed below.
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic f… ☆875 · Updated this week
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,783 · Updated 9 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆864 · Updated last year
- Provides OCR (Optical Character Recognition) services through web applications ☆704 · Updated last year
- UniTable: Towards a Unified Table Foundation Model ☆514 · Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend ☆492 · Updated 2 years ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ☆680 · Updated 6 months ago
- Extract structured text from pdfs quickly ☆629 · Updated 5 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps ☆876 · Updated last month
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework ☆341 · Updated last year
- DoctorGPT provides advanced LLM prompting for PDFs and webpages. ☆245 · Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆785 · Updated last year
- Lightweight, performant, deep table extraction ☆517 · Updated 3 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,276 · Updated 8 months ago
- clean & curate your data with LLMs. ☆490 · Updated last year
- Talk to any ArXiv paper using ChatGPT ☆529 · Updated last year
- ☆447 · Updated last year
- Improved file parsing for LLM's ☆3,137 · Updated last year
- Multimodal RAG to search and interact locally with technical documents of any kind ☆279 · Updated 3 weeks ago
- A series of top performing Text to SQL LLMs ☆866 · Updated last year
- Detect and extract tables to markdown and csv ☆755 · Updated 10 months ago
- Generate question/answer training pairs out of raw text. ☆233 · Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆281 · Updated last month
- www.biblos.app ☆222 · Updated last year
- Finetune llama2-70b and codellama on MacBook Air without quantization ☆450 · Updated last year
- Build Secure and Compliant AI agents and MCP Servers. YC W23 ☆153 · Updated 5 months ago
- Object Detection Model for Scanned Documents ☆93 · Updated 8 months ago
- Open-source platform for extracting structured data from documents using AI. ☆1,454 · Updated 6 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ☆904 · Updated 2 years ago
- Open-source personal bookmarks search engine ☆690 · Updated this week