junhoyeo / BetterOCR
🔍 Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
☆527 · Updated 3 weeks ago
Alternatives and similar repositories for BetterOCR:
Users who are interested in BetterOCR are comparing it to the libraries listed below.
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ☆2,477 · Updated 6 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ☆799 · Updated last month
- UniTable: Towards a Unified Table Foundation Model ☆432 · Updated 8 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ☆1,185 · Updated 4 months ago
- Improved file parsing for LLM's ☆2,814 · Updated 3 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆852 · Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backend ☆491 · Updated last year
- Lightweight, performant, deep table extraction ☆410 · Updated this week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆748 · Updated 6 months ago
- DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception ☆857 · Updated last month
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic… ☆258 · Updated 2 weeks ago
- (Cross-Platform) An open source approach to locally record and enable searching everything you view on any computer. ☆265 · Updated 9 months ago
- Provides OCR (Optical Character Recognition) services through web applications ☆615 · Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI. ☆1,580 · Updated 6 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… ☆828 · Updated 4 months ago
- Visualise your CSV files in seconds without sending your data anywhere ☆493 · Updated last month
- Create API agents from OpenAPI Specs ☆178 · Updated last year
- ☆173 · Updated this week
- DOM to Semantic-Markdown for use with LLMs ☆757 · Updated 2 weeks ago
- 💻📚💡 DoctorGPT provides advanced LLM prompting for PDFs and webpages. ☆244 · Updated last year
- High-performance retrieval engine for unstructured data ☆1,169 · Updated this week
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework ☆338 · Updated 2 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig… ☆217 · Updated last month
- A Repo For Document AI ☆2,712 · Updated this week
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on... ☆279 · Updated last year
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆82 · Updated last month
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells. ☆517 · Updated 3 years ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆316 · Updated 2 years ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆167 · Updated 8 months ago
- Detect and extract tables to markdown and csv ☆726 · Updated 3 weeks ago