junhoyeo / BetterOCR
Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with LLM.
★550 Updated 4 months ago
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… ★854 Updated 8 months ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backend ★488 Updated last year
- Provides OCR (Optical Character Recognition) services through web applications ★678 Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ★2,652 Updated 3 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ★1,606 Updated 10 months ago
- UniTable: Towards a Unified Table Foundation Model ★473 Updated 11 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ★857 Updated last year
- Improved file parsing for LLM's ★2,977 Updated 6 months ago
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework ★339 Updated 6 months ago
- Lightweight, performant, deep table extraction ★463 Updated last week
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ★891 Updated last year
- (Cross-Platform) An open source approach to locally record and enable searching everything you view on any computer. ★277 Updated last year
- Detect and extract tables to markdown and csv ★746 Updated 4 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23 ★131 Updated this week
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ★228 Updated last year
- Create API agents from OpenAPI Specs ★182 Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ★767 Updated 9 months ago
- Visualise your CSV files in seconds without sending your data anywhere ★509 Updated last week
- Generate question/answer training pairs out of raw text. ★226 Updated last year
- DoctorGPT provides advanced LLM prompting for PDFs and webpages. ★245 Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ★1,239 Updated 2 months ago
- A Repo For Document AI ★2,838 Updated this week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ★221 Updated 5 months ago
- Examples for Cerebrium Serverless GPUs ★486 Updated last week
- A python wrapper to extract text from images on a mac system. Uses the vision framework from Apple. ★395 Updated 2 months ago
- Easy-to-Use Apple Vision wrapper for text extraction, scalar representation and clustering using K-means. ★104 Updated last year
- ★443 Updated 8 months ago
- TF-ID: Table/Figure IDentifier for academic papers ★235 Updated 10 months ago
- Extract structured text from pdfs quickly ★481 Updated this week
- Fully neural approach for text chunking ★350 Updated last month