junhoyeo / BetterOCRLinks
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β564Updated 2 weeks ago
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
Sorting:
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,678Updated 4 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β852Updated 3 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β858Updated 9 months ago
- Provides OCR (Optical Character Recognition) services through web applicationsβ679Updated last year
- Extract structured text from pdfs quicklyβ500Updated 2 weeks ago
- UniTable: Towards a Unified Table Foundation Modelβ482Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β653Updated last month
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,613Updated 10 months ago
- Object Detection Model for Scanned Documentsβ93Updated 3 months ago
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCRβ127Updated this week
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,250Updated 3 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysisβ351Updated 2 years ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ490Updated last year
- Powerful handwritten text recognition. A simple-to-use, unofficial implementation of the paper "TrOCR: Transformer-based Optical Characteβ¦β206Updated 5 months ago
- Document Layout Analysisβ376Updated 2 weeks ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β857Updated last year
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.β252Updated 2 weeks ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β893Updated last year
- Lightweight, performant, deep table extractionβ478Updated this week
- A series of top performing Text to SQL LLMsβ866Updated last year
- π»ππ‘ DoctorGPT provides advanced LLM prompting for PDFs and webpages.β246Updated last year
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluationβ526Updated last month
- turnkey self-hosted offline transcription and diarization service with llm summaryβ862Updated 9 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3!β292Updated 7 months ago
- Turn images of tables into CSV data. Detect tables from images and run OCR on the cells.β521Updated 4 years ago
- A PyTorch implementation of DTrOCR: Decoder-only Transformer for Optical Character Recognitionβ173Updated 3 weeks ago
- β187Updated 2 weeks ago
- Remove background directly in your browser, powered by WebGPUβ467Updated 10 months ago
- A python wrapper to extract text from images on a mac system. Uses the vision framework from Apple.β400Updated 3 months ago
- Visualise your CSV files in seconds without sending your data anywhereβ510Updated last week