junhoyeo / BetterOCRLinks
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β590Updated 4 months ago
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
Sorting:
- UniTable: Towards a Unified Table Foundation Modelβ506Updated last year
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ492Updated last year
- Improved file parsing for LLMβsβ3,106Updated 10 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β872Updated last year
- Provides OCR (Optical Character Recognition) services through web applicationsβ698Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,756Updated 7 months ago
- π»ππ‘ DoctorGPT provides advanced LLM prompting for PDFs and webpages.β245Updated last year
- Lightweight, performant, deep table extractionβ513Updated 2 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,269Updated 6 months ago
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Frameworkβ341Updated 10 months ago
- A Repo For Document AIβ2,967Updated 3 weeks ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β867Updated 7 months ago
- Extract structured text from pdfs quicklyβ607Updated 4 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β898Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β861Updated last year
- Safe, Open, High-Performance β PDF for AIβ687Updated this week
- Finetune llama2-70b and codellama on MacBook Air without quantizationβ449Updated last year
- Remove background directly in your browser, powered by WebGPUβ470Updated last year
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.β280Updated last month
- Generate question/answer training pairs out of raw text.β230Updated last year
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...β283Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkitβ785Updated last year
- High-performance retrieval engine for unstructured dataβ1,503Updated 2 months ago
- Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMsβ795Updated 8 months ago
- Build Secure and Compliant AI agents and MCP Servers. YC W23β152Updated 4 months ago
- OCR Benchmarkβ572Updated 4 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β671Updated 4 months ago
- Detect and extract tables to markdown and csvβ752Updated 8 months ago
- DataDM is your private data assistant. Slide into your data's DMsβ386Updated last year
- Object Detection Model for Scanned Documentsβ94Updated 7 months ago