junhoyeo / BetterOCR
Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with 🧠 LLM.
★585 · Updated 3 months ago
Alternatives and similar repositories for BetterOCR
Users who are interested in BetterOCR are comparing it to the libraries listed below.
- A simple "Be My Eyes" web app with a llama.cpp/llava backend ★493 · Updated last year
- Provides OCR (Optical Character Recognition) services through web applications ★699 · Updated last year
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections. ★2,752 · Updated 6 months ago
- Extract structured text from PDFs quickly ★597 · Updated 3 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing an… ★872 · Updated 11 months ago
- Lightweight, performant, deep table extraction ★506 · Updated last month
- OCR Benchmark ★562 · Updated 3 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ★860 · Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats. ★1,267 · Updated 5 months ago
- DoctorGPT provides advanced LLM prompting for PDFs and webpages. ★244 · Updated last year
- Improved file parsing for LLM's ★3,089 · Updated 10 months ago
- Generate question/answer training pairs out of raw text. ★230 · Updated last year
- UniTable: Towards a Unified Table Foundation Model ★506 · Updated last year
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ★898 · Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams) ★669 · Updated 4 months ago
- Talk to any ArXiv paper using ChatGPT ★530 · Updated last year
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework ★341 · Updated 9 months ago
- An API to transcribe audio with OpenAI's Whisper Large v3! ★302 · Updated 10 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯 ★866 · Updated 6 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ★785 · Updated last year
- TF-ID: Table/Figure IDentifier for academic papers ★240 · Updated last year
- Open-source platform for extracting structured data from documents using AI. ★1,415 · Updated 4 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI. ★1,629 · Updated last year
- A Repo For Document AI ★2,957 · Updated this week
- A Python wrapper to extract text from images on a Mac system. Uses the Vision framework from Apple. ★428 · Updated 6 months ago
- High-performance retrieval engine for unstructured data ★1,498 · Updated last month
- Clean & curate your data with LLMs. ★490 · Updated last year
- Turnkey self-hosted offline transcription and diarization service with LLM summary ★889 · Updated 11 months ago
- Finetune llama2-70b and codellama on MacBook Air without quantization ★448 · Updated last year
- Multimodal RAG to search and interact locally with technical documents of any kind ★252 · Updated last month