junhoyeo / BetterOCRLinks
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β611Updated 8 months ago
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
Sorting:
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ493Updated 2 years ago
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,855Updated 3 weeks ago
- Provides OCR (Optical Character Recognition) services through web applicationsβ704Updated 2 years ago
- Lightweight, performant, deep table extractionβ526Updated last month
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β867Updated 2 years ago
- High-accuracy PDF-to-Markdown OCR API using LLMs with vision capabilities. Features parallel processing, batching, and auto-retry logic fβ¦β883Updated 2 months ago
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β886Updated 2 months ago
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β681Updated 8 months ago
- Improved file parsing for LLMβsβ3,152Updated last year
- π»ππ‘ DoctorGPT provides advanced LLM prompting for PDFs and webpages.β246Updated 2 years ago
- Open-source platform for extracting structured data from documents using AI.β1,464Updated 8 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkitβ783Updated last year
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,275Updated 10 months ago
- www.biblos.appβ226Updated last year
- UniTable: Towards a Unified Table Foundation Modelβ522Updated last year
- β442Updated last year
- Extract structured text from pdfs quicklyβ661Updated 8 months ago
- TF-ID: Table/Figure IDentifier for academic papersβ245Updated last year
- Generate question/answer training pairs out of raw text.β234Updated 2 years ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β911Updated last month
- OCR Benchmarkβ613Updated 3 months ago
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Frameworkβ341Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,642Updated last year
- A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAGβ425Updated last year
- PDF Parsing for RAG β Convert to Markdown & JSON, Fast, Local, No GPUβ847Updated this week
- Detect and extract tables to markdown and csvβ754Updated last year
- Extension of Langchain for RAG. Easy benchmarking, multiple retrievals, reranker, time-aware RAG, and so on...β284Updated 2 years ago
- clean & curate your data with LLMs.β489Updated last year
- Create API agents from OpenAPI Specsβ184Updated 2 years ago
- An API to transcribe audio with OpenAI's Whisper Large v3!β334Updated last year