junhoyeo / BetterOCR
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β543Updated 3 months ago
Alternatives and similar repositories for BetterOCR:
Users that are interested in BetterOCR are comparing it to the libraries listed below
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,626Updated 2 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,228Updated last month
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β852Updated 7 months ago
- UniTable: Towards a Unified Table Foundation Modelβ465Updated 11 months ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ489Updated last year
- Omni SenseVoice: High-Speed Speech Recognition with words timestamps π£οΈπ―β842Updated 2 months ago
- Provides OCR (Optical Character Recognition) services through web applicationsβ679Updated last year
- clean & curate your data with LLMs.β488Updated 10 months ago
- Official implement of paper "AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation" [EMNLP 24']β463Updated 4 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β854Updated last year
- Examples and guides for using the VLM Run APIβ275Updated this week
- Improved file parsing for LLMβsβ2,943Updated 5 months ago
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,598Updated 9 months ago
- Lightweight, performant, deep table extractionβ457Updated last week
- Document Layout Analysisβ372Updated this week
- (Cross-Platform) An open source approach to locally record and enable searching everything you view on any computer.β276Updated last year
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for raβ¦β675Updated 3 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β887Updated last year
- High-performance retrieval engine for unstructured dataβ1,373Updated this week
- A Repo For Document AIβ2,810Updated 3 weeks ago
- Finetune llama2-70b and codellama on MacBook Air without quantizationβ448Updated last year
- Create API agents from OpenAPI Specsβ181Updated last year
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β621Updated last week
- Extract structured text from pdfs quicklyβ471Updated 2 months ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysisβ338Updated 2 years ago
- WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.β248Updated 2 months ago
- A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAGβ381Updated 8 months ago
- [CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasksβ429Updated 3 months ago
- turnkey self-hosted offline transcription and diarization service with llm summaryβ844Updated 7 months ago
- Open-source platform for extracting structured data from documents using AI.β1,303Updated last week