junhoyeo / BetterOCRLinks
π Better text detection by combining multiple OCR engines (EasyOCR, Tesseract, and Pororo) with π§ LLM.
β570Updated last month
Alternatives and similar repositories for BetterOCR
Users that are interested in BetterOCR are comparing it to the libraries listed below
Sorting:
- Enhance Tesseract OCR output for scanned PDFs by applying Large Language Model (LLM) corrections.β2,714Updated 5 months ago
- An open-source OCR API that leverages OpenAI's powerful language models with optimized performance techniques like parallel processing anβ¦β866Updated 10 months ago
- A simple "Be My Eyes" web app with a llama.cpp/llava backendβ490Updated last year
- Provides OCR (Optical Character Recognition) services through web applicationsβ682Updated last year
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.β858Updated last year
- UniTable: Towards a Unified Table Foundation Modelβ487Updated last year
- Lightweight, performant, deep table extractionβ494Updated last week
- Extract structured text from pdfs quicklyβ516Updated last month
- Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)β660Updated 2 months ago
- This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.β1,255Updated 4 months ago
- π»ππ‘ DoctorGPT provides advanced LLM prompting for PDFs and webpages.β245Updated last year
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkitβ779Updated 11 months ago
- www.biblos.appβ213Updated 11 months ago
- Talk to any ArXiv paper using ChatGPTβ527Updated last year
- Connect and chat with your multiple documents (pdf and txt) through GPT 3.5, GPT-4 Turbo, Claude and Local Open-Source LLMsβ798Updated 5 months ago
- Improved file parsing for LLMβsβ3,023Updated 8 months ago
- A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAGβ393Updated 11 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR modeβ¦β895Updated last year
- Build Secure and Compliant AI agents and MCP Servers. YC W23β146Updated last month
- clean & curate your data with LLMs.β492Updated last year
- WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.β1,620Updated 11 months ago
- β443Updated 10 months ago
- Generate question/answer training pairs out of raw text.β226Updated last year
- Open-source personal bookmarks search engineβ674Updated this week
- Remove background directly in your browser, powered by WebGPUβ468Updated 11 months ago
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for raβ¦β678Updated 6 months ago
- A Repo For Document AIβ2,899Updated last week
- Finetune llama2-70b and codellama on MacBook Air without quantizationβ447Updated last year
- Math OCR model that outputs LaTeX and markdownβ1,067Updated 6 months ago
- Web scraper made for AI and simplicity in mind. It runs as a CLI that can be parallelized and outputs high-quality markdown content.β524Updated 2 weeks ago