NanoNets / ocr-pythonLinks

OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.

☆110

Alternatives and similar repositories for ocr-python

Users that are interested in ocr-python are comparing it to the libraries listed below

Sorting:

genieincodebottle / parsemypdf
Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…
☆98Updated 2 weeks ago
lesteroliver911 / docling-pdf-processor
PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…
☆47Updated 6 months ago
shitan198u / AnyChat
Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…
☆55Updated last year
iamarunbrahma / pdf-to-markdown
Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…
☆86Updated 7 months ago
sudarshan-koirala / RAG-chat-with-documents
Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.
☆46Updated last year
run-llama / llama_extract
☆122Updated 4 months ago
tylerprogramming / 31-day-challenge-ai
☆109Updated last year
sudarshan-koirala / langchain-gemma-ollama-chainlit
Simple Chainlit UI for running llms locally using Ollama and LangChain
☆45Updated last year
neokd / NeoGPT
Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction…
☆81Updated last year
XinyueZ / chat-your-doc
Awesome LLM application repo
☆85Updated 4 months ago
AdieLaine / Streamly
Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an…
☆98Updated 11 months ago
denser-org / denser-chat
Chat with PDF files with source highlights
☆142Updated 7 months ago
Nagi-ovo / CRAG-Ollama-Chat
Corrective RAG demo powerd by Ollama
☆103Updated last year
sudarshan-koirala / rag-chat-with-pdf
Simple Chainlit UI for running llms locally using Ollama and LangChain
☆119Updated 11 months ago
ScrapeGraphAI / Scrapegraph-demo
Streamlit demo of Scrapegraph-ai for GPT4-hackaton
☆102Updated this week
Zakk-Yang / nexusync
Intuitive RAG system on top of LllamaIndex
☆13Updated 8 months ago
lone17 / kotaemon-gradio-theme
A clean Gradio theme with dark and light variants.
☆35Updated last year
definitive-io / duckdb-rag
☆38Updated last year
deepanwadhwa / zink
A Python package for zero-shot text anonymization using Transformer-based NER models.
☆31Updated this week
katanaml / llm-mistral-invoice-cpu
Data extraction with LLM on CPU
☆268Updated last year
mattlgroff / pdf-to-markdown
Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.
☆95Updated 5 months ago
InsightEdge01 / ScrapegraphAIOllamallama3
☆23Updated last year
imanoop7 / Generative-Search-Engine-For-Local-Files
☆66Updated 7 months ago
CVxTz / document_ai_agents
A set of re-usable AI agent for document processing
☆90Updated 6 months ago
Bklieger / Semantic
SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…
☆67Updated last year
admineral / PDF-Pilot
React app that highlights relevant segments in a PDF document based on user questions using natural language processing and AI context se…
☆10Updated 2 years ago
docling-project / docling-mcp
Making docling agentic through MCP
☆125Updated last week
lesteroliver911 / ai-pdf-ppt-generator-openai
A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…
☆73Updated 9 months ago
groq / compound-mcp-server
Groq Compound Beta MCP Server
☆27Updated last month
SciPhi-AI / R2R-Application
react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…
☆162Updated 2 months ago