NanoNets / ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
☆124Updated 3 years ago
Alternatives and similar repositories for ocr-python
Users who are interested in ocr-python are comparing it to the libraries listed below.
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆167Updated 5 months ago
- Multimodal document parser for high quality data understanding and extraction☆89Updated last week
- Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface. Demo: https://huggingface.co/spaces/seanpedri…☆37Updated this week
- Data extraction with LLM on CPU☆270Updated last year
- AI-enabled analysis app, no coding needed. Analyse data, like 44 million Hacker News posts, ask questions. On your computer, your API key…☆30Updated 6 months ago
- TalkNexus: Ollama Chatbot Multi-Model & RAG Interface☆61Updated 2 weeks ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆51Updated last year
- A clean Gradio theme with dark and light variants.☆39Updated last year
- Turn any input document into a sophisticated, context-dependent mindmap that distills the meaning and structure of the document.☆196Updated 11 months ago
- SmolDocling OCR App built using SmolDocling 256M Model and Streamlit.☆233Updated 10 months ago
- A simple MCP application that delivers curated positive and uplifting news stories.☆44Updated 6 months ago
- Record voice notes & transcribe, summarize, and get tasks☆46Updated last year
- A tool to OCR PDFs using gen-AI models☆46Updated last month
- SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…☆78Updated last year
- A RAG system designed to process documents with multimodal content. It can generate factual, context-aware answers to user queries, based…☆26Updated last year
- Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…☆107Updated last year
- Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.☆235Updated last year
- Simple package to extract text with coordinates from programmatic PDFs☆236Updated last week
- An MCP server connecting to managed indexes on LlamaCloud☆86Updated 7 months ago
- Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files☆96Updated last year
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆147Updated 8 months ago
- Vibe-coding tools for the LlamaIndex ecosystem☆176Updated 3 months ago
- Intuitive RAG system on top of LlamaIndex☆15Updated last year