NanoNets / ocr-pythonLinks
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
☆110Updated 2 years ago
Alternatives and similar repositories for ocr-python
Users that are interested in ocr-python are comparing it to the libraries listed below
Sorting:
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆98Updated 2 weeks ago
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆47Updated 6 months ago
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…☆55Updated last year
- Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…☆86Updated 7 months ago
- Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.☆46Updated last year
- ☆122Updated 4 months ago
- ☆109Updated last year
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆45Updated last year
- Chat effortlessly, execute commands, and interpret code with Llama3, Phi3, and more - your local AI assistant. Enjoy seamless interaction…☆81Updated last year
- Awesome LLM application repo☆85Updated 4 months ago
- Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an…☆98Updated 11 months ago
- Chat with PDF files with source highlights☆142Updated 7 months ago
- Corrective RAG demo powerd by Ollama☆103Updated last year
- Simple Chainlit UI for running llms locally using Ollama and LangChain☆119Updated 11 months ago
- Streamlit demo of Scrapegraph-ai for GPT4-hackaton☆102Updated this week
- Intuitive RAG system on top of LllamaIndex☆13Updated 8 months ago
- A clean Gradio theme with dark and light variants.☆35Updated last year
- ☆38Updated last year
- A Python package for zero-shot text anonymization using Transformer-based NER models.☆31Updated this week
- Data extraction with LLM on CPU☆268Updated last year
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.☆95Updated 5 months ago
- ☆23Updated last year
- ☆66Updated 7 months ago
- A set of re-usable AI agent for document processing☆90Updated 6 months ago
- SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…☆67Updated last year
- React app that highlights relevant segments in a PDF document based on user questions using natural language processing and AI context se…☆10Updated 2 years ago
- Making docling agentic through MCP☆125Updated last week
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆73Updated 9 months ago
- Groq Compound Beta MCP Server☆27Updated last month
- react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…☆162Updated 2 months ago