NanoNets / ocr-pythonLinks
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
☆124Updated 3 years ago
Alternatives and similar repositories for ocr-python
Users that are interested in ocr-python are comparing it to the libraries listed below
Sorting:
- ☆76Updated last year
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆165Updated 5 months ago
- Redact PDF/image-based documents, Word, or CSV/XLSX files using a graphical user interface. Demo: https://huggingface.co/spaces/seanpedri…☆36Updated last week
- SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…☆77Updated last year
- Multimodal document parser for high quality data understanding and extraction☆87Updated last week
- ☆125Updated 11 months ago
- A clean Gradio theme with dark and light variants.☆39Updated last year
- ☆106Updated last year
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…☆55Updated last year
- Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…☆107Updated last year
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.☆99Updated last year
- A tool for querying and interacting with PDF documents using AI. This application uses natural language processing to provide contextuall…☆128Updated 10 months ago
- A Python package for converting PDFs to markdown while extracting images and tables, generate descriptive text descriptions for extracted…☆183Updated 6 months ago
- like firecrawl.dev but free☆51Updated 10 months ago
- ☆79Updated last year
- Chat with PDF files with source highlights☆149Updated last year
- react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…☆171Updated 8 months ago
- Intuitive RAG system on top of LllamaIndex☆15Updated last year
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆51Updated last year
- Local-GenAI-Search is a generative search engine based on Llama 3, langchain and qdrant that answers questions based on your local files☆96Updated last year
- A MCP server connecting to managed indexes on LlamaCloud☆86Updated 7 months ago
- Turn topics, links, and files into AI-generated research notebooks — summarize, explore, and ask anything.☆146Updated 7 months ago
- ☆23Updated last year
- Open Source Note GPT. Turn your photos and images into text notes (in obsidian)☆98Updated 11 months ago
- ☆114Updated last year
- Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.☆47Updated last year
- RAGGENIE: An open-source, low-code platform to build custom Retrieval-Augmented Generation (RAG) Copilets with your own data. Simplify AI…☆180Updated 6 months ago
- A fun project where I use the power of AI to analyze a PDF. The AI extracts key information based on the user's instructions and selectio…☆85Updated last year
- Data extraction with LLM on CPU☆270Updated last year
- Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an…☆111Updated last year