NanoNets / ocr-pythonLinks
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
☆113Updated 2 years ago
Alternatives and similar repositories for ocr-python
Users that are interested in ocr-python are comparing it to the libraries listed below
Sorting:
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆133Updated this week
- ☆66Updated 8 months ago
- ☆122Updated 6 months ago
- Open Source Note GPT. Turn your photos and images into text notes (in obsidian)☆94Updated 6 months ago
- SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…☆69Updated last year
- Data extraction with LLM on CPU☆269Updated last year
- Create-tsi is a generative AI RAG toolkit which generates AI Applications with low code.☆234Updated 9 months ago
- ☆23Updated last year
- PDF intelligence platform combining IBM Docling for document processing, LlamaIndex for data structuring, and Streamlit for a powerful UI…☆47Updated 8 months ago
- Demo of the neural semantic search built with Qdrant☆171Updated 4 months ago
- Groq goes brrrrr... so had to make a basic Streamlit app you can build upon!☆83Updated 7 months ago
- Multimodal document parser for high quality data understanding and extraction☆78Updated last week
- Chainlit app for advanced RAG. Uses llamaparse, langchain, qdrant and models from groq.☆47Updated last year
- A free open source RAG based AI legal assistant.☆171Updated 3 weeks ago
- Chat with PDF files with source highlights☆145Updated 8 months ago
- Awesome LLM application repo☆86Updated 5 months ago
- Code example of how to call your OpenAI assistant via API (Python).☆24Updated last year
- Conversion of PDF documents to structured Markdown, optimized for Retrieval Augmented Generation (RAG) and other NLP tasks. Extract text,…☆92Updated 9 months ago
- Using GPT-4 Vision and GPT-4 Turbo, take a PDF as input and get a markdown file as output.☆96Updated 7 months ago
- ☆38Updated last year
- A set of re-usable AI agent for document processing☆93Updated 7 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆29Updated 2 years ago
- VT.ai - multimodal AI chat app with dynamic conversation routing☆86Updated 3 months ago
- ☆64Updated last year
- Open-source RAG evaluation through users' feedback☆201Updated last year
- A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure Cognitive Search for retrieval and Azure OpenAI…☆33Updated last year
- react + next.js dashboard for R2R: The most advanced AI retrieval system. Containerized, Retrieval-Augmented Generation (RAG) with a REST…☆165Updated 4 months ago
- Streamly - Streamlit Assistant is designed to provide the latest updates from Streamlit, generate code snippets for Streamlit widgets, an…☆102Updated last year
- Build robust, production grade function calling assistants that work. Declarative and extensible. Built on top of LangChain ⚡️☆77Updated last year
- Chat with your Documents(PDF, TXT, DOCX, ODT, PPTX etc), Websites and Youtube Chat too!, CSV files. Uses langchain, Ollama, Groq, Gemini,…☆55Updated last year