NanoNets / ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
☆80Updated 2 years ago
Alternatives and similar repositories for ocr-python:
Users that are interested in ocr-python are comparing it to the libraries listed below
- Built with Fast Dash, this app uses Embedchain, which abstracts the entire process of loading and chunking datasets, creating embeddings,…☆66Updated last year
- A chatApp with OpenAssistant API☆68Updated last year
- A Chat App built with embedchain and streamlit☆41Updated last year
- DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search …☆115Updated last year
- Data extraction with LLM on CPU☆266Updated last year
- ☆18Updated 2 years ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.☆27Updated 2 years ago
- Build an AI chatbot with website context retrieved from a vector store like LanceDB.☆84Updated last year
- React app that highlights relevant segments in a PDF document based on user questions using natural language processing and AI context se…☆10Updated last year
- A Streamlit-powered application that provides a user-friendly interface for editing PDF documents.☆57Updated 5 months ago
- Python Streamlit web app utilizing OpenAI (GPT4) and LangChain LLM tools with access to Wikipedia, DuckDuckgo Search, and a ChromaDB with…☆72Updated last year
- 🤖💬💡Chat with any document using conversational AI! Our project allows you to easily ask questions and get answers from any document. B…