bhimrazy / receipt-ocr
Efficient OCR engine for receipt image processing using Python, FastAPI, and Tesseract
☆51Updated last month
Alternatives and similar repositories for receipt-ocr:
Users that are interested in receipt-ocr are comparing it to the libraries listed below
- Receipt OCR using CURL, JavaScript/Node.Js, Java, C# VB.NET, PHP, Python, etc☆86Updated last year
- Advanced receipt OCR and analysis using PaddleOCR, GPT-3.5-turbo, Plotly, and Gradio for interactive visualizations.☆15Updated 8 months ago
- A fast, light, open chat UI with full tool use support across many models☆202Updated 3 months ago
- Perform optical character recognition on receipts☆69Updated last year
- Create 🐍 Python AI Actions and 🤖 Automations, and deploy & operate them anywhere☆494Updated last week
- A free tool to OCR a PDF and add a text "layer" in the original file, making a searchable PDF. Use only open source tools. Please tip!☆279Updated 11 months ago
- img2table is a table identification and extraction Python Library for PDF and images, based on OpenCV image processing☆631Updated 2 months ago
- Simplifies the retrieval, extraction, and training of structured data from various unstructured sources.☆119Updated this week
- Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.☆134Updated this week
- Lightweight library for scraping web-sites with LLMs☆946Updated last week
- Draft42 - Streamlit chatbot with function calling☆31Updated 9 months ago
- Spider ported to Python☆63Updated 3 months ago
- A modern Python REST client for Apache Tika server☆13Updated this week
- A whatsapp client library for python using the new WhatsApp cloud API.☆27Updated last month
- A collection of resources for all your Anvil needs. Including awesome projects from the community, demos, blogs and tutorials.☆39Updated last year
- Web application that converts audio and video to text using AI, supporting various formats and self-hosting.☆65Updated last week
- hotpdf is a fast PDF parsing library to extract text and find text within PDF documents built on top of pdfminer.six☆182Updated last month
- LLM Based NLP Library.☆83Updated 5 months ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆253Updated last year
- Unattended Lightweight Text Classifiers with LLM Embeddings☆182Updated 4 months ago
- Detect and extract tables to markdown and csv☆715Updated last week
- Python example of how to engage with the https://podcastindex.org/ APIs☆9Updated 4 years ago
- 🏭 PDF text extraction pipeline: self-hosted, local-first, Docker-based☆306Updated last year
- A simple feedforward neural network coded from scratch.☆11Updated 5 months ago
- Python library for the instruction and reliable validation of structured outputs (JSON) of Large Language Models (LLMs) with Ollama and P…☆71Updated last month
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V…☆99Updated 4 months ago
- Python module for communicating with the Veryfi OCR API.☆26Updated 4 months ago
- Turn natual language into commands. Your CLI tasks, now as easy as a conversation. Run it 100% offline, or use OpenAI's models.☆55Updated 6 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆69Updated this week
- This code performs PDF layout analysis and optical character recognition (OCR) using the layoutparser library and Tesseract OCR Engine. I…☆13Updated last year