bhimrazy / receipt-ocr
Efficient OCR engine for receipt image processing using Python, FastAPI, and Tesseract
☆81 Updated 5 months ago
Alternatives and similar repositories for receipt-ocr
Users who are interested in receipt-ocr are comparing it to the libraries listed below.
- Perform optical character recognition on receipts ☆74 Updated last year
- OnnxTR a docTR (Document Text Recognition) library Onnx pipeline wrapper - for seamless, high-performing & accessible OCR ☆109 Updated this week
- receipt parsing using donut model, next we will add using LLM + OCR or VLM ☆13 Updated 10 months ago
- ☆13 Updated 7 months ago
- convert RBC PDF statements to CSV ☆41 Updated last month
- Data extraction with Donut ML model ☆57 Updated 9 months ago
- ☆115 Updated last week
- Simple package to extract text with coordinates from programmatic PDFs ☆122 Updated last month
- Python-tesseract is an optical character recognition (OCR) tool for python ☆140 Updated 6 years ago
- Web scraper with a simple REST API living in Docker and using a Headless browser and Readability.js for parsing. ☆252 Updated last month
- Running Docling as an API service ☆376 Updated this week
- Receipt-Information-Extraction ☆16 Updated 3 weeks ago
- A python library to define and validate data types in Docling. ☆134 Updated this week
- Automatic information extraction from identity card with ocr ☆107 Updated last year
- ☆36 Updated 2 years ago
- Extract structured data from PDF invoices ☆1,965 Updated this week
- Fast Real-time Object Detection with High-Res Output https://x.com/_akhaliq/status/1840213012818329826 https://x.com/githubprojects/statu… ☆555 Updated last month
- ☆100 Updated last month
- Repository mentioned in https://youtu.be/KE4xEzFGSU8 ☆38 Updated last year
- Materials for the Ultimate Hybrid Search Workshop ☆36 Updated 5 months ago
- YOLOv11 trained on DocLayNet dataset. ☆40 Updated 6 months ago
- VLM driven tool that processes surveillance videos, extracts frames, and generates insightful annotations using a fine-tuned Florence-2 V… ☆110 Updated 8 months ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included. ☆10 Updated 2 weeks ago
- Receipt OCR using CURL, JavaScript/Node.Js, Java, C# VB.NET, PHP, Python, etc ☆89 Updated last year
- faster-whisper as serverless endpoint ☆98 Updated last week
- Docscan is a document scanner. Take a photo of your documents and frame it. ☆101 Updated 6 months ago
- Pybind11 bindings for Whisper.cpp ☆57 Updated 2 weeks ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis ☆106 Updated 2 months ago
- Convenience scripts to finetune (chat-)LLaMa3 and other models for any language ☆305 Updated 11 months ago
- Phi4 Multimodal Instruct - OpenAI endpoint and Docker Image for self-hosting ☆36 Updated 2 months ago