marieai / marie-ai
Integrate AI-powered Document Analysis Pipelines
☆62Updated this week
Related projects ⓘ
Alternatives and complementary repositories for marie-ai
- An unofficial Implementation of DocParser: End-to-end OCR-free Information Extraction from Visually Rich Documents☆33Updated last year
- ☆21Updated 8 months ago
- Run OCR, extract information from documents and classify them. In addition, annotate documents and build custom NLP and computer vision m…☆62Updated this week
- DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning mod…☆19Updated last year
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆22Updated 4 years ago
- Repository for deepdoctection tutorial notebooks☆39Updated 4 months ago
- Object Detection Model for Scanned Documents☆83Updated last year
- ☆16Updated 3 years ago
- YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis☆69Updated last month
- Pipeline for converting PDFs to raw text with PaddleOCR☆21Updated last year
- Document Image Binarization☆73Updated last month
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 3 months ago
- ReadingBank: A Benchmark Dataset for Reading Order Detection☆91Updated 2 months ago
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order.☆107Updated 6 months ago
- Implementation of the DocLLM paper for Llama models.☆12Updated last month
- Full-fledged Data Exploration Tool for Label Studio☆48Updated 7 months ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆46Updated 3 years ago
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆127Updated last week
- ☆11Updated 6 months ago
- DFKI Layout Detection for OCR-D☆47Updated 2 weeks ago
- ☆75Updated 2 years ago
- ☆15Updated 3 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆35Updated 11 months ago
- Table detection (TD) and table structure recognition (TSR) using Yolov5/Yolov8, cand you can get the same (even better) result compared w…☆41Updated 4 months ago
- DocLLM: A layout-aware generative language model for multimodal document understanding☆114Updated 10 months ago
- Example codebase for fine-tuning layoutLMv3 on DocVQA☆49Updated 2 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆46Updated 3 months ago
- An official implementation of paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"☆74Updated last year
- Doc2Graph transforms documents into graphs and exploit a GNN to solve several tasks.☆116Updated last year
- Text and Layout Document Image Understanding. LayoutLM☆21Updated 3 years ago