mindee / notebooks
Home to jupyter notebooks for Mindee OSS projects
☆17Updated 6 months ago
Alternatives and similar repositories for notebooks:
Users that are interested in notebooks are comparing it to the libraries listed below
- DFKI Layout Detection for OCR-D☆47Updated last week
- Use this tool to label forms, bounding boxes, and assigning types to annotations☆22Updated 4 months ago
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated this week
- Full-fledged Data Exploration Tool for Label Studio☆48Updated last year
- A library to encode text as DNA and decode DNA to text.☆12Updated 2 years ago
- A streamlit component to embed Disqus in your applications.☆10Updated 3 years ago
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆12Updated 3 years ago
- A zero-shot captcha solver.☆16Updated last year
- Repository for deepdoctection tutorial notebooks☆44Updated 4 months ago
- Python tools for Tesseract OCR training☆25Updated 2 years ago
- ☆15Updated 3 years ago
- CVPR 2022: Table Structure Recognition☆39Updated 3 years ago
- Utilities for working with videos☆13Updated 3 years ago
- ☆26Updated 2 years ago
- Document processing using transformers☆20Updated 2 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Updated last year
- A web app built with Streamlit that summarizes input text☆13Updated 4 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆59Updated 5 years ago
- Python and data science snippets on the command line☆21Updated 3 years ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆13Updated 9 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Pretrained mixed models to be used with Calamari.☆61Updated 6 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆67Updated 3 weeks ago
- Build interactive big data apps with Altair and Vega easily using Panel + VegaFusion.☆17Updated 3 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆36Updated last year
- ☆22Updated 4 years ago
- Form images from U.S. National Archives annotated with text bounding boxes, classes, relationships, and transcription.☆37Updated 2 years ago
- Accompanying code for the paper: Totally Looks Like - How Humans Compare, Compared to Machines, by Amir Rosenfeld, Markus D. Solbach and …☆38Updated 6 years ago
- Detect textlines in document images☆92Updated 10 months ago
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Updated 2 years ago