mindee / notebooksLinks
Home to jupyter notebooks for Mindee OSS projects
☆17Updated 5 months ago
Alternatives and similar repositories for notebooks
Users that are interested in notebooks are comparing it to the libraries listed below
Sorting:
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces☆39Updated 7 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip…☆77Updated this week
- DFKI Layout Detection for OCR-D☆47Updated 7 months ago
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients☆40Updated 2 years ago
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆15Updated 4 years ago
- Tools for interactive visual exploration of semantic embeddings.☆39Updated last year
- Apply different text recognition services to images of handwritten documents.☆187Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models"☆38Updated 2 years ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Updated last year
- Generate beautiful, testable documentation with Jupyter Notebooks☆21Updated 3 years ago
- Streamlit component for Jina neural search☆42Updated 4 years ago
- ☆55Updated last year
- Dvc + Streamlit = ❤️☆40Updated 2 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- ☆20Updated 4 years ago
- ☆23Updated 2 years ago
- A streamlit component to embed Disqus in your applications.☆10Updated 4 years ago
- ☆21Updated 3 years ago
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea…☆58Updated 9 months ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents☆47Updated 4 years ago
- A Unet based deeplearning model to line/box/spurious artifacts from text images. Unsupervised training.☆60Updated 6 years ago
- Streamlit component for invoice document labeling☆61Updated 2 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Updated 4 years ago
- Custom recipe and utilities for document processing☆200Updated 3 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution.☆63Updated this week
- Web App Capable of Predicting Next Word Using BERT☆14Updated 3 years ago
- Remove exact and approximate duplicates from your dataset in FiftyOne!☆18Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.☆21Updated last year
- A Streamlit component integrating Label Studio Frontend in Streamlit applications☆79Updated last year