mindee / notebooks
Home to Jupyter notebooks for Mindee OSS projects
☆17 Updated 2 months ago
Alternatives and similar repositories for notebooks
Users that are interested in notebooks are comparing it to the libraries listed below
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 Updated 4 months ago
- DFKI Layout Detection for OCR-D ☆47 Updated 4 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆73 Updated this week
- Apply different text recognition services to images of handwritten documents. ☆184 Updated 2 years ago
- Full-fledged Data Exploration Tool for Label Studio ☆48 Updated last year
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library ☆13 Updated 4 years ago
- ☆21 Updated 3 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆52 Updated 6 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆38 Updated last year
- Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins … ☆59 Updated last year
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea… ☆56 Updated 7 months ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆37 Updated last year
- Streamlit component for Jina neural search ☆42 Updated 3 years ago
- ☆20 Updated 4 years ago
- Dvc + Streamlit = ❤️ ☆40 Updated last year
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆29 Updated 2 years ago
- ☆23 Updated 2 years ago
- Pixano Elements - Re-usable web components dedicated to data annotation tasks. ☆41 Updated 2 years ago
- Matplotlib Image labeller for classifying images ☆11 Updated 2 months ago
- Awesome Orchest projects, both official and submitted by the community. ☆25 Updated 2 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks ☆21 Updated 3 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆38 Updated 6 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 3 years ago
- Search PDFs using Jina, DocArray and Jina Hub ☆56 Updated 3 years ago
- Repository for deepdoctection tutorial notebooks ☆46 Updated 3 months ago
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals … ☆15 Updated last year
- Automatic Machine Learning (AutoML) for Wave Apps ☆32 Updated 2 years ago
- Intelligence Task Ontology (ITO) ☆74 Updated 2 years ago
- A library to encode text as DNA and decode DNA to text. ☆13 Updated 2 years ago
- Google Colab Demo of CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents ☆47 Updated 3 years ago