mindee / notebooks
Home to Jupyter notebooks for Mindee OSS projects
☆17 Updated 3 months ago
Alternatives and similar repositories for notebooks
Users who are interested in notebooks are comparing it to the libraries listed below.
- EDS-PDF is a generic, pure-Python framework for text extraction from PDF documents. It provides the machinery to use rule- or machine-lea… ☆57 Updated 8 months ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆72 Updated this week
- Run tesseract with the tesserocr bindings with @OCR-D's interfaces ☆39 Updated 5 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆29 Updated 2 years ago
- DFKI Layout Detection for OCR-D ☆47 Updated 5 months ago
- ☆10 Updated 4 years ago
- Apply different text recognition services to images of handwritten documents. ☆187 Updated 2 years ago
- PromptCraft is a prompt perturbation toolkit from the character, word, and sentence levels for prompt robustness analysis. PyPI Package: … ☆19 Updated last year
- Async bulk data ingestion and querying in various document, graph and vector databases via their Python clients ☆39 Updated last year
- A demo that shows how to build a semantic search experience with Typesense's vector search feature and Instantsearch.js ☆26 Updated 2 years ago
- Generate beautiful, testable documentation with Jupyter Notebooks ☆21 Updated 3 years ago
- ☆23 Updated 2 years ago
- Self-exploratory Streamlit app to learn more about the Palmer penguins. ☆11 Updated 2 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts. ☆39 Updated 6 years ago
- Repository for deepdoctection tutorial notebooks ☆45 Updated 4 months ago
- Tools for interactive visual exploration of semantic embeddings. ☆38 Updated last year
- Document Image Binarization ☆78 Updated last year
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO… ☆52 Updated 7 months ago
- Logical structure analysis for visually structured documents ☆92 Updated 3 years ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 Updated 3 years ago
- A tool designed to extract numerical data from scanned historical weather documents. ☆13 Updated 10 months ago
- ☆55 Updated last year
- Streamlit games ☆14 Updated 2 years ago
- Source code for the paper "Post-OCR Document Correction with Large Ensembles of Character Sequence-to-Sequence Models" ☆37 Updated last year
- A library to encode text as DNA and decode DNA to text. ☆13 Updated 2 years ago
- Tool that does layout analysis and/or text recognition using tesseract and outputs the result in Page XML format ☆46 Updated 6 months ago
- The code for the Sales Dashboard demo ☆16 Updated 5 months ago
- Examples of vector DB indexing and query with various vector databases. ☆13 Updated 8 months ago
- ☆20 Updated 4 years ago
- A U-Net-based deep learning model to remove line/box/spurious artifacts from text images. Unsupervised training. ☆58 Updated 6 years ago