A step-by-step C# implementation of the Docstrum algorithm
☆24Dec 13, 2020Updated 5 years ago
Alternatives and similar repositories for simple-docstrum
Users that are interested in simple-docstrum are comparing it to the libraries listed below
Sorting:
- Tools for extract figure, table, text, .. from a pdf document.☆33Nov 25, 2020Updated 5 years ago
- ☆70Apr 3, 2018Updated 7 years ago
- Document Layout Analysis resources repos for development with PdfPig.☆631Oct 1, 2023Updated 2 years ago
- NLP system for identifying patient housing status in Veteran Affairs☆12Feb 18, 2024Updated 2 years ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Feb 4, 2022Updated 4 years ago
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Sep 11, 2020Updated 5 years ago
- BoundaryNet - A Semi-Automatic Layout Annotation Tool☆24Dec 11, 2021Updated 4 years ago
- Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block…☆28Mar 16, 2020Updated 5 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Dec 31, 2020Updated 5 years ago
- Implementation of BertGrid : https://arxiv.org/abs/1909.04948☆30Apr 10, 2024Updated last year
- PAGE XML format collection for document image page content and more☆70Jan 16, 2026Updated last month
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Overcooked! 2 TAS Development Framework☆10Aug 18, 2023Updated 2 years ago
- Generative and Parametric design code: featuring Processing / Python / Javascript / HTML / CSS☆14Nov 4, 2020Updated 5 years ago
- ☆10Apr 20, 2023Updated 2 years ago
- A simple document layout analysis using Python-OpenCV☆127Aug 11, 2020Updated 5 years ago
- A collection of some awesome public projects about LLM-based Web Agents and Tools.☆12Apr 25, 2024Updated last year
- Elasticsearch provider for Examine in Umbraco v8☆12Jan 15, 2024Updated 2 years ago
- This is my speaker recognition implementation based on the x-vector system described in "X-Vectors: Robust DNN Embeddings for Speaker Rec…☆10Nov 3, 2022Updated 3 years ago
- R script for visualising patient ward movements as timelines☆13May 13, 2022Updated 3 years ago
- This project uses artificial intelligence technology to analyze video. Recognize video and audio for fragmentation into multiple clip sce…☆11Oct 3, 2018Updated 7 years ago
- Provides fully configure Visual Studio Solution for ORTools☆10Aug 30, 2019Updated 6 years ago
- # Supporting-Emergency-Room-Decision-Making-with-Relevant-Scientific-Literature #### Supervised by: Yassine Benajiba #### Course: Introdu…☆10Jan 19, 2018Updated 8 years ago
- Watsonx Assistant with Milvus as Vector Database☆12Mar 31, 2025Updated 11 months ago
- Reddit Data Science Project Ideas☆11Dec 28, 2019Updated 6 years ago
- ☆16Jul 5, 2019Updated 6 years ago
- Template Extraction from unstructured Wikipedia text using NLP techniques.☆41Jun 23, 2020Updated 5 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Aug 20, 2020Updated 5 years ago
- Gamera 4 for Python 3☆14May 16, 2025Updated 9 months ago
- Combination of the RapidFuzz library with Spacy PhraseMatcher☆11Sep 29, 2021Updated 4 years ago
- Using SepFormer☆10Feb 2, 2023Updated 3 years ago
- ☆11Feb 17, 2026Updated 2 weeks ago
- a simple tool to use Animoji/Memoji with a green screen☆13Jul 26, 2024Updated last year
- Robust and ready-to-use tasks and workflows for a variety of bioinformatics pipelines. Use Flyte and Union to orchestrate anything from v…☆12Apr 2, 2025Updated 11 months ago
- The missing funding framework for bootstrappers, indie hackers, and creators☆23Jan 16, 2026Updated last month
- win10 media ocr☆11Jun 22, 2021Updated 4 years ago
- Celery plugin to autoscale based on available CPU, memory, or other system attributes.☆11Dec 8, 2017Updated 8 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 3 months ago
- EMNLP 2022: Biomedical NER for the Enterprise with Distillated BERN2 and the Kazu Framework☆11Aug 29, 2024Updated last year