A step-by-step C# implementation of the Docstrum algorithm
☆24Dec 13, 2020Updated 5 years ago
Alternatives and similar repositories for simple-docstrum
Users that are interested in simple-docstrum are comparing it to the libraries listed below.
- Tools for extracting figures, tables, text, etc. from a PDF document.☆35Nov 25, 2020Updated 5 years ago
- ☆71Apr 3, 2018Updated 8 years ago
- Document Layout Analysis resources repos for development with PdfPig.☆634Oct 1, 2023Updated 2 years ago
- Document Layout Analysis Projects☆23Sep 4, 2019Updated 6 years ago
- PAGE XML format collection for document image page content and more☆71Jan 16, 2026Updated 5 months ago
- A personalized translation tool integrating four APIs: Baidu Translate, Kingsoft PowerWord, Youdao Translate, and 360 Translate☆13May 23, 2025Updated last year
- This library builds a graph-representation of the content of PDFs. The graph is then clustered, resulting page segments are classified an…☆23Sep 11, 2020Updated 5 years ago
- convert PubLayNet data into METS/PAGE-XML☆10Mar 17, 2020Updated 6 years ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Feb 4, 2022Updated 4 years ago
- RUN LENGTH SMOOTHING ALGORITHM(RLSA) is a method mainly used for block segmentation and text discrimination. It helps to extract the nece…☆24Jun 21, 2022Updated 4 years ago
- OCR-D post-correction with encoder-attention-decoder LSTMs☆13May 1, 2025Updated last year
- C# version of ddddocr☆16Mar 3, 2024Updated 2 years ago
- ICDAR 2021 Competition on Scientific Literature Parsing☆35Aug 20, 2020Updated 5 years ago
- SLUB Document Classification and Similarity Analysis☆10Aug 31, 2023Updated 2 years ago
- GloSAT Historical Measurement Table Dataset☆11Dec 3, 2025Updated 7 months ago
- Specification of the @OCR-D technical architecture, interface definitions and data exchange format(s)☆17Sep 18, 2025Updated 9 months ago
- METS 1.x and METS 2 schemas☆26May 28, 2025Updated last year
- Grobid module for superconductor material and properties extraction☆23May 17, 2025Updated last year
- OCR-D post-correction module based on weighted finite-state transducers☆11Jan 13, 2024Updated 2 years ago
- Deep learning based page layout analysis☆197Apr 24, 2019Updated 7 years ago
- OCR-D compliant toolset for optical layout recognition on historical german-language documents published in Brazil☆11Sep 24, 2021Updated 4 years ago
- Training data from "Hauptphase I" of project "Digitalisierung historischer deutscher Zeitungen"☆12Dec 17, 2021Updated 4 years ago
- Rust bindings for the Ghostscript PS/PDF interpreter library☆10Jan 14, 2018Updated 8 years ago
- A workflow system for Natural Language Processing.☆21Oct 17, 2019Updated 6 years ago
- Easy to use PDF CLI tool powered by PDFium and go-pdfium☆35Jun 11, 2026Updated 3 weeks ago
- Automated listing of repos in GitHub with XML files containing teiHeader. Find a project using TEI today!☆17Updated this week
- the way to write llvm pass without llvm framework.☆11Feb 9, 2021Updated 5 years ago
- OCR-D wrapper for detectron2 based segmentation models☆16May 1, 2025Updated last year
- An Editor for creating simple or complex OCR workflows☆17Jun 13, 2024Updated 2 years ago
- Repository to use/train segmentation models for document layout analysis☆19Jan 13, 2022Updated 4 years ago
- Convert PAGE (v. 2019) to ALTO (v. 2.0 - 4.2)☆17Jun 5, 2026Updated 3 weeks ago
- Rust library for extracting data from HTML tables.☆13Mar 4, 2024Updated 2 years ago
- Haversine distance between two points☆13Jun 20, 2023Updated 3 years ago
- DL models that take a document image file as input, locate the position of paragraphs, lines, images, etc. with their labels and confiden…☆26Dec 31, 2020Updated 5 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 5 years ago
- iOS app downloader in TypeScript☆38Aug 3, 2023Updated 2 years ago
- A complete agency API program.☆12Apr 27, 2017Updated 9 years ago
- ☆11May 19, 2026Updated last month
- ☆11Nov 13, 2020Updated 5 years ago