Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
☆28Mar 16, 2020Updated 6 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier
Users that are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Feb 4, 2022Updated 4 years ago
- Document Layout Analysis resources repos for development with PdfPig.☆634Oct 1, 2023Updated 2 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Sep 21, 2021Updated 4 years ago
- Machine Learning Intrusion Detection and Network Monitor☆14Aug 30, 2024Updated last year
- PdfDocumentParser is a .NET toolset for building PDF parsers.☆45Feb 5, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Extract tables from PDF files (port of tabula-java)☆210Mar 17, 2025Updated last year
- A docx renderer allows outputing Markdown-formatted text into Microsoft Word .docx documents☆19Nov 25, 2023Updated 2 years ago
- Converts docx to html☆14Apr 20, 2026Updated last week
- A step-by-step C# implementation of the Docstrum algorithm☆24Dec 13, 2020Updated 5 years ago
- Service discovery kit for .NET microservices☆17Feb 14, 2017Updated 9 years ago
- MyScript Interactive Ink examples and user interface reference implementation for Windows WPF.☆14Mar 30, 2026Updated 3 weeks ago
- KeePass Import Plugin for MSDN and TechNet key export xml files.☆19Dec 20, 2017Updated 8 years ago
- C# Library for converting PDF files to Searchable PDF Files☆30Jun 7, 2024Updated last year
- Artifacts for the WOOT publication.☆18Oct 3, 2020Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Use the YOLO v3 (ONNX) model for object detection in C# using ML.Net☆21Jul 17, 2021Updated 4 years ago
- Compress PDF documents with help of ITextSharp and FreeImage third party libs. Excellent point to start and customize for your particular…☆22Oct 31, 2017Updated 8 years ago
- The Wacom Ink SDK for devices is designed as one library - Common Device Library (CDL) - that can handle different types of pen or touch …☆14Jan 3, 2025Updated last year
- A PDF fileformat for ImageProcessor☆15Jun 25, 2020Updated 5 years ago
- Mass Downloader For the .Net Framework which allows you do download .Net Framework source code in batch mode☆19Apr 21, 2017Updated 9 years ago
- ☆39Jun 6, 2018Updated 7 years ago
- PDF File Analyzer (Version 3.0 for VS 2022 .NET 6.0)☆43May 28, 2022Updated 3 years ago
- MyScript Interactive Ink examples and user interface reference implementation for Windows UWP.☆18Mar 30, 2026Updated 3 weeks ago
- This sample app is a .NET/Winform tool to test making REST calls for Outlook/Exchange data, using either the Microsoft Graph SDK (client …☆19Nov 17, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Clever Stocker☆11Jan 12, 2022Updated 4 years ago
- MJPEG Streaming (Screen-WinForms, Camera-UWP)☆14Sep 28, 2017Updated 8 years ago
- Sample to generate a PDF using NET Core and Node / Puppeteer☆16Feb 27, 2018Updated 8 years ago
- Control backlight brightness via sysfs☆18Apr 22, 2022Updated 4 years ago
- Fast load and parse of Pdf format in C#☆47Feb 11, 2020Updated 6 years ago
- Bad link reporter for GitHub repositories☆13Mar 25, 2024Updated 2 years ago
- JPEG decoder, encoder and optimizer implemented in C#.☆34Apr 15, 2024Updated 2 years ago
- AEM Author and Publish Docker Containers☆19Jun 28, 2017Updated 8 years ago
- Rust SDK for Rigetti Quantum Cloud Services (QCS)☆14Apr 22, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A Paint.NET FileType plugin that loads and saves Paint Shop Pro images.☆10Jul 24, 2024Updated last year
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆79Mar 16, 2026Updated last month
- DocNetExtended is a small extension library built upon the DocNet library, designed to extract text in a readable order from PDFs☆10Nov 12, 2021Updated 4 years ago
- Demonstrates how to use ML.NET to implement some Machine Learning use cases in UWP.☆86Feb 27, 2022Updated 4 years ago
- ☆12May 5, 2015Updated 10 years ago
- Windows ink samples (UWP/WPF/Windows Forms)☆16Jun 22, 2018Updated 7 years ago
- A media player.☆11Apr 1, 2026Updated 3 weeks ago