Proof of concept for training a simple region classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.
☆29 · Mar 16, 2020 · Updated 6 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier
Users that are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below.
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig). ☆35 · Feb 4, 2022 · Updated 4 years ago
- Port of PragmaticSegmenter for sentence boundary detection ☆39 · Sep 21, 2021 · Updated 4 years ago
- Machine Learning Intrusion Detection and Network Monitor ☆14 · Aug 30, 2024 · Updated last year
- PdfDocumentParser is a .NET toolset for building PDF parsers. ☆45 · Feb 5, 2026 · Updated 4 months ago
- Extract tables from PDF files (port of tabula-java) ☆212 · May 4, 2026 · Updated last month
- A docx renderer that outputs Markdown-formatted text into Microsoft Word .docx documents ☆19 · Nov 25, 2023 · Updated 2 years ago
- Converts docx to html ☆14 · Jun 22, 2026 · Updated last week
- Service discovery kit for .NET microservices ☆17 · Feb 14, 2017 · Updated 9 years ago
- KeePass Import Plugin for MSDN and TechNet key export xml files. ☆19 · Dec 20, 2017 · Updated 8 years ago
- Cross-platform OCR processing using PaddleOCR ONNX models. Based on RapidAI's RapidOCR ☆70 · May 27, 2026 · Updated last month
- A C# wrapper for the WORLD vocoder ☆24 · Jun 21, 2021 · Updated 5 years ago
- Artifacts for the WOOT publication. ☆17 · Oct 3, 2020 · Updated 5 years ago
- Compress PDF documents with the help of the ITextSharp and FreeImage third-party libs. Excellent point to start and customize for your particular… ☆22 · Oct 31, 2017 · Updated 8 years ago
- Super-simple validation for Minimal APIs in ASP.NET Core 7 or later, with short-circuit endpoint execution. ☆20 · Jun 2, 2023 · Updated 3 years ago
- A PDF file format for ImageProcessor ☆15 · Jun 25, 2020 · Updated 6 years ago
- ☆39 · Jun 6, 2018 · Updated 8 years ago
- PDF File Analyzer (Version 3.0 for VS 2022 .NET 6.0) ☆44 · May 28, 2022 · Updated 4 years ago
- Clever Stocker ☆11 · Jan 12, 2022 · Updated 4 years ago
- MJPEG Streaming (Screen-WinForms, Camera-UWP) ☆14 · Sep 28, 2017 · Updated 8 years ago
- Sample to generate a PDF using .NET Core and Node / Puppeteer ☆16 · Feb 27, 2018 · Updated 8 years ago
- Control backlight brightness via sysfs ☆18 · Apr 22, 2022 · Updated 4 years ago
- Fast load and parse of PDF format in C# ☆47 · Feb 11, 2020 · Updated 6 years ago
- Bad link reporter for GitHub repositories ☆13 · Mar 25, 2024 · Updated 2 years ago
- Genzor is an experimental library ideally suited for generating files spanning multiple folders, using Blazor component model to generati… ☆16 · Oct 7, 2021 · Updated 4 years ago
- JPEG decoder, encoder and optimizer implemented in C#. ☆34 · Apr 15, 2024 · Updated 2 years ago
- AEM Author and Publish Docker Containers ☆19 · Jun 28, 2017 · Updated 9 years ago
- ☆11 · Jan 9, 2025 · Updated last year
- ☆40 · Jun 22, 2026 · Updated last week
- Spotlight-like search bar for Windows 10 ☆11 · Oct 5, 2023 · Updated 2 years ago
- Rust SDK for Rigetti Quantum Cloud Services (QCS) ☆15 · May 27, 2026 · Updated last month
- Batch PDF operations for Swift ☆14 · Jun 22, 2026 · Updated last week
- A Paint.NET FileType plugin that loads and saves Paint Shop Pro images. ☆10 · Jul 24, 2024 · Updated last year
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and macOS! ☆81 · Mar 16, 2026 · Updated 3 months ago
- This tool provides a fast and efficient way to convert text into vector embeddings and store them in the Qdrant search engine. Built with… ☆15 · Mar 31, 2023 · Updated 3 years ago
- iTunes "itl" database parsing ☆16 · Jan 30, 2020 · Updated 6 years ago
- Open Source Cross Platform EventStore (see https://eventstore.org/) Solution based on .NET Core 3.1 and Netty (https://github.com/cuteant/d… ☆13 · Aug 29, 2020 · Updated 5 years ago
- Demonstrates how to use ML.NET to implement some Machine Learning use cases in UWP. ☆85 · Feb 27, 2022 · Updated 4 years ago
- Lightweight fluent wrapper over HttpClient to make REST calls easier ☆66 · Updated this week
- A C# source generator to automatically generate access methods for embedded resources. ☆19 · Apr 26, 2025 · Updated last year