Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
☆28Mar 16, 2020Updated 6 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier
Users that are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Feb 4, 2022Updated 4 years ago
- Document Layout Analysis resources repos for development with PdfPig.☆635Oct 1, 2023Updated 2 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Sep 21, 2021Updated 4 years ago
- PdfDocumentParser is a .NET toolset for building PDF parsers.☆45Feb 5, 2026Updated 4 months ago
- Extract tables from PDF files (port of tabula-java)☆211May 4, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A docx renderer allows outputing Markdown-formatted text into Microsoft Word .docx documents☆19Nov 25, 2023Updated 2 years ago
- Converts docx to html☆14Updated this week
- A step-by-step C# implementation of the Docstrum algorithm☆24Dec 13, 2020Updated 5 years ago
- Service discovery kit for .NET microservices☆17Feb 14, 2017Updated 9 years ago
- KeePass Import Plugin for MSDN and TechNet key export xml files.☆19Dec 20, 2017Updated 8 years ago
- Cross-platform OCR processing using PaddleOCR ONNX models. Based on RapidAI's RapidOCR☆68May 27, 2026Updated last week
- .NET wrapper around Google's PDFium library☆27Jan 10, 2024Updated 2 years ago
- The OpenXML SDK extensions. Including libraries and tools.☆28Apr 29, 2026Updated last month
- Compress PDF documents with help of ITextSharp and FreeImage third party libs. Excellent point to start and customize for your particular…☆22Oct 31, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A PDF fileformat for ImageProcessor☆15Jun 25, 2020Updated 5 years ago
- Mass Downloader For the .Net Framework which allows you do download .Net Framework source code in batch mode☆19Apr 21, 2017Updated 9 years ago
- ☆39Jun 6, 2018Updated 8 years ago
- PDF File Analyzer (Version 3.0 for VS 2022 .NET 6.0)☆44May 28, 2022Updated 4 years ago
- 2016 SkillTree Day AOP Samples☆11Mar 22, 2017Updated 9 years ago
- MJPEG Streaming (Screen-WinForms, Camera-UWP)☆14Sep 28, 2017Updated 8 years ago
- Sample to generate a PDF using NET Core and Node / Puppeteer☆16Feb 27, 2018Updated 8 years ago
- Control backlight brightness via sysfs☆18Apr 22, 2022Updated 4 years ago
- Fast load and parse of Pdf format in C#☆47Feb 11, 2020Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Robust Library in C# for Similarity Estimation☆12Nov 30, 2023Updated 2 years ago
- Genzor is an experimental library ideally suited for generating files spanning multiple folders, using Blazor component model to generati…☆16Oct 7, 2021Updated 4 years ago
- AEM Author and Publish Docker Containers☆19Jun 28, 2017Updated 8 years ago
- A template to create your own literature survey engine☆14Jun 1, 2026Updated last week
- ☆11Jan 9, 2025Updated last year
- The Adobe Sign C# Client Library for integrating with Adobe Sign REST API's☆16Aug 19, 2022Updated 3 years ago
- Search Bar Spotlight-like for Windows 10☆11Oct 5, 2023Updated 2 years ago
- Batch PDF operations for Swift☆14Apr 20, 2026Updated last month
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆80Mar 16, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- iTunes "itl" database parsing☆16Jan 30, 2020Updated 6 years ago
- DocNetExtended is a small extension library built upon the DocNet library, designed to extract text in a readable order from PDFs☆10Nov 12, 2021Updated 4 years ago
- Open Source Cross Platform EventStore(see https://eventstore.org/) Solution based on .NET Core 3.1 and Netty(https://github.com/cuteant/d…☆13Aug 29, 2020Updated 5 years ago
- Demonstrates how to use ML.NET to implement some Machine Learning use cases in UWP.☆86Feb 27, 2022Updated 4 years ago
- Lightweight fluent wrapper over HttpClient to make REST calls easier☆66May 27, 2026Updated last week
- A C# source generator to automatically generate access methods for embedded resources.☆19Apr 26, 2025Updated last year
- Windows ink samples (UWP/WPF/Windows Forms)☆16Jun 22, 2018Updated 7 years ago