Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
☆28Mar 16, 2020Updated 5 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier
Users that are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below
Sorting:
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Feb 4, 2022Updated 4 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Sep 21, 2021Updated 4 years ago
- Converts docx to html☆14Updated this week
- A docx renderer allows outputing Markdown-formatted text into Microsoft Word .docx documents☆19Nov 25, 2023Updated 2 years ago
- Service discovery kit for .NET microservices☆17Feb 14, 2017Updated 9 years ago
- PdfDocumentParser is a .NET toolset for building PDF parsers.☆45Feb 5, 2026Updated 3 weeks ago
- Cross-platform OCR processing using PaddleOCR ONNX models. Based on RapidAI's RapidOCR☆49Nov 22, 2025Updated 3 months ago
- Compress PDF documents with help of ITextSharp and FreeImage third party libs. Excellent point to start and customize for your particular…☆22Oct 31, 2017Updated 8 years ago
- A C# wrapper for the WORLD vocoder☆24Jun 21, 2021Updated 4 years ago
- .NET wrapper around Google's PDFium library☆27Jan 10, 2024Updated 2 years ago
- Extract tables from PDF files (port of tabula-java)☆203Mar 17, 2025Updated 11 months ago
- JPEG decoder, encoder and optimizer implemented in C#.☆34Apr 15, 2024Updated last year
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆79Sep 30, 2024Updated last year
- ☆10Feb 11, 2023Updated 3 years ago
- A fully customizable and modern Flutter media picker inspired by Instagram. Supports image/video selection, multi-pick, album browsing, a…☆11Feb 21, 2026Updated last week
- A Curated List of Game Network Programming Resources☆11Jun 15, 2019Updated 6 years ago
- This is my speaker recognition implementation based on the x-vector system described in "X-Vectors: Robust DNN Embeddings for Speaker Rec…☆10Nov 3, 2022Updated 3 years ago
- This project uses artificial intelligence technology to analyze video. Recognize video and audio for fragmentation into multiple clip sce…☆11Oct 3, 2018Updated 7 years ago
- A customizeable React feedback form with optional screenshot via screen capture and canvas editor based on material-ui.☆12Jan 22, 2026Updated last month
- MJPEG Streaming (Screen-WinForms, Camera-UWP)☆14Sep 28, 2017Updated 8 years ago
- This repo contains the souce code for the series of articles about JSInterop in Blazor WebAssembly, on Code Maze☆11Nov 5, 2020Updated 5 years ago
- Elasticsearch provider for Examine in Umbraco v8☆12Jan 15, 2024Updated 2 years ago
- A very lightweight editor to preview your changes in the XAML Path Markup☆11Apr 28, 2014Updated 11 years ago
- ☆39Jun 6, 2018Updated 7 years ago
- ☆10Sep 30, 2014Updated 11 years ago
- A collection of projects I did while at General Assembly Singapore - as part of Data Science Immersive☆11Jan 27, 2021Updated 5 years ago
- A media player.☆11Feb 1, 2026Updated 3 weeks ago
- Some frequently-used modules and helpers for your WPF applications☆16Jun 21, 2025Updated 8 months ago
- Bad link reporter for GitHub repositories☆13Mar 25, 2024Updated last year
- The CaltechDATA InvenioRDM source code☆10Jan 28, 2026Updated last month
- ☆12May 5, 2015Updated 10 years ago
- An example React Native Expo App supporting Android, iOS, Web, and Electron desktop apps☆11Jul 22, 2022Updated 3 years ago
- ☆11Updated this week
- DocNetExtended is a small extension library built upon the DocNet library, designed to extract text in a readable order from PDFs☆10Nov 12, 2021Updated 4 years ago
- A collection of extensions over Microsoft TPL Dataflow library.☆11Aug 26, 2019Updated 6 years ago
- MFM workshop project☆14Jan 25, 2021Updated 5 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13May 6, 2021Updated 4 years ago
- A ipcMain & ipcRenderer Electron example application☆11Nov 10, 2022Updated 3 years ago
- Uno Platform Fonts☆11Feb 6, 2026Updated 3 weeks ago