BobLd / PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
☆27Updated 4 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier:
Users that are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆31Updated 3 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆34Updated 3 years ago
- Natural Language Processing Engine built with ML.NET☆25Updated 2 years ago
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 5 years ago
- ONNX format parsing and manipulation in C#.☆28Updated last month
- Personal Assistant Engine built with ML.NET.☆18Updated 2 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- PdfDocumentParser is a .NET toolset for building PDF parsers.☆44Updated 8 months ago
- Create binding from .Net to JavaScript☆15Updated this week
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆19Updated 2 years ago
- Inject deep copy constructors into C# types☆13Updated 2 years ago
- Word2Vec.Net-CSharp☆18Updated 6 years ago
- Some C# Jupyter Notebooks covering ML.NET scenarios☆16Updated 5 years ago
- PMS full-text search engine with no external dependencies written in C#☆23Updated last year
- .NET library to easily create Voice Command Control feature.☆40Updated 2 years ago
- A tool for detecting identifiable information in data sources (CSV, DICOM, Relational Database and MongoDB)☆14Updated this week
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 3 years ago
- ☆23Updated 3 months ago
- Converts XAML code into C#☆43Updated 9 years ago
- Cross-platform library to render pdf documents as images with PdfPig using SkiaSharp☆24Updated this week
- .NET client for Qdrant vector database☆17Updated last year
- A .NET Standard port of the Argotic Syndication Framework for RSS / ATOM / RSD / OPML / APML / BlogML / Yahoo Media / iTunes☆16Updated this week
- Reed-Solomon Erasure Coding in C#/.NET☆26Updated last year
- A rules engine. Set values of the properties of an object graph when other properties change☆32Updated 4 years ago
- A lightweight C# Library to render PDFs with Google's Pdfium in .NET Core and .NET Framwork Apps.☆72Updated 4 years ago
- NLTK library wrapper for .NET☆47Updated 3 years ago
- ☆26Updated this week
- This is a simple C# demo for stable-diffusion.cpp with safe code only.☆15Updated 10 months ago
- GUI for Entity Framework Core migrations.☆23Updated 3 months ago
- ☆16Updated last year