BobLd / PdfPigMLNetBlockClassifier
Proof of concept for training a simple region classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block on a PDF document page as one of title, text, list, table, or image.
☆26Updated 4 years ago
Alternatives and similar repositories for PdfPigMLNetBlockClassifier:
Users who are interested in PdfPigMLNetBlockClassifier are comparing it to the libraries listed below.
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆31Updated 2 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆33Updated 3 years ago
- Natural Language Processing Engine built with ML.NET☆25Updated 2 years ago
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 5 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- Word2Vec.Net-CSharp☆18Updated 5 years ago
- .NET Core Proxy library based on HttpClient that works with FreeProxyList.net☆19Updated 2 years ago
- Inject deep copy constructors into C# types☆13Updated 2 years ago
- Converts docx to html☆12Updated last month
- Generic Graph Data Structures and Algorithms for .NET☆10Updated 2 years ago
- PMS full-text search engine with no external dependencies written in C#☆22Updated last year
- Reed-Solomon Erasure Coding in C#/.NET☆26Updated last year
- Open source project for BERT Tokenizers in C#.☆83Updated last year
- A number plate recognition library for .NET based off javaanpr☆14Updated 5 months ago
- C# Library for converting PDF files to Searchable PDF Files☆28Updated 7 months ago
- PDF viewer and editor toolset.☆37Updated 4 months ago
- Easily add searching to your apps☆14Updated this week
- .Net Implementation for google word2vec tools.☆37Updated 2 years ago
- Free PDF renderer for .NET☆63Updated this week
- Extensions methods for PDFSharp to simplify common operations, including image extraction.☆34Updated 2 years ago
- ☆15Updated last year
- Create binding from .Net to JavaScript☆15Updated last month
- A differencing utility for the Windows desktop written in C#.☆46Updated 7 months ago
- .NET assembly class responsible for converting OpenXml based documents into corresponding dotnet code☆39Updated 11 months ago
- Semantic-Fleet serves as a specialized extension hub for the Semantic-Kernel ecosystem. It houses a diverse array of connectors designed …☆29Updated last year
- A .NET Standard port of the Argotic Syndication Framework for RSS / ATOM / RSD / OPML / APML / BlogML / Yahoo Media / iTunes☆16Updated this week
- .Net Or-Mapper working with dynamically implemented abstract Classes☆14Updated last year
- A rules engine. Set values of the properties of an object graph when other properties change☆32Updated 4 years ago
- A WPF TreeView that actually supports virtualization.☆18Updated 5 months ago