BobLd / PdfPigMLNetBlockClassifier
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a pdf document page as either title, text, list, table and image.
☆23Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for PdfPigMLNetBlockClassifier
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆31Updated 2 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆33Updated 3 years ago
- Cross-platform library to render pdf documents as images with PdfPig using SkiaSharp☆17Updated last week
- PMS full-text search engine with no external dependencies written in C#☆22Updated 11 months ago
- Free PDF renderer for .NET☆61Updated last week
- Natural Language Processing Engine built with ML.NET☆25Updated last year
- A .NET library to aid WebView2 control hosting, .NET/JavaScript interop and Html to Pdf Conversion☆26Updated this week
- A tool for detecting identifiable information in data sources (CSV, DICOM, Relational Database and MongoDB)☆14Updated last week
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆19Updated last year
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 5 years ago
- Text clustering algorithm, implemented in .NET☆23Updated last year
- Word2Vec.Net-CSharp☆18Updated 5 years ago
- NLTK library wrapper for .NET☆46Updated 3 years ago
- An IQueryable wrapper implementation for LiteDB with additional async extensions.☆22Updated last week
- Copy your files from a source to a destination. For instance you can copy your local files to your OneDrive account or an external disk.☆30Updated 4 months ago
- A rules engine. Set values of the properties of an object graph when other properties change☆32Updated 4 years ago
- PDF viewer and editor toolset.☆33Updated 2 months ago
- .NET libraries used in all Menees utilities.☆28Updated 5 months ago
- Reed-Solomon Erasure Coding in C#/.NET☆26Updated last year
- ONNX format parsing and manipulation in C#.☆24Updated 2 months ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- A fork of SharpNLP (http://sharpnlp.codeplex.com/), SharperNLP is also a collection of natural language processing tools written in C#.☆33Updated 13 years ago
- .Net Implementation for google word2vec tools.☆37Updated last year
- Simple application to full-text searching in file system