mikhail-barg / huge-file-processor
A utility to randomize and split really huge (100+ GB) text files
☆21Updated 8 years ago
Alternatives and similar repositories for huge-file-processor
Users who are interested in huge-file-processor are comparing it to the libraries listed below
- CRFSharp is an implementation of Conditional Random Fields in .NET (C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- Bridging the gap between IEnumerable and IDataReader for dealing with unstructured and loosely-structured data, plus fast ETL + SQL Bulk …☆10Updated 3 months ago
- .NET Core proxy library based on HttpClient that works with FreeProxyList.net☆20Updated 2 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- Generic Graph Data Structures and Algorithms for .NET☆13Updated 3 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- Fork of http://expressiontree.codeplex.com/☆18Updated 12 years ago
- Natural Language Processing Engine built with ML.NET☆26Updated 2 years ago
- WuManber text search/matching implementation using C#☆155Updated 3 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 4 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Updated 4 years ago
- A .net rules engine based on the RETE algorithm☆31Updated 9 years ago
- pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-complian…☆46Updated last week
- A lightweight library for designing and executing workflows in .NET Core☆23Updated 7 years ago
- NReco Recommender is a .NET port of Apache Mahout CF java engine (standalone, non-Hadoop version)☆39Updated 7 years ago
- Parse sql using antlr.☆13Updated 2 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 6 years ago
- Converts RTF (Rich Text Format) text to plain text, html and xml. Port from Jani Giannoudis's RTF Converter v1.5.0☆12Updated 7 years ago
- C# Code Compiler | Code Analysis and Code Compiler for Eval-Expression.NET and Eval-SQL.NET☆12Updated last year
- Post compile tool using Fody to implement the double check locking pattern.☆25Updated 6 years ago
- Image Classification, Time Series, Transformer, Object Detection☆33Updated 2 years ago
- Windows, Linux, Mono C# port of the SQLite library☆43Updated 6 years ago
- BK-tree with Damerau-Levenshtein distance and Trie with Levenshtein distance☆19Updated 8 years ago
- MessageCommunicator is a library + testing Gui for message-based communication. The library can be integrated in .NET apps (current .NET …☆14Updated 10 months ago
- Web Crawling and Scraping Framework☆12Updated 6 years ago
- A collection of extensions over Microsoft TPL Dataflow library.☆11Updated 6 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 6 months ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Updated 3 years ago
- .NET controls that display multiple sub-controls without creating a unique window handle for each child. Instead each child is drawn usin…☆23Updated 2 years ago