mikhail-barg / huge-file-processorLinks
An utility to randomize and split really huge (100+ GB) text files
☆21Updated 8 years ago
Alternatives and similar repositories for huge-file-processor
Users that are interested in huge-file-processor are comparing it to the libraries listed below
Sorting:
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Updated 4 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 6 years ago
- AIMLBot (Program#) is a small, fast, standards-compliant yet easily customizable .NET dll implementation of an AIML (Artificial Intellige…☆17Updated 8 years ago
- Web Crawling and Scraping Framework☆12Updated 6 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- A lightweight library for designing and executing workflows in .NET Core☆23Updated 7 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 4 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated 2 years ago
- Generic Graph Data Structures and Algorithms for .NET☆13Updated 3 years ago
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆20Updated 3 years ago
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 6 years ago
- Inject deep copy constructors into C# types☆15Updated 3 years ago
- Simple application to full-text searching in file system☆18Updated 5 years ago
- Natural Language Processing Engine built with ML.NET☆26Updated 3 years ago
- Set of .Net Libraries written in C# to create Listeners, Extractors, Writers and possibly more. These libraries allow you to (a) listen f…☆21Updated last year
- WuManber text search/matching implementation using c#☆155Updated 3 years ago
- Create binding from .Net to JavaScript☆17Updated 2 weeks ago
- Define your business rules in boo DSL, express flow chart in code with fluent interface☆37Updated 14 years ago
- Fast Word Segmentation with Triangular Matrix☆85Updated 4 years ago
- NReco Recommender is a .NET port of Apache Mahout CF java engine (standalone, non-Hadoop version)☆39Updated 7 years ago
- English name parser☆33Updated last year
- pdfOCR is an iText add-on to recognize and extract text in scanned documents and images. It can also convert them into fully ISO-complian…☆46Updated last week
- Image Classification, Time Series, Transformer, Object Detection☆34Updated 2 years ago
- .NET controls that display multiple sub-controls without creating a unique window handle for each child. Instead each child is drawn usin…☆23Updated 2 years ago
- .NET NLP library☆48Updated 2 years ago
- .NET library for handling asynchronous file system operations☆25Updated 8 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- C# implementation of the PageRank algorithm☆13Updated 7 years ago