mikhail-barg / huge-file-processorLinks
An utility to randomize and split really huge (100+ GB) text files
☆21Updated 9 years ago
Alternatives and similar repositories for huge-file-processor
Users that are interested in huge-file-processor are comparing it to the libraries listed below
Sorting:
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆123Updated 5 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- WuManber text search/matching implementation using c#☆155Updated 3 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Updated 4 years ago
- A lightweight library for designing and executing workflows in .NET Core☆23Updated 7 years ago
- ☆18Updated 2 years ago
- Natural Language Processing Engine built with ML.NET☆26Updated 3 years ago
- Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and pr…☆20Updated 6 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 4 years ago
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 6 years ago
- Converts RTF (Rich Text Format) text to plain text, html and xml. Port from Jani Giannoudis's RTF Converter v1.5.0☆12Updated 7 years ago
- OpenNLP for .NET☆92Updated last month
- MathExtensions is a library for .NET with simple and useful extensions methods regarding various mathematical domains: combinatorics, seq…☆34Updated 5 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 6 years ago
- Natural Language Processing in .NET Core☆119Updated 3 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 9 months ago
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆20Updated 3 years ago
- Bridging the gap between IEnumerable and IDataReader for dealing with unstructured and loosely-structured data, plus fast ETL + SQL Bulk …☆10Updated 6 months ago
- Web Crawling and Scraping Framework☆12Updated 6 years ago
- Image Classification, Time Series, Transformer, Object Detection☆35Updated 2 years ago
- An improved method of locality-sensitive hashing for scalable instance matching. In this study, we propose a scalable approach for autom…☆14Updated 3 years ago
- Define your business rules in boo DSL, express flow chart in code with fluent interface☆37Updated 14 years ago
- Generic Graph Data Structures and Algorithms for .NET☆13Updated 4 years ago
- Inject deep copy constructors into C# types☆15Updated 3 years ago
- Ludwig is a toolbox that allows to train and test deep learning models without the need to write code.☆26Updated 6 years ago
- A .net rules engine based on the RETE algorithm☆31Updated 9 years ago
- C# (.net) Stomp Client.☆19Updated 11 years ago
- TF*IDF Term Frequency Inverse Document Frequency in C# .NET☆62Updated 4 years ago
- Parse sql using antlr.☆13Updated 2 years ago