mikhail-barg / huge-file-processorLinks
An utility to randomize and split really huge (100+ GB) text files
☆21Updated 8 years ago
Alternatives and similar repositories for huge-file-processor
Users that are interested in huge-file-processor are comparing it to the libraries listed below
Sorting:
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 5 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆38Updated 4 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 4 years ago
- Natural Language Processing Engine built with ML.NET☆26Updated 2 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 6 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- Palaso Library: A set of .Net libraries useful for developers of Language Software.☆43Updated this week
- Simple application to full-text searching in file system☆19Updated 5 years ago
- C# library for fast embeddings projection using Uniform Manifold Approximation and Projection☆47Updated 11 months ago
- A lightweight library for designing and executing workflows in .NET Core☆23Updated 7 years ago
- MessageCommunicator is a library + testing Gui for message-based communication. The library can be integrated in .NET apps (current .NET …☆14Updated 9 months ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 5 months ago
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆20Updated 2 years ago
- Bridging the gap between IEnumerable and IDataReader for dealing with unstructured and loosely-structured data, plus fast ETL + SQL Bulk …☆10Updated 2 months ago
- MathExtensions is a library for .NET with simple and useful extensions methods regarding various mathematical domains: combinatorics, seq…☆34Updated 5 years ago
- Image Classification, Time Series, Transformer, Object Detection☆33Updated last year
- WuManber text search/matching implementation using c#☆154Updated 3 years ago
- 100% Native C# Key Value Store High Performance NoSQL Database with ACID Transaction☆42Updated 7 years ago
- Inject deep copy constructors into C# types☆15Updated 3 years ago
- Windows, Linux, Mono C# port of the SQLite library☆43Updated 6 years ago
- Fork of http://expressiontree.codeplex.com/☆18Updated 12 years ago
- Configurable data bridge for permanent ETL jobs☆22Updated 2 years ago
- ☆18Updated 2 years ago
- Pure C#.Net SQLite provider and supports REAL memory database for .net standard 2.0.☆16Updated 8 years ago
- dotnet watch we always wanted☆103Updated 3 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- Simple FIFO and LRU cache in C#☆35Updated 9 months ago
- Generic Graph Data Structures and Algorithms for .NET☆12Updated 3 years ago
- Fast Word Segmentation with Triangular Matrix☆82Updated 3 years ago