mikhail-barg / huge-file-processor
An utility to randomize and split really huge (100+ GB) text files
☆20Updated 8 years ago
Alternatives and similar repositories for huge-file-processor
Users that are interested in huge-file-processor are comparing it to the libraries listed below
Sorting:
- Advanced Utility Libs☆24Updated 5 years ago
- Language model☆9Updated 9 years ago
- A BK tree implementation for fast fuzzy string matching☆28Updated 6 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 6 years ago
- Fast approximate strings search & spelling correction☆58Updated 3 years ago
- .Net Implementation for google word2vec tools.☆37Updated 2 years ago
- Configurable data bridge for permanent ETL jobs☆21Updated 2 years ago
- Inverted file indexing and retrieval optimized for short texts. Supports auto-suggest and query segment classification.☆34Updated last year
- Bridging the gap between IEnumerable and IDataReader for dealing with unstructured and loosely-structured data, plus fast ETL + SQL Bulk …☆10Updated 5 months ago
- A .net rules engine based on the RETE algorithm☆31Updated 8 years ago
- C# implementation of Peter Norvig’s spelling corrector☆10Updated 2 years ago
- Fast Word Segmentation with Triangular Matrix☆81Updated 3 years ago
- Set of .Net Libraries written in C# to create Listeners, Extractors, Writers and possibly more. These libraries allow you to (a) listen f…☆21Updated last year
- Continous LINQ is a .NET Framework 3.5 extension that builds on the LINQ query syntax to create continuous, self-updating result sets.☆9Updated 14 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆121Updated 4 years ago
- Observable Linq query implementation.☆36Updated 9 years ago
- .NET bindings for native numerical computing☆31Updated 7 years ago
- .NET Reflection Made Fast and Simple ⛺☆28Updated 4 years ago
- Fork of http://expressiontree.codeplex.com/☆18Updated 11 years ago
- Clone of NCalc, ported to netstandard2.0☆11Updated 7 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 5 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- Benchmarks for the InfoQ Article Implementation Strategies for the Repository Pattern with Entity Framework, Dapper, and Chain☆8Updated 3 years ago
- dotnet watch we always wanted☆102Updated 3 years ago
- NReco Recommender is a .NET port of Apache Mahout CF java engine (standalone, non-Hadoop version)☆38Updated 6 years ago
- WuManber text search/matching implementation using c#☆154Updated 2 years ago
- Static analysis of MSIL based on the analysis-net infrastructure☆19Updated 2 years ago
- Clone of sharphsql.codeplex.com at Commit 80132 (2012-10-02)☆13Updated 9 years ago
- A C# port of JavaFastPFOR☆54Updated last year
- A library of cross platform C# data structures. Generic B-tree written in C#, which can be replaced with NoSQL database stored in the mem…☆34Updated 7 years ago