mikhail-barg / huge-file-processor
A utility to randomize and split really huge (100+ GB) text files
☆21Updated 9 years ago
Alternatives and similar repositories for huge-file-processor
Users who are interested in huge-file-processor compare it to the libraries listed below
- CRFSharp is a .NET (C#) implementation of Conditional Random Fields, a machine learning algorithm for learning from labeled sequences of exampl…☆123Updated 5 years ago
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 4 years ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 5 years ago
- Advanced Utility Libs☆24Updated 5 years ago
- Bridging the gap between IEnumerable and IDataReader for dealing with unstructured and loosely-structured data, plus fast ETL + SQL Bulk …☆10Updated 6 months ago
- WuManber text search/matching implementation using C#☆155Updated 3 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆39Updated 4 years ago
- Natural Language Processing Engine built with ML.NET☆26Updated 3 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆35Updated 9 months ago
- AIMLBot (Program#) is a small, fast, standards-compliant yet easily customizable .NET dll implementation of an AIML (Artificial Intellige…☆17Updated 8 years ago
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 6 years ago
- Image Classification, Time Series, Transformer, Object Detection☆35Updated 2 years ago
- .NET Core proxy library based on HttpClient that works with FreeProxyList.net☆20Updated 3 years ago
- A C# library to extract tabular data from PDFs (port of camelot Python version using PdfPig).☆35Updated 4 years ago
- Pure C#/.NET SQLite provider that supports a real in-memory database for .NET Standard 2.0.☆16Updated 8 years ago
- C# library for fast embeddings projection using Uniform Manifold Approximation and Projection☆50Updated 2 weeks ago
- Seq2SeqSharp is a tensor-based, fast and flexible encoder-decoder deep neural network framework written in .NET (C#). It has many highlighte…☆17Updated last year
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 6 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆22Updated 6 years ago
- Fork of http://expressiontree.codeplex.com/☆18Updated 12 years ago
- Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and pr…☆20Updated 6 years ago
- Natural Language Processing in .NET Core☆119Updated 3 years ago
- Some C# Jupyter Notebooks covering ML.NET scenarios☆16Updated 5 years ago
- Inject deep copy constructors into C# types☆15Updated 3 years ago
- Easily add searching to your apps☆14Updated this week
- .NET implementation of Google's word2vec tools.☆37Updated 3 years ago
- Machine Learning in .NET Core.☆39Updated 6 years ago
- Simple application for full-text search in the file system☆20Updated 5 years ago
- Desktop App for constraint optimization of linear programming problems☆13Updated 9 years ago
- Run and train Transformer-based Large Language Models (LLMs) natively in .NET using TorchSharp☆24Updated last year