mikhail-barg / huge-file-processor
An utility to randomize and split really huge (100+ GB) text files
☆20Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for huge-file-processor
- Image Classification, Time Series, Transformer, Object Detection☆26Updated last year
- Natural Language Processing Engine built with ML.NET☆25Updated last year
- .NET wrapper of spaCy (Industrial-strength NLP)☆18Updated 5 years ago
- Advanced Utility Libs☆23Updated 4 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆33Updated 3 years ago
- Inject deep copy constructors into C# types☆13Updated 2 years ago
- A simple synchronization and/or backup system☆15Updated 2 years ago
- A compiler which can transform the convention of C# code to the convention of Python☆19Updated 5 years ago
- Generic Graph Data Structures and Algorithms for .NET☆10Updated 2 years ago
- CRFSharp is Conditional Random Fields implemented by .NET(C#), a machine learning algorithm for learning from labeled sequences of exampl…☆122Updated 4 years ago
- Ludwig is a toolbox that allows to train and test deep learning models without the need to write code.☆26Updated 5 years ago
- WuManber text search/matching implementation using c#☆154Updated 2 years ago
- Microcharts.Matplotlib is a wrapper of Microcharts for Data Science and Machine Learning☆22Updated last year
- A simple NER implementation using a DistilBERT based model with ML.NET☆13Updated 3 years ago
- C# implementation of Peter Norvig’s spelling corrector☆10Updated last year
- .NET bindings for native numerical computing☆31Updated 6 years ago
- open source search projects☆80Updated last year
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- Web Crawling and Scraping Framework☆12Updated 5 years ago
- MessageCommunicator is a library + testing Gui for message-based communication. The library can be integrated in .NET Standard 2, .NET 6.…☆13Updated 10 months ago
- Memory mapped data structures: dictionary, array and list☆26Updated 4 years ago
- .NET Core Proxy library based on HttpClient works with FreeProxyList.net☆19Updated last year
- Word2Vec.Net-CSharp☆18Updated 5 years ago
- 100% Native C# Key Value Store High Performance NoSQL Database with ACID Transaction☆40Updated 6 years ago
- OCRonet is optical character recognition (OCR) and document analysis system based on Convolutional Neural Networks (LeNet-5) and OCRopus.☆21Updated 5 years ago
- ☆19Updated last month
- Python for .NET is a package that gives Python programmers nearly seamless integration with the .NET Common Language Runtime (CLR) and pr…☆20Updated 5 years ago