trufanov-nok / shuf-t
This application shuffles the lines of an input file, optionally skipping the header. It is optimized for files larger than the available RAM.
☆25Updated 8 years ago
Alternatives and similar repositories for shuf-t
Users who are interested in shuf-t compare it to the libraries listed below.
- Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016☆25Updated 6 years ago
- C++ program for finding strings that are over-represented in one of two texts☆17Updated 7 years ago
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 11 years ago
- Samsung Natural Language Processing Pipeline (basically for Russian language): morphology, dependency parser and much more☆59Updated 4 years ago
- A parallel IRWLS library to solve SVMs and budgeted SVMs☆59Updated 7 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Updated 8 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- A high-performance CUDA port of the CBOW model of word2vec☆43Updated 10 years ago
- An efficient character based RNN☆91Updated 6 years ago
- Leon Bottou's SGD☆33Updated 13 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆127Updated 7 months ago
- Parallelizing word2vec in shared and distributed memory☆190Updated 2 years ago
- Recurrent Neural Network language modeling toolkit☆38Updated 11 years ago
- A deep, LSTM-based part of speech tagger and sentiment analyser using character embeddings instead of words. Compatible with Theano and T…☆91Updated 8 years ago
- cuda implementation of CBOW model (word2vec)☆117Updated 11 years ago
- A locality-sensitive hashing library☆46Updated 11 years ago
- NLP SENNA (http://ml.nec-labs.com/senna) interface to LuaJIT☆49Updated 10 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Reimplementation of deepwalk algorithm from https://github.com/phanein/deepwalk☆38Updated 9 years ago
- An autoencoder to calculate word embeddings as mentioned in Lebret/Collobert paper 2015☆74Updated 8 years ago
- Deep neural network based model for sequence to sequence classification☆76Updated 7 years ago
- Simple and light framework for building neural nets with different architectures 2.0 (Python)☆13Updated 9 years ago
- Official repository of QuickRank: A C++ suite of Learning to Rank algorithms.☆131Updated 6 years ago
- C++ implementation for Neural Network-based NLP, such as LSTM machine translation!☆87Updated 7 years ago
- ☆70Updated 10 years ago
- Sequential convolutional architectures for text classification☆29Updated 9 years ago
- Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015☆35Updated 3 years ago
- Slides/code for the Lisbon machine learning school 2017☆28Updated 7 years ago
- A Python framework for exploring distributional semantic models.☆85Updated 9 years ago
- Automatically exported from code.google.com/p/sofia-ml☆61Updated 4 years ago