trufanov-nok / shuf-t
This application shuffles the input file lines skipping (optionaly) the header. It's optimized for files bigger than available RAM.
☆25Updated 8 years ago
Alternatives and similar repositories for shuf-t:
Users that are interested in shuf-t are comparing it to the libraries listed below
- Recurrent Neural Network language modeling toolkit☆38Updated 11 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆28Updated 8 years ago
- A locality-sensitive hashing library☆46Updated 10 years ago
- ☆28Updated 5 years ago
- Example project presented at the Succinct Data Structure Tutorial at SIGIR 2016☆25Updated 6 years ago
- Creates models to classify documents into categories☆66Updated 7 years ago
- My solution for Kagge Allen AI Challenge ( 3rd place )☆19Updated 8 years ago
- Demo of random projections at BerlinBuzzwords 2015☆22Updated 4 years ago
- Leon Bottou's SGD☆33Updated 13 years ago
- Fast Word Clustering Software☆78Updated last week
- Simple and light framework for building neural nets with different architectures 2.0 (Python)☆13Updated 8 years ago
- ☆16Updated 8 years ago
- Spectral Word Embedding Learning for Language (SWELL) toolkit☆27Updated 10 years ago
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++☆49Updated 6 years ago
- Boilerplate code for quickly getting set up to run language modeling experiments☆35Updated 8 years ago
- Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)☆28Updated 10 years ago
- various simple RNNs trained on synthetic grammars☆30Updated 9 years ago
- C++ library for modeling with Pitman-Yor processes☆34Updated 7 years ago
- Parallel (asynchronous) sparse coding implementation for obtaining sparse overcomplete word vectors☆53Updated 7 years ago
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 11 years ago
- hacky exploratory variants on NN language models☆9Updated 9 years ago
- ☆16Updated 9 years ago
- Fast and memory-efficient svmlight / libsvm file loader for Python.☆116Updated 5 years ago
- this is a high performance cuda porting of cbow model of word2vec☆43Updated 10 years ago
- Parallelizing word2vec in shared and distributed memory☆190Updated 2 years ago
- Question Answering via Integer Programming (TableILP)☆28Updated 8 years ago
- C++ program for finding strings that are over-represented in one of two texts☆17Updated 7 years ago
- My solution for the Kaggle "Allen AI science challenge"☆48Updated 8 years ago
- Code for EMNLP 2016 paper: Morphological Priors for Probabilistic Word Embeddings☆52Updated 8 years ago