ziyuang / mincemeatpyLinks
Lightweight MapReduce in Python3
☆36Updated 5 years ago
Alternatives and similar repositories for mincemeatpy
Users that are interested in mincemeatpy are comparing it to the libraries listed below
Sorting:
- Lightweight MapReduce in python☆478Updated 4 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.☆215Updated 3 years ago
- Distributed Numpy☆148Updated 7 years ago
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 15 years ago
- SDK for Turi's GraphLab Create.☆148Updated 7 years ago
- Battle-tested Apache Storm Multi-Lang implementation for Python☆70Updated 2 weeks ago
- PredictionIO Python SDK☆196Updated 7 years ago
- Sentiment analysis made easy; built on top off solid libraries.☆24Updated 8 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 9 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆108Updated 12 years ago
- Wabbit Wappa is a full-featured Python wrapper for the Vowpal Wabbit machine learning utility.☆101Updated 7 years ago
- A tiny python utility that converts data crawled from different services into a cloud of words☆30Updated 7 years ago
- A wrapper for libhdfs3 to interact with HDFS from Python☆137Updated 4 years ago
- Transition-based statistical parser☆417Updated 7 years ago
- LSH based high dimensional clustering for sets and points☆79Updated 10 years ago
- Unified interface for local and distributed ndarrays☆157Updated 6 years ago
- ☆162Updated 4 years ago
- Quickly start YARN cluster on EC2☆30Updated 8 years ago
- Running Tensorflow on Spark in the scalable, fast and compatible style☆21Updated 8 years ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated last year
- A pure Python implementation of Apache Spark's RDD and DStream interfaces.☆270Updated 11 months ago
- A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.☆147Updated 11 months ago
- Simple PMML exporter for Keras Deep Learning models.☆32Updated 6 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- A simple pure-Python decision tree construction algorithm☆56Updated 4 years ago
- cuda implementation of CBOW model (word2vec)☆117Updated 11 years ago