cloudera / python-ngramsLinks
☆75Updated 12 years ago
Alternatives and similar repositories for python-ngrams
Users that are interested in python-ngrams are comparing it to the libraries listed below
Sorting:
- A Python MapReduce and HDFS API for Hadoop☆241Updated 9 months ago
- PredictionIO Python SDK☆196Updated 7 years ago
- Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world…☆128Updated 12 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 6 years ago
- Experimental parallel data analysis toolkit.☆122Updated 4 years ago
- DEPRECATED - HBase Stargate (REST API) client wrapper for Python.☆53Updated 7 years ago
- Film recommendations with Apache Spark and Python☆61Updated 10 years ago
- Rapid Machine Learning Prototyping in Python☆656Updated 9 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆108Updated 11 years ago
- SDK for Turi's GraphLab Create.☆148Updated 7 years ago
- Repository containing files for my PyCon 2014 scikit-learn tutorial.☆225Updated 9 years ago
- Lightweight MapReduce in python☆480Updated 4 years ago
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆241Updated 9 years ago
- DEPRECATED: Orange 2 (Python 2) data mining suite. NEW: https://github.com/biolab/orange3☆307Updated 5 years ago
- Sparkling Pandas☆25Updated 8 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 7 years ago
- Hadoop (Utilities, Patches and Examples)☆244Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆148Updated 9 years ago
- Source code for exploring MLlib blog post☆11Updated 10 years ago
- Reactive Factorization Engine☆104Updated 10 years ago
- How-To code samples for working with GraphLab Create.☆207Updated 8 years ago
- Item and User-based KNN recommendation algorithms using PySpark☆124Updated 7 years ago
- Material for talk "Machine Learning 101" https://speakerdeck.com/kastnerkyle/pycon2015 https://us.pycon.org/2015/schedule/presentation/36…☆87Updated 10 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated 2 years ago
- Zipfian capstone project - Dan Morris☆30Updated 8 years ago
- python library for interacting with SolrCloud☆36Updated 4 years ago
- Code reference from my Qbox blog posts.☆87Updated 10 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 11 years ago
- Applied Machine Learning in Python with scikit-learn☆48Updated 14 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆37Updated 10 years ago