cloudera / python-ngrams
☆75Updated 11 years ago
Alternatives and similar repositories for python-ngrams:
Users that are interested in python-ngrams are comparing it to the libraries listed below
- DEPRECATED - HBase Stargate (REST API) client wrapper for Python.☆53Updated 6 years ago
- My capstone project for Galvanize (Zipfian Academy)☆38Updated 6 years ago
- 阅读论文备份☆17Updated 8 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 9 years ago
- Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world…☆127Updated 11 years ago
- Machine Learning Using Spark☆7Updated 9 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- An extension of the kafka-python package that adds features like multiprocess consumers.☆39Updated last year
- Film recommendations with Apache Spark and Python☆61Updated 9 years ago
- Experimental parallel data analysis toolkit.☆121Updated 3 years ago
- Public code files for the DDL blog☆56Updated 6 years ago
- PredictionIO Python SDK☆196Updated 6 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- Kaggle Avazu beat-the-benchmark model☆36Updated 10 years ago
- C++ native client for Impala and Hive, with Python / pandas bindings☆72Updated 6 years ago
- ☆24Updated 9 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- Recommender Systems in Depth: An introduction to Recommender Systems using Python and Crab☆44Updated 11 years ago
- SDK for Turi's GraphLab Create.☆149Updated 7 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆157Updated 11 years ago
- RHive is an R extension facilitating distributed computing via Apache Hive.☆123Updated 7 years ago
- Applied Machine Learning in Python with scikit-learn☆47Updated 13 years ago
- A Python MapReduce and HDFS API for Hadoop☆237Updated last week
- PySpark for Elastic Search☆55Updated 7 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Python Streaming Example☆17Updated 10 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- 7th in a competition organised by ICT☆24Updated 9 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge☆98Updated 10 years ago