KristianHolsheimer / pyspark-setup-guideLinks
A guide for setting up Spark + PySpark under Ubuntu linux
☆56Updated 8 years ago
Alternatives and similar repositories for pyspark-setup-guide
Users that are interested in pyspark-setup-guide are comparing it to the libraries listed below
Sorting:
- Code & Data for Introduction to Machine Learning with Scikit-Learn☆81Updated 6 years ago
- ☆146Updated 9 years ago
- Code reference from my Qbox blog posts.☆87Updated 9 years ago
- Oracle Data Science Bootcamp 2014☆25Updated 10 years ago
- Kaggle Submission for "Detecting Insults in Social Commentary"☆150Updated 9 years ago
- Notebooks (and slides) for my PyData NYC 2014 tutorial on the more advanced features of scikit-learn.☆69Updated 10 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- ☆27Updated 10 years ago
- Predicting happiness from demographics and poll answers☆45Updated 8 years ago
- Collection of dask example notebooks☆58Updated 7 years ago
- Repository for my 'K-Means Clustering with Scikit-Learn' talk materials.☆43Updated 6 years ago
- Spark 2.0 Python Machine Learning examples☆99Updated 5 years ago
- All Kaggle competitions☆91Updated 8 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- An entry to kaggle's 'Sentiment Analysis on Movie Reviews' competition☆180Updated 6 years ago
- Kaggle competition☆23Updated 9 years ago
- A short guide for transitioning from Python to Scala☆65Updated 9 years ago
- Simple demonstration of how to build a complex real time machine learning visualization tool.☆16Updated 9 years ago
- Natural Language Processing with Spark's MLlib☆62Updated 7 years ago
- Training models with Apache Spark, PySpark for Titanic Kaggle competition☆14Updated 8 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- Pydata NYC 2014 Scikit Learn Tutorial☆65Updated 10 years ago
- Code for the Kaggle acquire valued shoppers challenge☆66Updated 11 years ago
- EuroScipy 2014 tutorial: Introduction to predictive analytics with pandas and scikit-learn☆85Updated 10 years ago
- Advanced Scikit-learn training session☆118Updated 8 years ago
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- A short tutorial notebook on PySpark☆15Updated 9 years ago
- Slides and notebooks for PyData Strata San Jose☆50Updated 10 years ago