wattsteve / pyspark-tutorial
Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS
☆8Updated 9 years ago
Related projects: ⓘ
- This project contains the code to translate between Apache Spark and SFrame.☆21Updated 8 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 9 years ago
- Spark Tutorial at the University of Maryland☆38Updated 9 years ago
- Materials for Strata Singapore "Machine learning In Python with scikit-learn" tutorial.☆9Updated 8 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 8 years ago
- Apache Spark under Docker☆9Updated 8 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 10 years ago
- Apache Toree quickstart tutorial☆29Updated 8 years ago
- ☆27Updated this week
- ☆21Updated this week
- Some IPython notebooks I've created...☆29Updated 8 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Updated 2 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆150Updated 8 years ago
- ☆41Updated 7 years ago
- Coding exercises for Apache Spark☆103Updated 9 years ago
- Material and slides for Boston NLP meetup May 23rd 2016☆17Updated 8 years ago
- Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?…☆9Updated 9 years ago
- self organizing map and variations implemented in Spark☆9Updated 8 years ago
- ☆23Updated 7 years ago
- ☆12Updated 8 years ago
- Predicting sales with Pandas☆15Updated 8 years ago
- Materials for dask talk at PyData NYC☆15Updated 8 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 8 years ago
- An Apache Spark-shell backend for IPython☆107Updated 3 years ago
- A real time streaming implementation of markov chain based fraud detection☆24Updated 9 years ago
- ☆24Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- Machine Learning for Cascading☆82Updated 9 years ago
- Zeppelin notebook examples☆26Updated 8 years ago