wattsteve / pyspark-tutorial
Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS
☆8 · Updated 10 years ago
Alternatives and similar repositories for pyspark-tutorial:
Users who are interested in pyspark-tutorial compare it to the repositories listed below.
- Apache Toree quickstart tutorial ☆29 · Updated 9 years ago
- The code for the in-memory data pipeline that was presented at Berlin Buzzwords 2015. ☆10 · Updated 9 years ago
- Spark Tutorial at the University of Maryland ☆38 · Updated 10 years ago
- Repo for experiments on pyspark and sklearn ☆79 · Updated 11 years ago
- Additional useful algorithms that can be used with spark. ☆24 · Updated 10 years ago
- ☆23 · Updated 7 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Updated 8 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 · Updated 8 years ago
- Training materials for Strata, AMP Camp, etc ☆149 · Updated 9 years ago
- Spark in Kaggle competitions ☆9 · Updated 9 years ago
- ☆41 · Updated 7 years ago
- Apache Spark under Docker ☆9 · Updated 8 years ago
- Zeppelin notebook examples ☆26 · Updated 9 years ago
- A real-time streaming implementation of Markov chain based fraud detection ☆23 · Updated 10 years ago
- Real time and offline time series analysis with Spark, Spark Streaming and Storm ☆21 · Updated 4 years ago
- GPU Acceleration for Apache Spark ☆34 · Updated 9 years ago
- Coursera Machine Learning class examples in Spark ☆43 · Updated 11 years ago
- Starter project for building MemSQL Streamliner Pipelines ☆32 · Updated 8 years ago
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase ☆14 · Updated 9 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR ☆34 · Updated 8 years ago
- Predicting sales with Pandas ☆15 · Updated 9 years ago
- Simple Spark example of generating table stats for use in data quality checks ☆28 · Updated 7 years ago
- Building Python Data Application Tutorials ☆23 · Updated 8 months ago
- ☆15 · Updated 7 years ago
- Oracle Data Science Bootcamp 2014 ☆24 · Updated 10 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media ☆37 · Updated 2 years ago
- Examples for Fast Data Processing with Spark ☆59 · Updated 11 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 · Updated 8 years ago
- ☆12 · Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix). ☆15 · Updated 4 years ago