PiercingDan / spark-Jupyter-AWS
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
☆262Updated 7 years ago
Alternatives and similar repositories for spark-Jupyter-AWS:
Users that are interested in spark-Jupyter-AWS are comparing it to the libraries listed below
- Content for architecting a data science platform for products using Luigi, Spark & Flask.☆163Updated 5 years ago
- Deep Learning for Pugs☆74Updated 7 years ago
- ☆85Updated 6 years ago
- ☆263Updated 5 years ago
- Curated list of all dataset websites that I find☆84Updated 6 years ago
- VM based deployment for prototyping Big Data tools on Amazon Web Services☆128Updated 4 years ago
- Open source Flotilla☆192Updated this week
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆172Updated 6 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- PyData NYC 2015 conference☆94Updated 9 years ago
- ☆146Updated 8 years ago
- Directory of Jupyter notebooks exploring various topics☆316Updated 7 years ago
- ☆318Updated 3 years ago
- PyData, The Complete Works of☆298Updated 8 years ago
- ☆52Updated 8 years ago
- DePy 2015 Talk☆117Updated 7 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- Observations from Ian on successfully delivering data science products☆543Updated 3 years ago
- Arbalest is a Python data pipeline orchestration library for Amazon S3 and Amazon Redshift. It automates data import into Redshift and ma…☆41Updated 9 years ago
- A fork of the cookiecutter-data-science leveraging Docker for local development.☆130Updated 5 years ago
- Start a cluster in EC2 for dask.distributed☆106Updated 4 years ago
- A luigi powered analytics / warehouse stack☆87Updated 7 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework☆85Updated 8 years ago
- ☆117Updated last month
- Standard evaluations for binary classifiers so you don't have to☆315Updated 6 years ago
- Repeatable analysis plugin for Jupyter notebook☆260Updated 2 years ago
- ☆160Updated 8 years ago
- All materials for workshops - HackOn(Data) - Toronto☆33Updated 7 years ago
- Jupyter Notebook extension for Apache Spark integration☆191Updated 4 years ago