mcolebrook / DSboxLinks
Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.
☆39Updated 7 years ago
Alternatives and similar repositories for DSbox
Users that are interested in DSbox are comparing it to the libraries listed below
Sorting:
- Apache Zeppelin on Kubernetes.☆28Updated 6 years ago
- Mirror of Apache Zeppelin (Incubating)☆45Updated 9 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- An external PySpark module that works like R's read.csv or Panda's read_csv, with automatic type inference and null value handling. Parse…☆90Updated 9 years ago
- Deep Learning for Pugs☆74Updated 8 years ago
- spark backend for dplyr☆48Updated 9 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52Updated 9 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- Some IPython notebooks I've created...☆29Updated 9 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- DEPRECATED Build, manage and deploy H2O's high-speed machine learning models.☆61Updated 6 years ago
- Docker container for Shiny Server☆14Updated 9 years ago
- ☆146Updated 9 years ago
- R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks☆122Updated 7 years ago
- Spark MOOC setup and labs for DBC users☆45Updated 10 years ago
- Docker images for data science from Wise.io☆50Updated 9 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Source Material for using Python and Hadoop together☆13Updated 8 years ago
- Docker container with a PyData stack and JupyterHub server☆36Updated 9 years ago
- A package that allows R developers to use Hadoop HDFS☆64Updated 7 years ago
- ☆41Updated 8 years ago
- Splittable SAS (.sas7bdat) Input Format for Hadoop and Spark SQL☆92Updated last year
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 6 years ago
- Rebooting ggplot2 for scalable big data visualization☆28Updated 8 years ago
- Google Container Engine, JupyterHub, and Jupyter for classroom scenarios☆59Updated 7 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- Chapter-wise code for Agile Data the O'Reilly book☆159Updated 11 years ago
- Repo for experiments on pyspark and sklearn☆79Updated 11 years ago
- Python and R: Writing Cross Language Tools (SciPy 2016 Talk)☆20Updated 9 years ago