ispras / spark-openstack
Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
☆31Updated 3 years ago
Alternatives and similar repositories for spark-openstack:
Users that are interested in spark-openstack are comparing it to the libraries listed below
- ☆41Updated 7 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- ☆106Updated 2 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- analytics tool kit☆43Updated 8 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Affinity Propagation on Spark☆19Updated 3 years ago
- Templates for projects based on top of H2O.☆37Updated 3 months ago
- ☆110Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- An Integrated and collaborative cloud environment for building and running Spark applications on PKS/Kubernetes☆82Updated 4 years ago
- Another, hopefully better, implementation of ALS on Spark☆14Updated 9 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- ☆37Updated 5 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 8 years ago
- ☆24Updated 9 years ago
- Jupyter extensions for SWAN☆58Updated this week
- Ansible playbooks to construct distributed computing environments☆62Updated 3 years ago
- Apache Yarn cluster docker image☆35Updated 7 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 9 years ago
- Cascading on Apache Flink®☆54Updated last year
- HopsWorks - Hadoop for Humans☆117Updated 5 years ago