ispras / spark-openstack
Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
☆31Updated 3 years ago
Alternatives and similar repositories for spark-openstack
Users that are interested in spark-openstack are comparing it to the libraries listed below
Sorting:
- ☆41Updated 7 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr…☆23Updated 4 years ago
- ☆106Updated 2 years ago
- An Apache Spark-shell backend for IPython☆105Updated 3 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 6 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- Coding exercises for Apache Spark☆104Updated 9 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Spark ML Lib serving library☆48Updated 6 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34Updated 9 years ago
- This project provides association rule mining for Apache Spark. The algorithms are based on the work of Philippe Fournier-Viger and comp…☆31Updated 10 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- analytics tool kit☆43Updated 8 years ago
- ☆111Updated 8 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Updated 10 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- ☆146Updated 9 years ago
- HDFS / Spark / Mesos / Elasticsearch / Kibana / Zeppelin BigDataLab with Ansible☆31Updated 8 years ago
- Support Highcharts in Apache Zeppelin☆81Updated 7 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Groovy client library for Apache Ambari's REST API☆20Updated 3 years ago
- KDC for Cloudbreak provisioned Hadoop clusters☆15Updated 3 years ago
- functionstest☆33Updated 8 years ago
- Flowmix is a flexible event processing engine for Apache Storm. It supports complex correlations of events via sliding/tumbling windows. …☆58Updated 9 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files☆50Updated 11 years ago