ispras / spark-openstackLinks
Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
☆31Updated 3 years ago
Alternatives and similar repositories for spark-openstack
Users that are interested in spark-openstack are comparing it to the libraries listed below
Sorting:
- ☆41Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- analytics tool kit☆43Updated 8 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Spark library for doing exploratory data analysis in a scalable way☆43Updated 9 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Example for experimenting with how JupyterHub can be configured to work with Kerberos☆33Updated 7 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead☆52Updated 7 years ago
- Machine Learning Pipeline Stages for Spark (exposed in Scala/Java + Python)☆74Updated last year
- Spark Parameter Optimization and Tuning☆31Updated 7 years ago
- An Apache Spark-shell backend for IPython☆105Updated 4 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- Big Data Science Swiss Army Knife - http://www.tuktu.io --☆60Updated 7 years ago
- ☆146Updated 9 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 9 years ago
- Portable Format for Analytics☆27Updated 8 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- Code to allow running BIDMach on Spark including HDFS integration and lightweight sparse model updates (Kylix).☆15Updated 4 years ago
- ☆110Updated 8 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 10 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- self organizing map and variations implemented in Spark☆9Updated 9 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Updated 9 years ago
- HopsWorks - Hadoop for Humans☆117Updated 6 years ago
- Pig on Apache Spark☆83Updated 10 years ago
- ☆106Updated 2 years ago
- Using JPMML Evaluator to validate the PMML models exported from Spark☆19Updated 8 years ago