Dockerfiles and scripts for Spark and Shark Docker images
☆259Jun 19, 2014Updated 11 years ago
Alternatives and similar repositories for docker-scripts
Users that are interested in docker-scripts are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Scripts to launch cluster used for Strata☆33Feb 11, 2014Updated 12 years ago
- ☆760Mar 11, 2021Updated 5 years ago
- Large scale query engine benchmark☆99Apr 5, 2016Updated 10 years ago
- A Storm Based DRPC Search Engine☆31Aug 26, 2015Updated 10 years ago
- Ansible recipes for Berkeley Data Analytics Stack deployment☆17Aug 7, 2015Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆146Mar 14, 2016Updated 10 years ago
- Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR☆34May 13, 2016Updated 9 years ago
- An efficient updatable key-value store for Apache Spark☆255Mar 11, 2017Updated 9 years ago
- Library and accelerator backend☆15Updated this week
- An Apache Spark-shell backend for IPython☆105Jul 2, 2021Updated 4 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin☆52May 13, 2016Updated 9 years ago
- A set of scripts and config files to run a Cassandra cluster from Docker☆215Apr 24, 2014Updated 11 years ago
- Host and Container metrics using CAdvisor and Collectd☆23Nov 13, 2017Updated 8 years ago
- Dockerfiles for building a storm cluster.☆230Mar 2, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Helper for consuming Divolte events from Kafka queues and deserializing Avro records into Java objects using Avro's generated code.☆15Nov 6, 2014Updated 11 years ago
- Distributed Matrix Library☆72Jan 28, 2017Updated 9 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Nov 30, 2014Updated 11 years ago
- Benchmarking toolkit for variant calling☆48Oct 13, 2020Updated 5 years ago
- Dockerize SciDB☆15Oct 20, 2017Updated 8 years ago
- Simple Spark Application☆76Dec 17, 2023Updated 2 years ago
- Helper for using augeas with puppet☆43Mar 27, 2026Updated 2 weeks ago
- Sparkling Pandas☆362Jul 6, 2023Updated 2 years ago
- Docker containers for the IPython notebook (+SciPy Stack)☆187Jun 21, 2016Updated 9 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- analytics tool kit☆41Jan 23, 2017Updated 9 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Aug 14, 2014Updated 11 years ago
- Utilities for converting to and from JSON from Avro records via Hadoop streaming or Hive.☆29Oct 13, 2020Updated 5 years ago
- Utilities to use Avro files from Hadoop Map/Reduce jobs and Streaming☆26Sep 10, 2013Updated 12 years ago
- force libinput to emulate a middle click when pressing left and right buttons simultaneously☆21Apr 1, 2020Updated 6 years ago
- This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch.☆212Nov 8, 2014Updated 11 years ago
- Scripts used to setup a Spark cluster on EC2☆387Nov 22, 2017Updated 8 years ago
- Quickly provision a multi-VM Cassandra cluster☆53Aug 10, 2018Updated 7 years ago
- Cassandra in Docker☆129Jul 22, 2015Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Approximate nearest neighbors in Java☆145Oct 13, 2020Updated 5 years ago
- Making multiple server Storm setups easy, in Docker☆42Dec 26, 2014Updated 11 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆108Oct 21, 2014Updated 11 years ago
- CUDA kernel and JNI code which is called by Apache Spark's MLlib.☆19Jun 18, 2016Updated 9 years ago
- Source code for 'Pro Hadoop Data Analytics' by Kerry Koitzsch☆14Jul 6, 2023Updated 2 years ago
- A sane date/time python interface #hubspot-open-source☆58Feb 6, 2019Updated 7 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1☆125Jan 31, 2016Updated 10 years ago