dnafrance/vagrant-hadoop-spark-cluster

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dnafrance/vagrant-hadoop-spark-cluster)

dnafrance / vagrant-hadoop-spark-cluster

Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1

☆125

Alternatives and similar repositories for vagrant-hadoop-spark-cluster

Users that are interested in vagrant-hadoop-spark-cluster are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vangj / vagrant-hadoop-2.4.1-spark-1.0.1
View on GitHub
Vagrant project to spin up a cluster virtual machines with Hadoop v2.4.1 and Spark v1.0.1
☆83Aug 21, 2015Updated 10 years ago
manuparra / volleyball-performance-analysis
View on GitHub
R package to Volleyball Performance Analysis and Visualization
☆11Apr 22, 2017Updated 9 years ago
ofermend / practical-data-science-with-hadoop-and-spark
View on GitHub
☆26Jan 2, 2024Updated 2 years ago
Cascading / vagrant-cascading-hadoop-cluster
View on GitHub
Deploying apache-hadoop in a virtualized cluster as easy as 1-2-3.
☆127Jan 16, 2017Updated 9 years ago
allenday / spark-genome-alignment-demo
View on GitHub
An example of bioinformatics and bigdata tools can playing nicely together
☆14May 17, 2016Updated 10 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
caioquirino / docker-cloudera-quickstart
View on GitHub
Docker Cloudera Quick Start Image
☆92Jul 29, 2017Updated 8 years ago
clear-code / libreoffice-export-all-to-csv
View on GitHub
Provides "Export All Sheets to CSV files" menu for LibreOffice/OpenOffice.org Calc
☆16Apr 18, 2017Updated 9 years ago
RBigData / pbdML
View on GitHub
☆15Jul 12, 2019Updated 7 years ago
felixcheung / vagrant-projects
View on GitHub
Vagrant projects for various use-cases with Spark, Zeppelin, IPython / Jupyter, SparkR
☆34May 13, 2016Updated 10 years ago
jdye64 / docker-hwx
View on GitHub
Combination of Dockerized Hortonworks projects and other Hadoop ecosystem components
☆10Oct 11, 2019Updated 6 years ago
holdenk / high-performance-spark-examples
View on GitHub
Examples for High Performance Spark
☆16Oct 25, 2025Updated 8 months ago
git4impatient / quickKerberos
View on GitHub
☆16Jun 22, 2015Updated 11 years ago
VariantEffect / MaveDB
View on GitHub
MaveDB database web application
☆13Nov 17, 2023Updated 2 years ago
ispras / spark-openstack
View on GitHub
Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.
☆31Oct 22, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Cascading / cascading.samples
View on GitHub
Sample applications using Cascading
☆20Jun 7, 2015Updated 11 years ago
Lewuathe / docker-hadoop-cluster
View on GitHub
Multiple node cluster on Docker for self development.
☆91Jul 7, 2018Updated 8 years ago
chop-dbhi / data-models-sqlalchemy
View on GitHub
SQLAlchemy models and DDL and ERD generation from chop-dbhi/data-models style JSON endpoints.
☆11May 22, 2023Updated 3 years ago
manuparra / Time-series---state-of-the-art
View on GitHub
State of the art on DeepLearn and Time Series
☆18Mar 22, 2017Updated 9 years ago
manuparra / PracticasCC
View on GitHub
Guión de prácticas de Cloud Computing - Máster en Ingeniería Informática - www.ugr.es
☆11Dec 16, 2019Updated 6 years ago
sbcd90 / siddhi-kafka-cep
View on GitHub
This is a simple CEP Engine leveraging the Kafka Streams platform
☆16Apr 25, 2017Updated 9 years ago
fabric8io / docker-gerrit
View on GitHub
a docker image for gerrit
☆12Aug 17, 2018Updated 7 years ago
yorek / zeppelin
View on GitHub
Apache Zeppelin with support for SQL Server
☆16Sep 25, 2017Updated 8 years ago
jayunit100 / SparkStreamingApps
View on GitHub
A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples)
☆57Jun 1, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
shagunsodhani / Lambda-Architecture
View on GitHub
Notes on Lambda Architecture
☆11Feb 9, 2018Updated 8 years ago
ericduq / hadoop-scripts
View on GitHub
☆19Mar 29, 2014Updated 12 years ago
obophenotype / obsolete-hdo
View on GitHub
New repository for managing the source for the human disease ontology
☆15Jul 28, 2015Updated 10 years ago
mkuthan / example-spark-kafka
View on GitHub
Apache Spark and Apache Kafka integration example
☆122Dec 21, 2017Updated 8 years ago
chali / hadoop-cdh-pseudo-docker
View on GitHub
☆47May 1, 2017Updated 9 years ago
amplab / training
View on GitHub
Training materials for Strata, AMP Camp, etc
☆150Nov 20, 2015Updated 10 years ago
jkleint / ansible-hadoop
View on GitHub
THIS REPOSITORY IS VERY OUTDATED. See Ansible Galaxy instead.
☆28Oct 23, 2018Updated 7 years ago
jeffreybreen / talk-201210-data-deluge
View on GitHub
"Tapping the Data Deluge with R" lightning talk at Predictive Analytics World, Boston, October 1, 2012
☆22Oct 2, 2012Updated 13 years ago
bjonnh / FilePermissionsPlugin
View on GitHub
File Permissions Plugin is a repository that provides a simple plugin to change file permissions directly from IntelliJ.
☆10Jul 1, 2026Updated 2 weeks ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
joseph-rickert / DataScienceRWebinar
View on GitHub
The repository contains slides, code and markdown files for the Revolution Analytics Webinar: Data Science with R given 9/25/14
☆15Sep 24, 2014Updated 11 years ago
irifed / ansible-bdas
View on GitHub
Ansible recipes for Berkeley Data Analytics Stack deployment
☆17Aug 7, 2015Updated 10 years ago
pmelsted / AM_2024
View on GitHub
Analaysis for the batch correction paper
☆12Apr 26, 2025Updated last year
grycap / ansible-role-hadoop
View on GitHub
Ansible Role to install a Hadoop Cluster
☆16Apr 1, 2026Updated 3 months ago
spark-mooc / mooc-setup
View on GitHub
Information for setting up for the BerkeleyX Spark Intro MOOC, and lab assignments for the course
☆347Mar 19, 2021Updated 5 years ago
BenFradet / dashing
View on GitHub
Dashboards to monitor your open source organization's health
☆11Dec 19, 2019Updated 6 years ago
kite-sdk / kite-examples
View on GitHub
Kite SDK Examples
☆99May 8, 2021Updated 5 years ago