Spirals-Team / hadoop-benchmark

Docker containers to build a Hadoop infrastructure and experiment with feedback control loops on top of it.

☆9

Alternatives and similar repositories for hadoop-benchmark

Users that are interested in hadoop-benchmark are comparing it to the libraries listed below

allenday / spark-genome-alignment-demo
An example of bioinformatics and big data tools playing nicely together
☆14 Updated 9 years ago
karamelchef / karamel
Reproducing Distributed Systems and Experiments on Cloud
☆40 Updated last year
mdymczyk / iot-pipeline
☆15 Updated 7 years ago
marcbux / Hi-WAY
Heterogeneity-incorporating Workflow ApplicationMaster for YARN
☆26 Updated 7 years ago
deepsense-ai / seahorse-workflow-executor
☆41 Updated 8 years ago
Anchormen / spark-hdfs-on-kubernetes
☆20 Updated 7 years ago
Accla / d4m
Dynamic Distributed Dimensional Data Model
☆43 Updated last year
med-at-scale / high-health
Integrate the GA4GH schemas and probably a scala impl of the service.
☆14 Updated 9 years ago
trustedanalytics / atk
analytics tool kit
☆43 Updated 8 years ago
nrpowell / grakn-movie-recommender
☆19 Updated 7 years ago
asimjalis / apache-toree-quickstart
Apache Toree quickstart tutorial
☆29 Updated 9 years ago
AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with spark.
☆24 Updated 10 years ago
unchartedsoftware / salt-core
Looking at big data? Add a little salt.
☆59 Updated 2 years ago
d-miller / correlation-scatter
Correlation matrix with scatter plot using d3.js
☆19 Updated 10 years ago
4Quant / spark-summit-2014-presentation
The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark
☆17 Updated 11 years ago
googlegenomics / dockerflow
Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API
☆99 Updated 7 years ago
amplab / ml-matrix
Distributed Matrix Library
☆72 Updated 8 years ago
bigdatagenomics / avocado
A Variant Caller, Distributed. Apache 2 licensed.
☆71 Updated 6 years ago
bigdatagenomics / bdg-formats
Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.
☆40 Updated 6 months ago
antidata / htm-moclu
Hierarchical Temporal Memory Models Cluster implementation
☆13 Updated 5 years ago
bellettif / sparkGeoTS
☆12 Updated 9 years ago
opencb / hpg-bigdata
This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster
☆17 Updated 7 years ago
googlegenomics / dataflow-java
Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.
☆37 Updated 2 years ago
jshmain / cloudera-search
☆18 Updated 9 years ago
h2oai / h2o-kubeflow
☆37 Updated 6 years ago
adobe-research / spark-gpu
GPU Acceleration for Apache Spark
☆34 Updated 9 years ago
BauerLab / VariantSpark
VariantSpark is a framework for applying Spark-based Machine Learning methods to whole-genome variant information
☆33 Updated 7 years ago
WhiteFangBuck / CDSW-DL
Set up tools for running a few DL libraries on CDH and CDSW
☆17 Updated 5 years ago
ceteri / spark-exercises
Coding exercises for Apache Spark
☆104 Updated 10 years ago
rapidsai / spark-examples
[ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples
☆72 Updated 5 years ago