Spirals-Team / hadoop-benchmarkLinks
Docker containers to build an Hadoop infrastructure and experiment feedback control loops atop of it.
☆9Updated 7 years ago
Alternatives and similar repositories for hadoop-benchmark
Users that are interested in hadoop-benchmark are comparing it to the libraries listed below
Sorting:
- An example of bioinformatics and bigdata tools can playing nicely together☆14Updated 9 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆40Updated last year
- ☆15Updated 7 years ago
- Heterogeneity-incorporating Workflow ApplicationMaster for YARN☆26Updated 7 years ago
- ☆41Updated 8 years ago
- ☆20Updated 7 years ago
- Dynamic Distributed Dimensional Data Model☆43Updated last year
- Integrate the GA4GH schemas and probably a scala impl of the service.☆14Updated 9 years ago
- analytics tool kit☆43Updated 8 years ago
- ☆19Updated 7 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- Looking at big data? Add a little salt.☆59Updated 2 years ago
- Correlation matrix with scatter plot using d3.js☆19Updated 10 years ago
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark☆17Updated 11 years ago
- Dockerflow is a workflow runner that uses Dataflow to run a series of tasks in Docker with the Pipelines API☆99Updated 7 years ago
- Distributed Matrix Library☆72Updated 8 years ago
- A Variant Caller, Distributed. Apache 2 licensed.☆71Updated 6 years ago
- Open source formats for scalable genomic processing systems using Avro. Apache 2 licensed.☆40Updated 6 months ago
- Hierarchical Temporal Memory Models Cluster implementation☆13Updated 5 years ago
- ☆12Updated 9 years ago
- This repository implements converters and tools for working with NGS data in HPC or Hadoop cluster☆17Updated 7 years ago
- Google Cloud Dataflow pipelines such as Identity-By-State as well as useful utility classes.☆37Updated 2 years ago
- ☆18Updated 9 years ago
- ☆37Updated 6 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- VariantSpark is a framework for applying Spark-based Machine Learning methods to whole-genome variant information☆33Updated 7 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 5 years ago
- Coding exercises for Apache Spark☆104Updated 10 years ago
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆72Updated 5 years ago