Spirals-Team / hadoop-benchmark
Docker containers to build an Hadoop infrastructure and experiment feedback control loops atop of it.
☆9Updated 6 years ago
Alternatives and similar repositories for hadoop-benchmark:
Users that are interested in hadoop-benchmark are comparing it to the libraries listed below
- Scripts to setup Spark cluster (any version) in any Openstack environment with optional useful tools.☆31Updated 3 years ago
- CS294 RISE Course Material☆32Updated 6 years ago
- Apache Spark under Docker☆9Updated 8 years ago
- ☆41Updated 7 years ago
- Benchmarks of BLAS libraries with Scala interface☆30Updated 9 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- My MSc on Data Science final project. This is a library for Data Pre-processing Algorithms for Streaming in Flink (DPASF)☆18Updated 5 years ago
- Docker Grid5000 client (CLI and Go library)☆12Updated 7 years ago
- analytics tool kit☆43Updated 8 years ago
- Trending on Accumulo☆40Updated 12 years ago
- GPU Acceleration for Apache Spark☆34Updated 9 years ago
- ☆12Updated 9 years ago
- Security log file challenge☆28Updated 8 years ago
- A deep learning library for Apache SystemML.☆9Updated 7 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- ☆15Updated 7 years ago
- Set up tools for running a few DL libraries on CDH and CDSW☆17Updated 4 years ago
- ☆24Updated 8 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- A framework for scalable graph computing.☆147Updated 6 years ago
- ☆31Updated 5 years ago
- Implementation of the Loopy Belief Propagation algorithm for Apache Spark☆41Updated 4 years ago
- Course material for Algorithms and Data Structures (TU Delft TI3110TU)☆10Updated 6 years ago
- Spark GPU and SIMD Support☆61Updated 4 years ago
- ☆23Updated 7 years ago
- Drizzle integration with Apache Spark☆120Updated 6 years ago
- Dynamic Distributed Dimensional Data Model☆41Updated 11 months ago
- Yggdrasil: Faster Decision Trees Using Column Partitioning in Spark☆31Updated 6 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆93Updated 9 years ago
- [DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestr…☆23Updated 4 years ago