Aloja / alojaLinks
[DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestrate benchmarking, collect and manage results, and analyze them in Web app including Predictive Analytic tools. Check the website for the datasets and papers
☆23Updated 4 years ago
Alternatives and similar repositories for aloja
Users that are interested in aloja are comparing it to the libraries listed below
Sorting:
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Updated 7 years ago
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- A scalable, distributed Time Series Database.☆28Updated 10 years ago
- Cascading on Apache Flink®☆54Updated last year
- Platform documentation☆16Updated 9 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 10 years ago
- install Cloudera's distribution of Hadoop including Cloudera Manager and Cloudera Search (Beta)☆31Updated 11 years ago
- Allows wrapping existing WebUI pages and present them as Ambari Views☆9Updated 9 years ago
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- A tutorial that explains how to build a simple distributed fault-tolerant framework on top of Mesos☆47Updated 2 years ago
- ☆10Updated 10 years ago
- ☆9Updated 9 years ago
- Extensible Python Framework for Apache Mesos☆33Updated 7 years ago
- Presto K8S Operator☆9Updated 5 years ago
- Exelixi is a distributed framework for running genetic algorithms at scale. The framework is based on Apache Mesos and the code is mostly…☆34Updated 11 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Tool for gathering blocks and replicas meta data from HDFS. It also builds a heat map showing how replicas are distributed along disks an…☆55Updated 8 years ago
- Cloudbreak Deployer Tool☆34Updated last year
- A template-based cluster provisioning system☆61Updated 2 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 7 years ago
- Groovy client library for Apache Ambari's REST API☆20Updated 3 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Zipkin Mesos Framework☆31Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 10 years ago
- Anomaly Detection Framework☆24Updated 9 years ago