Aloja / aloja
[DEPRECATED] For read-only reference of the ALOJA Big Data Benchmarking platform: includes tools to define and deploy clusters, orchestrate benchmarking, collect and manage results, and analyze them in Web app including Predictive Analytic tools. Check the website for the datasets and papers
☆23Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for aloja
- Reproducing Distributed Systems and Experiments on Cloud☆39Updated last year
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 8 years ago
- Workshop for Hadoop Operations Best Practices☆10Updated 9 years ago
- Hadoop Profiler, or hprofiler, is a tool which is able to analyze on- and off-CPU workloads on distributed computing environments.☆24Updated 8 years ago
- Cascading on Apache Flink®☆54Updated 9 months ago
- Starter examples to writes distributed fault-tolerant YARN applications☆9Updated 9 years ago
- Scriptable scheduler for periodical Hadoop workflows☆22Updated 6 years ago
- Tools to deploy Hadoop on EMC Isilon☆18Updated 8 years ago
- HDFS Automatic Snapshot Service for Linux☆12Updated 8 years ago
- Apache Incubator Proposal for Heron☆22Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool☆12Updated 7 years ago
- A template-based cluster provisioning system☆61Updated last year
- Ansible playbook for automated HDP 2.x deployment install with Kerberos☆19Updated 8 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Use cases built on SnappyData. Use cases contained here: 1. Ad Analytics 2. Streaming data ingestion from RabbitMQ.☆32Updated 2 years ago
- Ambari View for the Ambari Store☆15Updated 9 years ago
- Muppet☆126Updated 3 years ago
- MPICH2 Hydra scheduler for Apache Mesos.☆29Updated 10 years ago
- Apache Zeppelin Service for Apache Ambari Service. Installation and management of Zeppelin via Ambari.☆14Updated 8 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Updated 2 years ago
- Presto ODBC Driver☆27Updated 10 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 9 years ago
- Exelixi is a distributed framework for running genetic algorithms at scale. The framework is based on Apache Mesos and the code is mostly…☆34Updated 10 years ago