toddlipcon / haatkit
Toolkit of simple scripts useful for managing Hadoop
☆16 Updated 13 years ago
Alternatives and similar repositories for haatkit
Users who are interested in haatkit are comparing it to the libraries listed below.
- Bigtop is a project for the development of packaging and tests of the Apache Hadoop ecosystem. The primary goal of Bigtop is to build a … ☆50 Updated 14 years ago
- Log processing system using Flume and Cassandra ☆75 Updated 14 years ago
- A set of examples and utilities for using Pig with Cassandra. For the latest jar release, check the Downloads link. ☆85 Updated 11 years ago
- S4 repository ☆140 Updated 13 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework ☆292 Updated 3 years ago
- Hadoop Input and Output formats for MongoDB ☆29 Updated 13 years ago
- Graphite dashboard system ☆155 Updated 12 years ago
- Bulk loading for elastic search ☆185 Updated last year
- Our puppet modules ☆18 Updated 13 years ago
- Server automation framework and application ☆86 Updated 13 years ago
- git-bzr mirror of graphite trunk ☆81 Updated 4 years ago
- Honu is a large scale data collection and processing pipeline ☆83 Updated 14 years ago
- Continuous Streaming SQL Queries for Flume ☆95 Updated 13 years ago
- Equivalent of stress.py, but more powerful ☆33 Updated 13 years ago
- Chef orchestration layer -- your system diagram come to life. Provision EC2, OpenStack or Vagrant without changes to cookbooks or configu… ☆502 Updated 11 years ago
- Ruby interface to Hadoop's HDFS via Thrift ☆50 Updated 11 years ago
- DEPRECATED & read-only!! - This repo exists for GPL compliance only. CloudStack development has moved to the ASF - see http://cloudstack.… ☆253 Updated 12 years ago
- Galaxy is a lightweight software deployment and management tool. We use it at Ning to manage the Java cores and Apache httpd instances th… ☆67 Updated 14 years ago
- A plugin for flume that allows you to use Cassandra as a sink. ☆59 Updated 13 years ago
- S4 is a general-purpose, distributed, scalable, partially fault-tolerant, pluggable platform that allows programmers to easily develop ap… ☆233 Updated 14 years ago
- Blueprint I/O moves blueprints around ☆61 Updated 14 years ago
- Loggly's main syslog/network input system ☆15 Updated 14 years ago
- Some utilities for Lucene ☆111 Updated 12 years ago
- Coordination and configuration manager for distributed applications. ☆19 Updated 13 years ago
- Flexible data workflow glue. ☆28 Updated 14 years ago
- coordination helpers for distributed celluloid for use with systems automation frameworks e.g. Chef ☆25 Updated 13 years ago
- Elasticsearch Puppet Module ☆37 Updated 14 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project ☆583 Updated 11 years ago
- Mirror of Apache Whirr ☆94 Updated 8 years ago
- Hardware discovery and configuration component of Crowbar ☆30 Updated 13 years ago