jblomo / oddjob
useful JVM classes for the mrjob hadoop streaming framework
☆31Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for oddjob
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆243Updated 8 years ago
- Django based dashboard for an Apache ZooKeeper cluster.☆166Updated 7 years ago
- https://github.com/apache/incubator-myriad is our new home. See☆253Updated 8 years ago
- Pyleus is a Python framework for developing and launching Storm topologies.☆403Updated 5 years ago
- ☆147Updated 4 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆585Updated 10 years ago
- Oozie - workflow engine for Hadoop☆373Updated 7 years ago
- One click deploy for Storm clusters on AWS☆516Updated 9 years ago
- Hadoop on Mesos☆176Updated 2 years ago
- A Python wrapper for Cascading☆222Updated 4 years ago
- Gearman API - Client, worker, and admin client interfaces☆242Updated 8 years ago
- Send Kafka Metrics to StatsD.☆135Updated 3 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆174Updated 12 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,425Updated 10 years ago
- A time task management framework, support multiple projects, built on top of luigi.☆37Updated 9 years ago
- Python connector for ElasticSearch - the pythonic way to use ElasticSearch☆606Updated 3 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 4 years ago
- Tools for working with parquet, impala, and hive☆134Updated 3 years ago
- Distributed database specialized in exporting key/value data from Hadoop☆558Updated 10 years ago
- Python implementation of the Statsd client/server☆358Updated 3 years ago
- Serving system for batch generated data sets☆176Updated 7 years ago
- KingPin is the toolset used at Pinterest for service discovery and application configuration.☆69Updated 5 years ago
- python elasticsearch client☆360Updated 2 years ago
- Repository of user-contributed gmetric scripts☆148Updated 6 years ago
- Dumps state of Storm Kafka consumers☆96Updated 6 years ago
- Wordnik Open Source Software☆168Updated 9 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆330Updated 2 years ago