jblomo / oddjobLinks
useful JVM classes for the mrjob hadoop streaming framework
☆31Updated 12 years ago
Alternatives and similar repositories for oddjob
Users that are interested in oddjob are comparing it to the libraries listed below
Sorting:
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials.☆242Updated 9 years ago
- Oozie - workflow engine for Hadoop☆373Updated 8 years ago
- One click deploy for Storm clusters on AWS☆515Updated 10 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,428Updated 11 years ago
- A Python wrapper for Cascading☆222Updated 5 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆583Updated 11 years ago
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,136Updated 2 years ago
- Python module that allows one to easily write and run Hadoop programs.☆1,030Updated 7 years ago
- A stats collector & reporter for Scala servers (deprecated)☆770Updated 6 years ago
- A distributed publish/subscribe messaging service☆560Updated 2 years ago
- Pyleus is a Python framework for developing and launching Storm topologies.☆400Updated 6 years ago
- Wordnik Open Source Software☆168Updated 10 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- [DEPRECATED] This project is deprecated. It will be archived on December 1, 2017.☆147Updated 8 years ago
- ☆146Updated 5 years ago
- Zohmg is a data store for aggregation of multi-dimensional time series data, built on top of Hadoop, Dumbo and HBase.☆174Updated 12 years ago
- A pure python HDFS client☆857Updated 3 years ago
- Scala client for the Twitter streaming api☆66Updated 14 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric.☆43Updated 10 years ago
- Repository of other user-contributed tools and programs☆149Updated 3 years ago
- Distributed database specialized in exporting key/value data from Hadoop☆559Updated 11 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆328Updated 3 years ago
- initial setup for a scala library or server, using sbt☆123Updated 8 years ago
- S4 repository☆141Updated 13 years ago
- Hadoop on Mesos☆175Updated 2 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20☆552Updated last year
- too cool for me☆61Updated 12 years ago
- Crux is a reporting application for HBase. Crux provides a simple web based graphical interface to access HBase, query data and create re…☆100Updated 12 years ago
- A Scala productivity framework for Hadoop.☆482Updated 3 years ago
- Send Kafka Metrics to StatsD.☆135Updated 4 years ago