jblomo / oddjob
Useful JVM classes for the mrjob Hadoop streaming framework
☆31 Updated 12 years ago
Alternatives and similar repositories for oddjob
Users who are interested in oddjob are comparing it to the libraries listed below.
- Python MapReduce library written in Cython. Visit us in #hadoopy on freenode. See the link below for documentation and tutorials. ☆241 Updated 10 years ago
- Lightning-fast cluster computing in Java, Scala and Python. ☆1,427 Updated 11 years ago
- A Python wrapper for Cascading ☆221 Updated 6 years ago
- Oozie - workflow engine for Hadoop ☆374 Updated 8 years ago
- A distributed publish/subscribe messaging service ☆564 Updated 2 years ago
- Python module that allows one to easily write and run Hadoop programs. ☆1,032 Updated 8 years ago
- Python client library for Mesos Marathon's REST API ☆195 Updated 5 years ago
- [DEPRECATED] This project is deprecated. It will be archived on December 1, 2017. ☆147 Updated 9 years ago
- Pyleus is a Python framework for developing and launching Storm topologies. ☆400 Updated 7 years ago
- https://github.com/apache/incubator-myriad is our new home. See ☆253 Updated 10 years ago
- One click deploy for Storm clusters on AWS ☆515 Updated 10 years ago
- A pure python HDFS client ☆860 Updated 3 years ago
- Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It … ☆131 Updated 12 years ago
- Python connector for ElasticSearch - the pythonic way to use ElasticSearch ☆606 Updated 4 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project ☆582 Updated 11 years ago
- A stats collector & reporter for Scala servers (deprecated) ☆768 Updated 6 years ago
- Hadoop on Mesos ☆176 Updated 3 years ago
- Gearman API - Client, worker, and admin client interfaces ☆241 Updated 9 years ago
- Serving system for batch generated data sets ☆177 Updated 8 years ago
- A Graph Server (no longer active - see Apache TinkerPop) ☆431 Updated 2 years ago
- too cool for me ☆61 Updated 13 years ago
- A Scala productivity framework for Hadoop. ☆483 Updated 3 years ago
- Wordnik Open Source Software ☆169 Updated 10 years ago
- async Amazon DynamoDB library for Tornado ☆58 Updated 9 years ago
- Python implementation of the Statsd client/server ☆356 Updated 4 years ago
- [DEPRECATED] This project is deprecated. It will be archived on December 1, 2017. ☆184 Updated 8 years ago
- Scala client for the Twitter streaming api ☆66 Updated 14 years ago
- Refactored version of code.google.com/hadoop-gpl-compression for hadoop 0.20 ☆551 Updated last year
- A Functional and Performance Test Framework for Distributed Systems ☆159 Updated 10 years ago
- The metric correlation component of Etsy's Kale system ☆709 Updated 8 years ago