mfisk / filemapLinks
File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.
☆228Updated 4 years ago
Alternatives and similar repositories for filemap
Users that are interested in filemap are comparing it to the libraries listed below
Sorting:
- C network daemon for HyperLogLogs☆450Updated 4 years ago
- mapreduce in bash☆920Updated 5 years ago
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- ScalienDB is a scalable, replicated datastore.☆86Updated 12 years ago
- Gremlins is a python framework for fault-testing distributed systems☆123Updated 11 years ago
- F@#$*&%Q (Message queue that is fast, brokered, in C and gets out of your way)☆284Updated 3 months ago
- A utility for sorting really big files. http://kmkeen.com/gz-sort/☆94Updated 7 years ago
- A python RPC client stack☆45Updated 3 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆131Updated 7 years ago
- A quixotic quest to coordinate StatsD implementations☆142Updated 12 years ago
- Create disposable redis servers on the fly for testing☆16Updated 9 years ago
- a small, lightweight pre-forking container☆21Updated 4 years ago
- A single-command bittorrent distribution system, based on Twitter's Murder☆412Updated 8 years ago
- A consistent-hashing relay for statsd and carbon metrics☆101Updated 4 years ago
- A key/value store for serving static batch data☆175Updated 2 years ago
- Performance metrics, based on Coda Hale's Yammer metrics☆196Updated 2 years ago
- Fork of Hustle - Originally developed at Chango - A column oriented, embarrassingly distributed relational event database.☆44Updated 10 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It …☆133Updated 11 years ago
- Fast Protocol Buffers module for Python☆40Updated 9 years ago
- Automated message queue orchestration for scaled-up benchmarking.☆237Updated 9 years ago
- A column oriented, embarrassingly distributed relational event database.☆238Updated 7 years ago
- ☆44Updated 3 years ago
- Python bindings for TrailDB☆38Updated 5 years ago
- A port of HdrHistogram in native python☆154Updated 6 months ago
- UI for interactive data analysis | https://snorkel.logv.org☆164Updated last year
- pesos is a pure python implementation of the mesos framework api☆47Updated 9 years ago
- Mount Everest Application Framework☆119Updated 3 weeks ago
- Implementations of a data structure with false negatives but no false positives.☆358Updated last year