mfisk / filemap
File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.
☆227Updated 4 years ago
Alternatives and similar repositories for filemap:
Users that are interested in filemap are comparing it to the libraries listed below
- C network daemon for HyperLogLogs☆448Updated 4 years ago
- a small, lightweight pre-forking container☆21Updated 4 years ago
- Fork of Hustle - Originally developed at Chango - A column oriented, embarrassingly distributed relational event database.☆44Updated 10 years ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- A consistent-hashing relay for statsd and carbon metrics☆101Updated 4 years ago
- A column oriented, embarrassingly distributed relational event database.☆240Updated 7 years ago
- ScalienDB is a scalable, replicated datastore.☆86Updated 12 years ago
- A python RPC client stack☆45Updated 3 years ago
- Tools for working with parquet, impala, and hive☆134Updated 4 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 12 years ago
- An implementation of the HyperLogLog algorithm backed by Redis☆172Updated 9 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics☆59Updated 7 years ago
- ☆43Updated 3 years ago
- A synchronized bounded message queue built on Redis.☆83Updated 10 years ago
- counters and logarithmically bucketed histograms for distributed systems☆84Updated 7 years ago
- F@#$*&%Q (Message queue that is fast, brokered, in C and gets out of your way)☆284Updated 4 months ago
- Github mirror of "analytics/kafkatee" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆21Updated last year
- Binary of pullcontainer☆10Updated 10 years ago
- Last-seen sketch implementation in Go☆16Updated 4 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Utils around luigi.☆66Updated 4 years ago
- Fast Protocol Buffers module for Python☆40Updated 9 years ago
- Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It …☆133Updated 11 years ago
- A tool for executing scripts when ZooKeeper nodes change.☆66Updated 14 years ago
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- Zookeeper CLI designed to be fast, easy to install, and Unix-friendly☆59Updated 9 years ago
- Personal site for sharing open source stuff.☆10Updated 9 months ago
- Storm Spout + Kafka State Inspector☆58Updated 5 years ago
- Pure Python CDB reader/writer☆44Updated last year
- A Python HTTP process management utility.