mfisk / filemap
File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.
☆227 Updated 4 years ago
Alternatives and similar repositories for filemap:
Users who are interested in filemap are comparing it to the repositories listed below.
- developer repository for https://github.com/fuse-kafka/fuse_kafka ☆27 Updated 9 years ago
- Fast Protocol Buffers module for Python ☆40 Updated 9 years ago
- A column oriented, embarrassingly distributed relational event database. ☆240 Updated 7 years ago
- Tail a log file and send log lines automatically to a kafka topic ☆57 Updated 12 years ago
- Fork of Hustle - Originally developed at Chango - A column oriented, embarrassingly distributed relational event database. ☆44 Updated 10 years ago
- C network daemon for HyperLogLogs ☆448 Updated 4 years ago
- Timberlake is a Job Tracker for Hadoop. ☆177 Updated 5 years ago
- a small, lightweight pre-forking container ☆21 Updated 4 years ago
- A consistent-hashing relay for statsd and carbon metrics ☆101 Updated 4 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained. ☆49 Updated 5 years ago
- Last-seen sketch implementation in Go ☆16 Updated 4 years ago
- Apache Mesos Platform as a Service Deploy ☆21 Updated 8 years ago
- Mesos scheduling framework for Changes. ☆16 Updated 8 years ago
- Collect local Mesos slave, underlying operating system and machine metrics and produce to Apache Kafka ☆20 Updated 9 years ago
- ☆60 Updated 3 years ago
- Create disposable redis servers on the fly for testing ☆16 Updated 9 years ago
- Simulating the performance of various streaming algorithms. #experimentalmathematics ☆59 Updated 7 years ago
- F@#$*&%Q (Message queue that is fast, brokered, in C and gets out of your way) ☆284 Updated 4 months ago
- Notes from VLDB conference ☆31 Updated 9 years ago
- Gremlins is a python framework for fault-testing distributed systems ☆122 Updated 10 years ago
- approximate streaming quantiles ☆31 Updated 10 years ago
- Exelixi is a distributed framework based on Apache Mesos, mostly implemented in Python using gevent for high-performance concurrency. It … ☆133 Updated 11 years ago
- Ranked Prefix Search for Large Data on External Memory optimized for Mobile with ZERO lag initialization time ☆16 Updated 6 years ago
- Triton/Manta DNS server over Apache Zookeeper ☆25 Updated 2 weeks ago
- mapreduce in bash ☆920 Updated 5 years ago
- ☆146 Updated 5 years ago
- A lightweight lumberjack protocol compliant logstash indexer ☆54 Updated 9 years ago
- Compute on demand in Docker containers ☆63 Updated 10 years ago
- distkv is a distributed K/V store library for Go powered by the raft consensus algorithm. ☆55 Updated 8 years ago
- ScalienDB is a scalable, replicated datastore. ☆86 Updated 12 years ago