erikfrey / bashreduceLinks
mapreduce in bash
☆922Updated 6 years ago
Alternatives and similar repositories for bashreduce
Users that are interested in bashreduce are comparing it to the libraries listed below
Sorting:
- File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.☆229Updated 4 years ago
- A single-command bittorrent distribution system, based on Twitter's Murder☆410Updated 9 years ago
- Short, simple, direct scripts for creating ASCII graphical histograms in the terminal.☆458Updated 4 years ago
- Like awk, but with SQL and table joins☆315Updated last year
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆507Updated 7 years ago
- Large-scale Monitoring and Trend Analysis System☆246Updated 3 months ago
- C network daemon for HyperLogLogs☆451Updated 4 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆288Updated 5 years ago
- commandline tools for slicing and dicing JSON records.☆304Updated 5 years ago
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- Distributed Named Pipes☆455Updated 8 years ago
- Jshon is a JSON parser designed for maximum convenience within the shell.☆392Updated 2 years ago
- ScalienDB is a scalable, replicated datastore.☆87Updated 12 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆583Updated 11 years ago
- (DEPRECATED. This project is no longer used or maintained at LiveRamp.) Hank is a high performance distributed key-value NoSQL database t…☆175Updated 5 years ago
- Honu is a large scale data collection and processing pipeline☆83Updated 14 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆293Updated 3 years ago
- A RFC of a syslog replacement☆55Updated 9 years ago
- OAuth wrapper for cURL on the command line☆119Updated 8 years ago
- ☆171Updated 10 years ago
- Distributed database specialized in exporting key/value data from Hadoop☆558Updated 11 years ago
- Connect UNIX pipes and message queues☆438Updated 7 years ago
- scaling, counting, bloom filter library☆967Updated 6 years ago
- Fast Web log analyzer using probabilistic data structures☆390Updated 8 months ago
- Tiny data structures that pack a punch!☆101Updated 13 years ago
- Create an index on a compressed text file☆649Updated 2 years ago
- scales - Metrics for Python☆920Updated 2 years ago
- JSON in your Bash scripts☆584Updated 5 years ago
- S4 repository☆140Updated 14 years ago
- Gremlins is a python framework for fault-testing distributed systems☆123Updated 11 years ago