erikfrey / bashreduceLinks
mapreduce in bash
☆921Updated 5 years ago
Alternatives and similar repositories for bashreduce
Users that are interested in bashreduce are comparing it to the libraries listed below
Sorting:
- Short, simple, direct scripts for creating ASCII graphical histograms in the terminal.☆458Updated 3 years ago
- C network daemon for HyperLogLogs☆450Updated 4 years ago
- A single-command bittorrent distribution system, based on Twitter's Murder☆412Updated 8 years ago
- File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.☆228Updated 4 years ago
- Like awk, but with SQL and table joins☆315Updated 10 months ago
- Distributed Named Pipes☆455Updated 8 years ago
- commandline tools for slicing and dicing JSON records.☆304Updated 5 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆288Updated 5 years ago
- C network daemon for bloom filters☆1,249Updated 2 years ago
- Command line utilities for data analysis☆1,938Updated last year
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- Jshon is a JSON parser designed for maximum convenience within the shell.☆392Updated 2 years ago
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆507Updated 7 years ago
- Large-scale Monitoring and Trend Analysis System☆245Updated 3 weeks ago
- Gremlins is a python framework for fault-testing distributed systems☆123Updated 11 years ago
- Create an index on a compressed text file☆637Updated 2 years ago
- Fast Web log analyzer using probabilistic data structures☆388Updated 6 months ago
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆583Updated 11 years ago
- A terminal-only version of Sumo written in Go☆327Updated 7 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆292Updated 3 years ago
- ScalienDB is a scalable, replicated datastore.☆87Updated 12 years ago
- Scalable C - The Book☆352Updated 2 years ago
- ☆172Updated 10 years ago
- Connect UNIX pipes and message queues☆438Updated 7 years ago
- Shell supporting pipelines to and from multiple processes☆335Updated last year
- Implementations of a data structure with false negatives but no false positives.☆358Updated last year
- One click deploy for Storm clusters on AWS☆516Updated 10 years ago
- A tool for executing scripts when ZooKeeper nodes change.☆66Updated 14 years ago
- A consistent-hashing relay for statsd and carbon metrics☆101Updated 4 years ago