erikfrey / bashreduce
mapreduce in bash
☆920Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for bashreduce
- Distributed Named Pipes☆453Updated 7 years ago
- A single-command bittorrent distribution system, based on Twitter's Murder☆412Updated 8 years ago
- Like awk but with SQL and table joins☆310Updated 6 months ago
- Short, simple, direct scripts for creating ASCII graphical histograms in the terminal.☆456Updated 3 years ago
- Jshon is a JSON parser designed for maximum convenience within the shell.☆385Updated last year
- Distributed database specialized in exporting key/value data from Hadoop☆558Updated 10 years ago
- Create an index on a compressed text file☆622Updated last year
- MADlib has moved to Apache MADlib (incubating). Please send pull requests to the Apache repository.☆507Updated 6 years ago
- Hadoop library for large-scale data processing, now an Apache Incubator project☆584Updated 10 years ago
- C network daemon for HyperLogLogs☆449Updated 3 years ago
- GoldenOrb is an open-source implementation of Pregel, Google's graph processing framework☆293Updated 2 years ago
- Lightning-fast cluster computing in Java, Scala and Python.☆1,425Updated 10 years ago
- DRY and RAD for regular expressions and then some.☆244Updated 2 years ago
- C network daemon for bloom filters☆1,237Updated last year
- eliminate bugs and weeds from shell scripts☆429Updated 7 years ago
- scales - Metrics for Python☆920Updated last year
- Berkeley Tree Database (BTrDB) server☆909Updated 3 years ago
- A probabilistic data structure service and storage☆771Updated 8 years ago
- commandline tools for slicing and dicing JSON records.☆300Updated 4 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆284Updated 4 years ago
- Bash on Balls☆860Updated 8 years ago
- Implementations of a data structure with false negatives but no false positives.☆354Updated 11 months ago
- Large-scale Monitoring and Trend Analysis System☆245Updated 9 months ago
- Shell supporting pipelines to and from multiple processes☆327Updated 6 months ago
- Python module that allows one to easily write and run Hadoop programs.☆1,035Updated 6 years ago
- A terminal-only version of Sumo written in Go☆327Updated 6 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation☆338Updated 13 years ago