google / crush-toolsLinks
Automatically exported from code.google.com/p/crush-tools
☆150Updated 9 years ago
Alternatives and similar repositories for crush-tools
Users that are interested in crush-tools are comparing it to the libraries listed below
Sorting:
- commandline tools for slicing and dicing JSON records.☆303Updated 4 years ago
- Like awk, but with SQL and table joins☆314Updated 6 months ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆287Updated 5 years ago
- Transform nested JSON data into tabular data in the shell.☆288Updated 7 years ago
- A terminal-only version of Sumo written in Go☆328Updated 6 years ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- A utility for sorting really big files. http://kmkeen.com/gz-sort/☆94Updated 6 years ago
- The (large) data files needed for the Data Science Toolkit project☆232Updated 11 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆304Updated 7 years ago
- Convert an XML input to a JSON output, using xml-mapping☆162Updated 8 years ago
- mapreduce in bash☆920Updated 5 years ago
- Num: number utilities for mathematics☆133Updated last year
- Enables common unix utlities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines☆447Updated last year
- Timberlake is a Job Tracker for Hadoop.☆177Updated 5 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- Create an index on a compressed text file☆631Updated 2 years ago
- File-Based Map-Reduce. Zero-install: easily use any collection of computers as a map-reduce cluster for command-line analytics.☆227Updated 4 years ago
- File format conversion tools☆291Updated 4 years ago
- utilities to assist running periodic batch processing jobs☆119Updated last year
- Implementations of a data structure with false negatives but no false positives.☆358Updated last year
- tutorial for shellfire☆53Updated 7 years ago
- A Directed Acyclic Graph task dependency scheduler designed to simplify complex distributed pipelines☆131Updated 6 years ago
- [DEPRECATED] Simple local encrypted credential management with GPG 🔐☆125Updated 8 years ago
- Create APIs out of public datasources☆89Updated 7 years ago
- Quick and dirty statistics tool for the UNIX pipeline☆61Updated 8 years ago
- An inverted trigram index for accelerated string matching in Sqlite.☆78Updated 11 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- Compilation and rule-based optimization framework for relational algebra. Raco is the language, optimization, and query translation layer…☆72Updated 7 years ago
- Utils around luigi.☆66Updated 4 years ago
- A simple data consistency checker☆30Updated 8 years ago