petewarden / dstkdata
The (large) data files needed for the Data Science Toolkit project
☆233 Updated 12 years ago
Alternatives and similar repositories for dstkdata
Users that are interested in dstkdata are comparing it to the libraries listed below
- Like awk, but with SQL and table joins ☆315 Updated 10 months ago
- Automatically exported from code.google.com/p/crush-tools ☆150 Updated 9 years ago
- Enables common unix utilities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines ☆451 Updated 2 years ago
- File format conversion tools ☆291 Updated 2 months ago
- Num: number utilities for mathematics ☆134 Updated 2 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool… ☆288 Updated 5 years ago
- Elastic tabstops for Rust. ☆268 Updated 3 weeks ago
- Quick and dirty statistics tool for the UNIX pipeline ☆61 Updated 8 years ago
- Remove bad records from a CSV file and normalize ☆57 Updated 3 years ago
- Transform nested JSON data into tabular data in the shell. ☆289 Updated 7 years ago
- commandline tools for slicing and dicing JSON records. ☆304 Updated 5 years ago
- A system to programmatically run data pipelines ☆223 Updated last week
- nifty command line date and time utilities; fast date calculations and conversion in the shell ☆638 Updated 2 weeks ago
- Select elements from large XML files, fast. ☆54 Updated 10 months ago
- Ben Franklin-esque Schedule in LaTeX ☆16 Updated 9 years ago
- The tool I used to write my book, Effective Python. ☆83 Updated 7 years ago
- Command-line tool for manipulating CSV data ☆74 Updated 7 years ago
- PAWK - A Python line processor (like AWK) ☆524 Updated last year
- Dataframe structure and operations in Rust ☆145 Updated 7 years ago
- paexec - distributes tasks over network or CPUs ☆65 Updated last year
- cha(rs) is a commandline tool to display information about unicode characters ☆187 Updated this week
- A utility for sorting really big files. http://kmkeen.com/gz-sort/ ☆94 Updated 7 years ago
- An efficient way to filter duplicate lines from input, à la uniq. ☆218 Updated last year
- Fast tar archiver ☆107 Updated 3 years ago
- A converter that generates a bash one-liner from an SQL Select query (no DB necessary) ☆291 Updated 9 years ago
- A self-documenting build automation tool ☆268 Updated 5 years ago
- Shell supporting pipelines to and from multiple processes ☆357 Updated last year
- Say "ni" to data of any size ☆85 Updated 2 months ago
- Convert an XML input to a JSON output, using xml-mapping ☆162 Updated 9 years ago
- Search lots of data sets for spurious correlations ☆61 Updated 3 years ago