petewarden / dstkdataLinks
The (large) data files needed for the Data Science Toolkit project
☆232Updated 12 years ago
Alternatives and similar repositories for dstkdata
Users that are interested in dstkdata are comparing it to the libraries listed below
Sorting:
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- Like awk, but with SQL and table joins☆315Updated last year
- File format conversion tools☆292Updated 3 months ago
- Transform nested JSON data into tabular data in the shell.☆291Updated 7 years ago
- Enables common unix utlities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines☆450Updated 2 years ago
- Num: number utilities for mathematics☆134Updated 2 years ago
- commandline tools for slicing and dicing JSON records.☆304Updated 5 years ago
- Elastic tabstops for Rust.☆269Updated 2 months ago
- Remove bad records from a CSV file and normalize☆57Updated 3 years ago
- A system to programmatically run data pipelines☆225Updated 2 weeks ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆288Updated 5 years ago
- Dataframe structure and operations in Rust☆145Updated 7 years ago
- Quick and dirty statistics tool for the UNIX pipeline☆61Updated 8 years ago
- The tool I used to write my book, Effective Python.☆83Updated 7 years ago
- A Python data analysis library that is optimized for humans instead of machines.☆1,195Updated last week
- Rename anything☆367Updated last month
- Select elements from large XML files, fast.☆54Updated 11 months ago
- cha(rs) is a commandline tool to display information about unicode characters☆187Updated last month
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆242Updated last week
- GNU-alike tools for parsing RFC 4180 CSVs at high speed.☆107Updated 2 months ago
- A utility for sorting really big files. http://kmkeen.com/gz-sort/☆94Updated 7 years ago
- Data workflow tool, like a "Make for data"☆1,484Updated 3 years ago
- Create APIs out of public datasources☆89Updated 7 years ago
- Source files for "An Introduction to VisiData"☆76Updated 9 months ago
- Say "ni" to data of any size☆85Updated last week
- public mise ( http://en.wikipedia.org/wiki/Mise_en_place )☆100Updated 2 weeks ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- Making Data, the DataMade Way☆290Updated 4 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- fzz makes your command line interactive!☆201Updated 9 years ago