clarkgrubb / data-tools
File format conversion tools
☆292Updated 3 years ago
Alternatives and similar repositories for data-tools:
Users that are interested in data-tools are comparing it to the libraries listed below
- Convert an XML input to a JSON output, using xml-mapping☆162Updated 8 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆239Updated 3 weeks ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆123Updated 10 years ago
- Automatically exported from code.google.com/p/crush-tools☆150Updated 9 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆315Updated 8 years ago
- Utils around luigi.☆65Updated 4 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆112Updated 9 years ago
- Bringing the python data stack to the shell prompt☆789Updated 4 years ago
- commandline tools for slicing and dicing JSON records.☆303Updated 4 years ago
- command line tool to convert json to csv☆808Updated last year
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆124Updated 3 years ago
- Command-line tool for manipulating CSV data☆75Updated 7 years ago
- Enables common unix utlities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines☆448Updated last year
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Like awk, but with SQL and table joins☆311Updated 3 months ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 8 years ago
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆172Updated 6 years ago
- Transform nested JSON data into tabular data in the shell.☆286Updated 7 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 4 years ago
- ☆84Updated 7 years ago
- A (comprehensive) collection of open source tools used by the data community.☆51Updated 9 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Web application for creating publication ready charts and graphs☆56Updated 6 years ago
- Vertica CLI with auto-completion and syntax highlighting☆76Updated 8 years ago
- Qualitative visualization of the data types of CSV files☆256Updated 10 years ago
- Docker images for data science from Wise.io☆50Updated 9 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆389Updated last year