clarkgrubb / data-toolsLinks
File format conversion tools
☆292Updated 4 months ago
Alternatives and similar repositories for data-tools
Users that are interested in data-tools are comparing it to the libraries listed below
Sorting:
- Convert an XML input to a JSON output, using xml-mapping☆162Updated 9 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆125Updated 4 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆242Updated 3 weeks ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆391Updated 2 years ago
- Command-line tool for manipulating CSV data☆74Updated 7 years ago
- The (large) data files needed for the Data Science Toolkit project☆232Updated 12 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆288Updated 5 years ago
- Tool for visual exploration of complex data.☆193Updated 7 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆111Updated 10 years ago
- A (comprehensive) collection of open source tools used by the data community.☆52Updated 10 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- Extract tables from PDF files☆359Updated 9 years ago
- A Python data analysis library that is optimized for humans instead of machines.☆1,195Updated 3 weeks ago
- A wrapper around gitpython to produce pandas dataframes for analysis☆190Updated 5 months ago
- Data Pipes for CSV☆115Updated 2 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆315Updated 9 years ago
- Generate a diff between two tabular datasets expressed in CSV files.☆132Updated 4 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- Guesses table DDL based on data☆276Updated 3 years ago
- Open source large document set visualization platform☆270Updated 2 years ago
- Code to transform Hillary's emails from raw PDF documents to a SQLite database☆161Updated 10 years ago
- Tools for generating CSV and other flat versions of the structured data☆109Updated 2 weeks ago
- A desktop CSV editor for data publishers☆287Updated 2 years ago
- Data workflow tool, like a "Make for data"☆1,484Updated 3 years ago
- JSON -> Relational DB Column Types☆63Updated 2 years ago
- Python package for data.world☆101Updated last year
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆171Updated 7 years ago