clarkgrubb / data-tools
File format conversion tools
☆292 · Updated 5 months ago
Alternatives and similar repositories for data-tools
Users who are interested in data-tools are comparing it to the libraries listed below.
- Convert an XML input to a JSON output, using xml-mapping ☆161 · Updated 9 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file ☆125 · Updated 10 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv. ☆125 · Updated 4 years ago
- The (large) data files needed for the Data Science Toolkit project ☆233 · Updated 12 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines. ☆242 · Updated 3 weeks ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily ☆111 · Updated 10 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py ☆392 · Updated 2 years ago
- A (comprehensive) collection of open source tools used by the data community. ☆52 · Updated 10 years ago
- Like awk, but with SQL and table joins ☆315 · Updated last year
- Tool for visual exploration of complex data. ☆194 · Updated 7 years ago
- A framework (command line tool + libraries) for creating flexible compute pipelines ☆56 · Updated 5 years ago
- Extract tables from PDF files ☆359 · Updated 9 years ago
- Command-line tool for manipulating CSV data ☆74 · Updated 7 years ago
- A converter that generates a bash one-liner from an SQL Select query (no DB necessary) ☆295 · Updated 9 years ago
- A library for extracting tables from PDF files ☆89 · Updated 12 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc. ☆59 · Updated 4 years ago
- Python library and command line tool for converting data from one format to another ☆99 · Updated 5 years ago
- Qualitative visualization of the data types of CSV files ☆258 · Updated 11 years ago
- ☆92 · Updated 10 years ago
- Utilities for processing tab-separated files ☆134 · Updated 6 years ago
- Docker images for data science from Wise.io ☆51 · Updated 9 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning. ☆51 · Updated 10 years ago
- Utils around luigi. ☆66 · Updated 5 months ago
- ☆34 · Updated 9 years ago
- Collecting thoughts about data versioning ☆108 · Updated 6 years ago
- Sample repo for luigi tasks & config ☆36 · Updated 9 years ago
- ☆84 · Updated 7 years ago
- Material for some talks I have given ☆61 · Updated last year
- GNU-alike tools for parsing RFC 4180 CSVs at high speed. ☆108 · Updated 5 months ago
- Code for Pythonic visualization blog post ☆40 · Updated 8 years ago