clarkgrubb / data-tools
File format conversion tools
☆291 Updated 4 years ago
Alternatives and similar repositories for data-tools
Users that are interested in data-tools are comparing it to the libraries listed below
- Automatically exported from code.google.com/p/crush-tools ☆150 Updated 9 years ago
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines. ☆238 Updated 5 months ago
- Randomly sample lines from a csv, tsv, or other line-based data file ☆125 Updated 10 years ago
- Schemas to convert common fixed-width file formats into CSV using in2csv. ☆124 Updated 4 years ago
- Tool for visual exploration of complex data. ☆191 Updated 6 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily ☆111 Updated 9 years ago
- A framework (command line tool + libraries) for creating flexible compute pipelines ☆56 Updated 4 years ago
- Python library and command line tool for converting data from one format to another ☆99 Updated 5 years ago
- Extract tables from PDF files ☆357 Updated 9 years ago
- Generate a diff between two tabular datasets expressed in CSV files. ☆131 Updated 4 years ago
- Qualitative visualization of the data types of CSV files ☆257 Updated 10 years ago
- A (comprehensive) collection of open source tools used by the data community. ☆51 Updated 9 years ago
- A converter that generates a bash one-liner from an SQL Select query (no DB necessary) ☆291 Updated 9 years ago
- The (large) data files needed for the Data Science Toolkit project ☆233 Updated 12 years ago
- Utilities for processing tab-separated files ☆128 Updated 5 years ago
- A library for extracting tables from PDF files ☆89 Updated 11 years ago
- Docker images for data science from Wise.io ☆50 Updated 9 years ago
- Data Pipes for CSV ☆116 Updated 2 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py ☆390 Updated 2 years ago
- Utils around luigi. ☆66 Updated 4 years ago
- Like awk, but with SQL and table joins ☆315 Updated 8 months ago
- An interactive map of Stack Exchange tags for all sites. ☆126 Updated last year
- A wrapper around gitpython to produce pandas dataframes for analysis ☆191 Updated 3 weeks ago
- Data workflow tool, like a "Make for data" ☆1,485 Updated 3 years ago
- Import tables from any Wikipedia article as a dataset in Python ☆292 Updated 3 years ago
- Open source large document set visualization platform ☆269 Updated 2 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.** ☆152 Updated 8 years ago
- A desktop CSV editor for data publishers ☆287 Updated 2 years ago
- Keshif - Data Made Explorable (Prototype) ☆457 Updated 7 years ago
- A complete environment for busy polyglot data scientists ☆471 Updated 4 years ago