clarkgrubb / data-tools
File format conversion tools
☆291Updated 4 years ago
Alternatives and similar repositories for data-tools
Users who are interested in data-tools are comparing it to the libraries listed below
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆238Updated 3 months ago
- Like awk, but with SQL and table joins☆315Updated 6 months ago
- GNU-alike tools for parsing RFC 4180 CSVs at high speed.☆103Updated last year
- Enables common unix utilities like cut, awk, wc, head to work correctly with csv data containing delimiters and newlines☆447Updated last year
- Schemas to convert common fixed-width file formats into CSV using in2csv.☆124Updated 3 years ago
- Convert an XML input to a JSON output, using xml-mapping☆162Updated 8 years ago
- Command-line tool for manipulating CSV data☆75Updated 7 years ago
- Convert text from a file or from stdin into SQL table and query it instantly. Uses sqlite as backend. The idea is to make SQL into a tool…☆288Updated 5 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily☆111Updated 9 years ago
- Tool for visual exploration of complex data.☆191Updated 6 years ago
- A (comprehensive) collection of open source tools used by the data community.☆51Updated 9 years ago
- Bringing the python data stack to the shell prompt☆787Updated 4 years ago
- Analyze the structure and dynamics of an open source project's developer community, using graph algorithms, etc.☆58Updated 4 years ago
- Sample repo for luigi tasks & config☆36Updated 9 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- [DEPRECATED]☆55Updated 10 years ago
- Extract tables from PDF files☆357Updated 9 years ago
- A proofreader for your data☆694Updated 2 years ago
- knitpy: Elegant, flexible and fast dynamic report generation with python☆368Updated 4 years ago
- A library for extracting tables from PDF files☆89Updated 11 years ago
- A wrapper around gitpython to produce pandas dataframes for analysis☆190Updated this week
- Quick plotting and data visualization of pandas and numpy data.☆57Updated 8 years ago
- Refinery - A locally deployable open-source web platform for analysis of large document collections☆101Updated 8 years ago
- Keshif - Data Made Explorable (Prototype)☆457Updated 7 years ago
- Portland Python Meetup March 2015☆40Updated 10 years ago
- commandline tools for slicing and dicing JSON records.☆303Updated 4 years ago
- Run IPython, Pattern, NLTK, Pandas, NumPy, SciPy, Numba, Biopython inside Docker☆47Updated 10 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆390Updated 2 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆316Updated 9 years ago