okfn / messytablesLinks
Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py
☆390Updated 2 years ago
Alternatives and similar repositories for messytables
Users that are interested in messytables are comparing it to the libraries listed below
Sorting:
- A Python library for creating fast, repeatable and self-documenting data analysis pipelines.☆238Updated 3 months ago
- Python library for reading and writing tabular data via streams.☆237Updated 4 years ago
- ☆84Updated 7 years ago
- A Python library for working with Table Schema.☆264Updated 6 months ago
- Parse, normalize and render postal addresses.☆184Updated last year
- Parser and standardizer for politician, individual and organization names.☆129Updated 8 years ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Tools for generating CSV and other flat versions of the structured data☆107Updated last month
- Creating Rickshaw.js visualizations with Python Pandas☆265Updated 8 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- SQLCell is a magic function for the Jupyter Notebook that executes raw, parallel, parameterized SQL queries with the ability to accept Py…☆151Updated 2 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors.☆62Updated last year
- Magic functions for using Jupyter Notebook with Apache Spark and a variety of SQL databases.☆171Updated 6 years ago
- A Python toolkit for processing tabular data☆417Updated 3 months ago
- mito ETL tool☆163Updated 4 years ago
- NOTE: a magic we developed at The Data Incubator from this basis: https://github.com/thedataincubator/ihtml☆88Updated 9 years ago
- Declarative statistical visualization library for Python☆237Updated 6 years ago
- A Python data analysis library that is optimized for humans instead of machines.☆1,181Updated 3 months ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- Data analysis and reporting tool for quick access to custom charts and tables in Jupyter Notebooks and in the shell.☆120Updated last year
- Dump (freeze) SQL query results from a database into a selection of file formats☆92Updated 6 years ago
- Partitioned storage system based on blosc. **No longer actively maintained.**☆152Updated 8 years ago
- E. Tufte slope graph implementation in Python☆140Updated 9 years ago
- PyData Seattle 2015: Python Data Bikeshed☆127Updated 9 years ago
- Dedupe/batch geocode addresses and venues around the world with libpostal☆83Updated 3 years ago
- Analyzes a CSV file and generates database table schema, all within the browser☆316Updated 9 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- The OpenRefine Python Client Library provides an interface to communicating with an OpenRefine server.☆178Updated 5 years ago
- a python library for parsing unstructured western names into name components.☆607Updated 2 weeks ago