OpenRefine / refine-pythonLinks
Python client library for controlling Google Refine
☆42Updated 12 years ago
Alternatives and similar repositories for refine-python
Users that are interested in refine-python are comparing it to the libraries listed below
Sorting:
- A data processing pipeline that schedules and runs content harvesters, normalizes their data, and outputs that normalized data to a varie…☆42Updated 9 years ago
- The OpenRefine Python Client Library provides an interface to communicating with an OpenRefine server.☆180Updated 6 years ago
- Definitions of Pardon jargon to help Python beginners understand Pythonista gobbletigook☆55Updated 5 years ago
- Python client library for controlling Google Refine☆83Updated 8 years ago
- WaterButler is a Python web application for interacting with various file storage services via a single RESTful API, developed at Center …☆62Updated 3 weeks ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 4 years ago
- Python package for data.world☆101Updated last year
- Parser and standardizer for politician, individual and organization names.☆128Updated 8 years ago
- ScraperWiki Python library for scraping and saving data; in maintenance mode☆158Updated last week
- Python library with common functionality for writing web scrapers☆102Updated 10 years ago
- Python library and command line tool for converting data from one format to another☆99Updated 5 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Open Knowledge coding standards and style guide.☆35Updated 6 years ago
- Tools for parsing and querying Wikimedia Foundation pageview data from both static dumps and the online API.☆66Updated 3 years ago
- Find which links on a web page are pagination links☆29Updated 9 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Manage and load dataprotocols.org Data Packages☆27Updated 10 years ago
- Tools for parsing messy tabular data. This is now superseded by https://github.com/frictionlessdata/tabulator-py☆392Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- A Python library for working with Data Packages.☆191Updated last year
- a Simple API for RDF☆29Updated 16 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated last year
- Named-Entity Recognition extension for Google Refine / OpenRefine☆73Updated 8 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- smappPy addresses common tasks programmers dealing with lots of data☆25Updated 9 years ago
- Framework for processing data packages in pipelines of modular components.☆123Updated 7 months ago
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Updated 9 years ago
- A Binder-compatible repo with a requirements.txt file☆26Updated 8 years ago
- Tools for text tokenization and encoding☆84Updated 4 years ago