OpenRefine / refine-python
Python client library for controlling Google Refine
☆39Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for refine-python
- Multidimensional data explorer and visualization tool.☆52Updated 7 years ago
- Python binding for gumbo-parser using Cython☆14Updated 8 years ago
- The OpenRefine Python Client Library provides an interface to communicating with an OpenRefine server.☆177Updated 5 years ago
- A Topic Modeling toolbox☆93Updated 8 years ago
- Demo code for learning_text_transformer☆25Updated 9 years ago
- BatchRefine adds batch processing capabilities to OpenRefine☆50Updated 7 years ago
- Python client library for controlling Google Refine☆83Updated 7 years ago
- a Simple API for RDF☆29Updated 15 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 9 years ago
- Named-Entity Recognition extension for Google Refine / OpenRefine☆72Updated 7 years ago
- Execute OpenRefine JSON scripts without OpenRefine (or Java)☆29Updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- A data processing pipeline that schedules and runs content harvesters, normalizes their data, and outputs that normalized data to a varie…☆41Updated 8 years ago
- Beginner's Guide to Machine Learning Competitions, EuroPython 2015, Tutorial☆29Updated 8 years ago
- Materials for the workshop Advanced Text Analysis with SpaCy and Scikit-Learn, given at NYU during NYCDH Week 2017, at PyData NYC in Nov.…☆82Updated last year
- Manage and load dataprotocols.org Data Packages☆27Updated 9 years ago
- Free-for-all repository of TEI and plain text files for you (to do cool stuff) provided by the Digital Collections Services group at the …☆27Updated 7 years ago
- pyfpds is a python wrapper around the FPDS ATOM feed☆13Updated 5 years ago
- Partial result caching for pandas in Python.☆18Updated 5 years ago
- These are the IPython notebook files for the CSC 432 Spring '13 course.☆23Updated 9 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 12 years ago
- Stream processing in Python of twitter searches using public APIs.☆9Updated 8 years ago
- Building Python Data Application Tutorials☆23Updated 3 months ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Definitions of Pardon jargon to help Python beginners understand Pythonista gobbletigook☆53Updated 4 years ago
- Data science tools from Moz☆22Updated 7 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research.☆86Updated 6 years ago
- Parser and standardizer for politician, individual and organization names.☆128Updated 7 years ago
- Concept discovery and recommendation library built on top of the IBM Watson cognitive API.☆24Updated 8 years ago