gigisr / data_etl
☆10, updated 4 years ago
Alternatives and similar repositories for data_etl:
Users who are interested in data_etl are comparing it to the libraries listed below:
- A browser user interface for manual labeling of record pairs. ☆42, updated last year
- A simple command line interface to the datamade/dedupe library. ☆42, updated 2 years ago
- A maximum-strength name parser for record linkage. ☆36, updated 5 months ago
- Set-oriented Operations in Pandas. ☆24, updated 4 years ago
- Pipeline library. ☆12, updated 6 years ago
- Today I Learned Some Computer Stuff. ☆39, updated 6 years ago
- Comparing Polars to Pandas and a small introduction. ☆43, updated 3 years ago
- Public repository for versioning machine learning data. ☆42, updated 3 years ago
- Python package for performing deduplication using flexible text matching and cleaning in pandas DataFrames. ☆25, updated 4 years ago
- ☆15, updated 6 years ago
- Python wrapper for a C++ Double Metaphone. ☆15, updated last year
- stemgraphic Python package for visualization of data and text. ☆17, updated 3 years ago
- This repo contains the draft, images, and code for the Medium blog post on Altair themes. ☆12, updated 6 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping. ☆17, updated 9 years ago
- ☆13, updated 5 years ago
- Compilation of Vega-Lite & Altair Tutorials. ☆23, updated last year
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors. ☆52, updated 3 years ago
- Intro to Testing in Data Science Tutorial. ☆35, updated 2 years ago
- Interactive notebooks containing demonstration code of the splink library. ☆37, updated 11 months ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration. ☆34, updated 4 years ago
- Predict age and gender from a first name. ☆60, updated 6 years ago
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases. ☆39, updated 4 months ago
- Aho-Corasick string replacement utility. ☆24, updated 5 years ago
- A pedagogical implementation of panel apps served up on a remote machine. ☆14, updated 3 years ago
- Python library for Ceteris Paribus Plots (What-if plots). ☆19, updated 3 years ago
- A Python module that checks for package updates. ☆28, updated 3 years ago
- A web application that identifies party in political discourse and an example of operationalized machine learning. ☆28, updated 6 years ago
- Data exploration library with a pandas-like API. ☆74, updated 4 years ago
- Hidden alignment conditional random field for classifying string pairs. ☆25, updated 3 months ago
- Scalable String Similarity Joins in Python. ☆38, updated 6 months ago