gigisr / data_etl
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for data_etl
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- ☆15Updated 6 years ago
- A browser user interface for manual labeling of record pairs.☆41Updated last year
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- Today I Learned Some Computer Stuff☆39Updated 6 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- pipeline library☆12Updated 6 years ago
- python package for performing deduplication using flexible text matching and cleaning in pandas dataframe☆25Updated 3 years ago
- Public repository for versioning machine learning data☆42Updated 2 years ago
- Describe your scikit-learn estimators for posterity!☆15Updated 7 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- Generate Pandas frames, load and extract data, based on JSON Table Schema descriptors.☆52Updated 3 years ago
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆72Updated 6 years ago
- Data exploration library with a pandas-like API☆74Updated 4 years ago
- An Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆16Updated 4 years ago
- Pipeline Explorer - Explore and analyze millions of pipelines learned using MLBlocks and MLPrimitives.☆17Updated last year
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- (Archived) A Python library for record linkage and deduplication.☆19Updated 8 months ago
- Python library to infer date format from examples☆42Updated 3 years ago
- Compilation of Vega-Lite & Altair Tutorials☆24Updated last year
- Python package for Bayesian & Frequentist A/B Testing☆12Updated last year
- ☆29Updated 2 years ago
- A list of resources for current and aspiring data science managers☆15Updated 3 years ago
- this repo contains the draft, images, and code for the Medium blog post on altair themes.☆12Updated 6 years ago
- Visualize uncertainty☆27Updated last year
- A xlsx and html rendering library for rendering data available in Pandas DataFrames.☆27Updated 6 months ago
- ☆16Updated 2 months ago
- ☆70Updated last year
- Python library for Ceteris Paribus Plots (What-if plots)☆19Updated 3 years ago