mshearer0 / HandsOnEntityResolution
This repository accompanies Hands On Entity Resolution by O'Reilly
☆29 · Updated last year
Alternatives and similar repositories for HandsOnEntityResolution
Users who are interested in HandsOnEntityResolution are comparing it to the repositories listed below.
- Code and materials for Effective Polars book ☆83 · Updated last year
- Course materials for our "Getting Started with NLP and spaCy" course at Talk Python ☆38 · Updated 11 months ago
- Intro to Polars Tutorial ☆22 · Updated 2 years ago
- SQLMesh example projects ☆39 · Updated 7 months ago
- Check the basic quality of any dataset ☆12 · Updated 4 years ago
- Datasets for ML, Analysis, etc ☆62 · Updated 9 months ago
- ☆18 · Updated 2 years ago
- Pandas Training © MetaSnake 2022, CC BY-NC ☆18 · Updated 3 years ago
- Full stack data engineering tools and infrastructure set-up ☆57 · Updated 4 years ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill ☆21 · Updated 2 years ago
- A Beginner's Guide to DuckDB's Python Client ☆42 · Updated last year
- Good Practice Tables - an XlsxWriter wrapper to write consistently formatted statistical tables to Excel. ☆44 · Updated 2 months ago
- sktime - python toolbox for time series: pipelines and transformers ☆25 · Updated 3 years ago
- This is a simple analytic project using DuckDB & dbt with air quality data. ☆24 · Updated last year
- (WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications ☆129 · Updated 2 years ago
- Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide ☆303 · Updated last month
- Code repository for the "PySpark in Action" book ☆211 · Updated 7 months ago
- Data Analysis with Polars, Published by Packt ☆32 · Updated last year
- A Python Environment Template for VScode with UV ☆81 · Updated 4 months ago
- Applied Computational Thinking with Python, published by Packt ☆48 · Updated 2 years ago
- Getting started with DuckDB, by Packt Publishing ☆69 · Updated last year
- ☆12 · Updated 3 years ago
- Cleaning Data for Effective Data Science, published by Packt ☆102 · Updated last month
- Cost Efficient Data Pipelines with DuckDB ☆61 · Updated 8 months ago
- Graph Data Modeling in Python, by Packt Publishing ☆43 · Updated 10 months ago
- Interactive notebooks containing demonstration code of the splink library ☆40 · Updated 2 years ago
- Material for PyData NYC Tutorial on Large Scale Timeseries Forecasting ☆27 · Updated 2 years ago
- Files for my "Pandas Workout" book ☆101 · Updated last year
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆118 · Updated 2 months ago
- ☆23 · Updated last year