mshearer0 / HandsOnEntityResolution
This repository accompanies Hands On Entity Resolution by O'Reilly
☆27 · Updated last year
Alternatives and similar repositories for HandsOnEntityResolution
Users that are interested in HandsOnEntityResolution are comparing it to the libraries listed below
- Intro to Polars Tutorial ☆22 · Updated 2 years ago
- Code and materials for Effective Polars book ☆83 · Updated last year
- Course materials for our "Getting Started with NLP and spaCy" course at Talk Python ☆38 · Updated 7 months ago
- Prototype search engine for ONS bulletins ☆24 · Updated 2 weeks ago
- SQLMesh example projects ☆36 · Updated 4 months ago
- Construct a network graph to explore and visualize how people connect in an organisation ☆19 · Updated last year
- Datasets for ML, Analysis, etc ☆62 · Updated 6 months ago
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python. ☆15 · Updated 3 weeks ago
- ☆15 · Updated 3 years ago
- (WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications ☆128 · Updated 2 years ago
- A minimal example to deploy a Streamlit Application in GCP Cloud Run. ☆11 · Updated 4 years ago
- Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide ☆268 · Updated last month
- This repo is for the LinkedIn Learning course: Data Pipeline Automation with GitHub Actions ☆57 · Updated this week
- Good Practice Tables - an XlsxWriter wrapper to write consistently formatted statistical tables to Excel. ☆40 · Updated last week
- PyData London 2022 sktime workshop ☆11 · Updated 2 years ago
- Demo of DuckDB with the dashboarding tools Evidence, Streamlit and Rill ☆21 · Updated last year
- ☆17 · Updated 2 years ago
- This is a simple analytic project using DuckDB & dbt with air quality data. ☆23 · Updated last year
- A repository of runnable examples using ibis ☆46 · Updated last year
- Graph Data Modeling in Python, by Packt Publishing ☆43 · Updated 7 months ago
- Sentiment and language detection for text analytics. ☆17 · Updated last year
- ☆11 · Updated 2 years ago
- Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc. ☆50 · Updated 3 months ago
- Check the basic quality of any dataset ☆11 · Updated 4 years ago
- A downloadable PDF containing a summary of frequently used pandas operations. ☆10 · Updated 5 years ago
- sktime - python toolbox for time series: pipelines and transformers ☆25 · Updated 2 years ago
- A Beginner's Guide to DuckDB's Python Client ☆42 · Updated last year
- Full stack data engineering tools and infrastructure set-up ☆57 · Updated 4 years ago
- Interactive notebooks containing demonstration code of the splink library ☆40 · Updated last year
- Apache Airflow Best Practices, published by Packt ☆49 · Updated 11 months ago