mshearer0 / HandsOnEntityResolution
This repository accompanies the book Hands-On Entity Resolution, published by O'Reilly
☆22 Updated last year
Alternatives and similar repositories for HandsOnEntityResolution:
Users who are interested in HandsOnEntityResolution are comparing it to the repositories listed below
- Cost Efficient Data Pipelines with DuckDB ☆51 Updated 8 months ago
- sktime - Python toolbox for time series: pipelines and transformers ☆24 Updated 2 years ago
- Code and materials for Effective Polars book ☆76 Updated 11 months ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 7 months ago
- Data Analysis with Polars, Published by Packt ☆32 Updated 6 months ago
- Code for the blog post on data quality with Great Expectations ☆12 Updated 8 months ago
- SQLMesh example projects ☆26 Updated 4 months ago
- Intro to Polars Tutorial ☆23 Updated last year
- Blocking records for record linkage and data deduplication based on ANN algorithms in Python. ☆12 Updated 2 weeks ago
- A simple and easy to use Data Quality (DQ) tool built with Python. ☆50 Updated last year
- ☆15 Updated 3 years ago
- Jupyter Cell / Line Magics for DuckDB ☆47 Updated last month
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆11 Updated 10 months ago
- A pipeline to detect data drift and retrain the model when there is drift ☆23 Updated last year
- Linear regression in SQL using dbt ☆69 Updated 2 months ago
- ☆11 Updated 2 years ago
- Demo of DuckDB with dashboarding tools: Evidence, Streamlit, and Rill ☆16 Updated last year
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from… ☆33 Updated 2 months ago
- Course materials for our "Getting Started with NLP and spaCy" course at Talk Python ☆38 Updated 3 weeks ago
- ☆33 Updated 3 weeks ago
- ☆15 Updated 11 months ago
- Apache Airflow Best Practices, published by Packt ☆40 Updated 4 months ago
- Full stack data engineering tools and infrastructure set-up ☆50 Updated 4 years ago
- Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer. ☆130 Updated last year
- A Beginner's Guide to DuckDB's Python Client ☆41 Updated 5 months ago
- Learning-by-doing data model built with dbt-core ☆11 Updated 3 months ago
- Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide ☆162 Updated last month
- Python package implementing transformers for preprocessing steps for machine learning. ☆56 Updated this week
- The SQL/Ibis powered sklearn of record linkage ☆14 Updated last week
- dbt tutorial using a local PostgreSQL database ☆38 Updated 2 years ago