mllite / sklearn2sql-demo
Demo of an In-database processing tool for scikit-learn
☆13Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for sklearn2sql-demo
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Plugin for Intake to read from SQL servers☆15Updated last year
- Fully unit tested utility functions for data engineering. Python 3 only.☆14Updated 2 months ago
- ☆29Updated 10 months ago
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- This repository auto-configures an Apache Pinot and Superset cluster for analyzing IRA tweets from FiveThirtyEight.☆11Updated 4 years ago
- Build your feature store with macros right within your dbt repository☆37Updated last year
- The sane way of building a data layer in Airflow☆24Updated 4 years ago
- real-time data + ML pipeline☆54Updated 2 weeks ago
- A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable fro…☆26Updated 2 years ago
- A utility tool to automate certain tasks with Jupyter notebooks.☆9Updated 8 months ago
- Collaboration app for sharing and reviewing jupyter notebooks☆16Updated last year
- A python package to create a database on the platform using our moj data warehousing framework☆21Updated 2 months ago
- Utilities for creating ETL pipelines with mara☆36Updated 2 years ago
- Marshmallow Schema generator for Pandas DataFrames☆24Updated 4 years ago
- Interactive notebooks containing demonstration code of the splink library☆38Updated 9 months ago
- Comparing Polars to Pandas and a small introduction☆43Updated 3 years ago
- Titus 2 : Portable Format for Analytics (PFA) implementation for Python 3.4+☆23Updated last year
- ☆27Updated 5 years ago
- This is a compilation of Data Governance resources, examples, models and communities☆10Updated 5 years ago
- Functional Airflow DAG definitions.☆38Updated 7 years ago
- ☆16Updated last year
- ☆24Updated 4 years ago
- A pyspark lib to validate data quality☆18Updated 2 years ago
- A python client library for the Stitch Import API☆42Updated 10 months ago
- Abstractions for feature engineering on large graphs of tabular data.☆22Updated this week
- Automated Jupyter notebook testing. 📙☆41Updated 9 months ago
- ☆13Updated last year
- Record matching and entity resolution at scale in Spark☆31Updated last year
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆25Updated this week