microsoft / pyspark_propensity_matching
library for conducting propensity matching on spark scale
☆14Updated last year
Alternatives and similar repositories for pyspark_propensity_matching:
Users that are interested in pyspark_propensity_matching are comparing it to the libraries listed below
- A Scalable Data Cleaning Library for PySpark.☆26Updated 5 years ago
- Cookiecutter template for testing Python scikit-learn clustering learners.☆17Updated 2 years ago
- Notebooks which will provide a demo of Qgrid functionality☆20Updated 5 years ago
- Cookiecutter template for testing Python scikit-learn regression learners.☆15Updated last year
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- 📝 A blog post about report generation and automation in python☆40Updated 5 years ago
- Cookiecutter template for testing Python scikit-learn classifiers.☆36Updated last year
- Material from presentations☆13Updated 3 years ago
- Unpack the source files from a Databricks .dbc archive file.☆26Updated 10 months ago
- Simple samples for writing ETL transform scripts in Python☆22Updated 3 years ago
- Explore tips and tricks to deploy machine learning models with Docker.☆13Updated last year
- AWS Big Data Certification☆25Updated last month
- ☆16Updated 8 years ago
- Demo assets for DAIS 2021 'Learn to use Databricks for the full ML lifecycle' Talk☆13Updated 3 years ago
- ☆21Updated last year
- ☆44Updated 9 months ago
- Synthetic Call Detail Record (CDR) generator using Spark☆22Updated 10 years ago
- Abstractions for feature engineering on large graphs of tabular data.☆21Updated 3 weeks ago
- The PEDSnet Data Quality Assessment Toolkit (OMOP CDM)☆24Updated 3 years ago
- Repository containing all the necessary documents for the conferences☆13Updated last year
- Interactive notebooks containing demonstration code of the splink library☆37Updated last year
- Directions and Source code for Insight's Docker workshop.☆22Updated 2 years ago
- ☆45Updated 11 months ago
- Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on sing…☆45Updated 3 months ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- Quick descriptions and answers of common data science tasks and questions☆25Updated 8 years ago
- Example Set up For DBT Cloud using Github Integrations☆11Updated 4 years ago
- Retail industry solutions for product price optimization using the Cortana Intelligence Suite with end-to-end walkthrough☆86Updated 7 years ago
- Spark NLP for Streamlit☆15Updated 3 years ago
- DuckDB with Dashboarding tools demo evidence, streamlit and rill☆15Updated last year