marshackVB / databricks_feature_storeLinks
☆10Updated 3 years ago
Alternatives and similar repositories for databricks_feature_store
Users that are interested in databricks_feature_store are comparing it to the libraries listed below
Sorting:
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆184Updated last year
- Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow☆237Updated 2 years ago
- ☆17Updated last year
- 🛠 Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.☆147Updated last year
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆114Updated 2 years ago
- Joining the modern data stack with the modern ML stack☆196Updated 2 years ago
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- ☆9Updated 2 years ago
- A library to find and visualise the most interesting slices in multidimensional data☆108Updated 3 months ago
- ☆11Updated 2 years ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆69Updated last month
- Data pipeline with dbt, Airflow, Great Expectations☆163Updated 3 years ago
- Template repo for kickstarting recipes for regression use case☆55Updated 6 months ago
- Food for thoughts around data contracts☆25Updated 3 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆217Updated last week
- This repository provides various demos/examples of using Snowpark for Python.☆276Updated last year
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this three …☆240Updated 4 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Demo of Streamlit application with Databricks SQL Endpoint☆35Updated 2 years ago
- ☆13Updated last year
- A package for automatic data collection and feature engineering☆25Updated 2 years ago
- PySpark test helper methods with beautiful error messages☆699Updated 2 weeks ago
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆183Updated 11 months ago
- Template for a data contract used in a data mesh.☆472Updated last year
- Data Quality assessment with one line of code☆446Updated last week
- A Databricks framework for quick Agent solutions☆20Updated last year
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆217Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆255Updated this week
- Delta Lake helper methods in PySpark☆326Updated 9 months ago