marshackVB / databricks_feature_store
☆10Updated 2 years ago
Alternatives and similar repositories for databricks_feature_store
Users that are interested in databricks_feature_store are comparing it to the libraries listed below
Sorting:
- Joining the modern data stack with the modern ML stack☆197Updated 2 years ago
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template☆112Updated 2 years ago
- Recommendations at "Reasonable Scale": joining dataOps with recSys through dbt, Merlin and Metaflow☆238Updated 2 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- ☆10Updated 2 years ago
- Accompanying solution accelerator notebook for the Databricks blog on transformer models☆15Updated 2 years ago
- Example repo to kickstart integration with mlflow pipelines.☆76Updated 2 years ago
- ☆9Updated 2 years ago
- ☆13Updated last year
- A SQL port of python's scikit-learn preprocessing module, provided as cross-database dbt macros.☆184Updated last year
- A library to find and visualise the most interesting slices in multidimensional data☆108Updated last month
- 🧪 Simple data science experimentation & tracking with jupyter, papermill, and mlflow.☆183Updated 10 months ago
- Delta Lake helper methods in PySpark☆323Updated 8 months ago
- Playground for using large language models into the Modern Data Stack for entity matching☆107Updated 2 years ago
- Capturing model drift and handling its response - Example webinar☆108Updated 5 years ago
- A Python Library to support running data quality rules while the spark job is running⚡☆188Updated last week
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆69Updated 2 months ago
- Pythonic Programming Framework to orchestrate jobs in Databricks Workflow☆215Updated last week
- ☆16Updated last year
- ☆58Updated last year
- Data pipeline with dbt, Airflow, Great Expectations☆162Updated 3 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆80Updated last year
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆246Updated 3 months ago
- Basic and advanced MLflow examples for many ML flavors☆222Updated 9 months ago
- Template repo for kickstarting recipes for regression use case☆54Updated 5 months ago
- A PaaS End-to-End ML Setup with Metaflow, Serverless and SageMaker.☆37Updated 4 years ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆215Updated 2 months ago
- Data Quality assessment with one line of code☆442Updated this week
- Examples of Prompt Engineering, Zero Shot Learning, Few Shot Learning and Retrieval Augmented Generation (RAG) using Hugging Face, Databr…☆15Updated last year
- A Python PySpark Projet with Poetry☆23Updated 8 months ago