quintoandar / butterfree
A tool for building feature stores.
☆302 · Updated 3 weeks ago
Alternatives and similar repositories for butterfree:
Users who are interested in butterfree are comparing it to the libraries listed below:
- Python tool for profiling-based anomaly monitoring on ETL data pipelines leveraging ML and Apache Spark. ☆16 · Updated last year
- Monitor the stability of a Pandas or Spark dataframe ⚙︎ ☆500 · Updated 3 months ago
- ML made simple ☆209 · Updated 2 years ago
- Joblib Apache Spark Backend ☆245 · Updated last month
- End to End example integrating MLFlow and Seldon Core ☆51 · Updated 4 years ago
- ML pipeline orchestration and model deployments on Kubernetes. ☆435 · Updated last year
- Examples of data science projects created with Kedro. ☆172 · Updated last year
- Python API for Deequ ☆766 · Updated last month
- Kedro Plugin to support running workflows on GCP Vertex AI Pipelines ☆36 · Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 · Updated 8 months ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging) ☆213 · Updated last month
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes ☆63 · Updated 2 years ago
- pyspark methods to enhance developer productivity 📣 👯 🎉 ☆669 · Updated 2 months ago
- PostgreSQL offline and online stores for Feast ☆32 · Updated 3 years ago
- Great Expectations Airflow operator ☆163 · Updated last week
- Fast iterative local development and testing of Apache Airflow workflows ☆200 · Updated last week
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 · Updated last year
- real-time data + ML pipeline ☆54 · Updated 3 weeks ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞 ☆720 · Updated last year
- Tool to automate data quality checks on data pipelines ☆255 · Updated 2 years ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop… ☆319 · Updated 8 months ago
- PySpark test helper methods with beautiful error messages ☆686 · Updated 3 weeks ago
- A curated list of awesome DataOps tools ☆188 · Updated 6 months ago
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow... ☆390 · Updated 2 years ago
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto… ☆389 · Updated 3 months ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command ☆69 · Updated 2 months ago
- Astronomer Core Docker Images ☆107 · Updated 11 months ago
- Generate and Visualize Data Lineage from query history ☆324 · Updated last year
- Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code. ☆125 · Updated 3 years ago
- Apache Marvin-AI ☆101 · Updated 2 years ago