quintoandar / butterfree
A tool for building feature stores.
☆281Updated this week
Related projects: ⓘ
- ☆115Updated this week
- Joblib Apache Spark Backend☆242Updated last month
- Monitor the stability of a Pandas or Spark dataframe ⚙︎☆493Updated 2 months ago
- Python tool for profiling-based anomaly monitoring on ETL data pipelines leveraging ML and Apache Spark.☆15Updated 6 months ago
- ML made simple☆205Updated last year
- ☆107Updated this week
- Examples of data science projects created with Kedro.☆170Updated last year
- Toolkit for Apache Spark ML for Feature clean-up, feature Importance calculation suite, Information Gain selection, Distributed SMOTE, Mo…☆191Updated 3 years ago
- Python API for Deequ☆706Updated 2 weeks ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging)☆198Updated last week
- Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...☆379Updated last year
- ML pipeline orchestration and model deployments on Kubernetes.☆435Updated last year
- Hopsworks - Data-Intensive AI platform with a Feature Store☆1,135Updated last week
- End to end MLRun demos☆92Updated 2 months ago
- End to End example integrating MLFlow and Seldon Core☆51Updated 3 years ago
- Great Expectations Airflow operator☆158Updated 2 weeks ago
- Creates a Simulation of Fake Web Events☆79Updated 2 years ago
- real-time data + ML pipeline☆54Updated this week
- PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more☆221Updated 9 months ago
- A curated list of awesome DataOps tools☆139Updated 3 months ago
- 🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud sto…☆371Updated 4 months ago
- A tool to deploy a mostly serverless MLflow tracking server on a GCP project with one command☆65Updated last year
- Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'☆120Updated last year
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆111Updated 5 months ago
- Apache Marvin-AI☆102Updated last year
- Joining the modern data stack with the modern ML stack☆188Updated last year
- HandySpark - bringing pandas-like capabilities to Spark dataframes☆185Updated 5 years ago
- A Spark/Scala implementation of the isolation forest unsupervised outlier detection algorithm with support for exporting in ONNX format.☆224Updated 2 weeks ago
- Resources for Data Science Process management☆206Updated 4 years ago
- Repository with sample code and instructions for "Continuous Intelligence" and "Continuous Delivery for Machine Learning: CD4ML" workshop…☆316Updated last month