getindata / quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆18Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints:
Users that are interested in quickstart-ml-blueprints are comparing it to the libraries listed below
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 4 months ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆24Updated 9 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 7 months ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆28Updated last month
- Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP☆10Updated 8 months ago
- Kedro Plugin to support running pipelines on AWS SageMaker.☆21Updated 2 months ago
- Batteries included toolkit for data engineering.☆34Updated 3 months ago
- Events about the open source data stack☆13Updated 3 years ago
- Receipes of publicly-available Jupyter images☆8Updated last month
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- Foundational tools for BCG X's data science packages.☆35Updated 9 months ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆76Updated 4 months ago
- NiFi Processor for Apache Pulsar☆10Updated 5 months ago
- Apache Spark based framework for analysis A/B experiments☆13Updated 5 months ago
- A few end to end examples that use data-describe☆16Updated last year
- My speaker profile for events and conferences based on codepo8/presenter-terms☆13Updated 3 weeks ago
- A tool for generating docker-compose environments☆23Updated last month
- GetInData Helm Charts repository☆12Updated 2 years ago
- This repository contains a recipe for bootstrapping a climate analysis application using Apache Pinot and Superset☆20Updated 4 years ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 6 months ago
- Publish and share Kedro-Viz static website on GitHub pages in your workflow through this GitHub action☆13Updated 3 weeks ago
- Templates for your Kedro projects.☆73Updated 3 weeks ago
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆38Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆45Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆20Updated this week
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- Kedro plugin to support running pipelines on Dagster☆11Updated this week
- Kedro Plugin to support running workflows on GCP Vertex AI Pipelines☆36Updated this week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week