getindata / quickstart-ml-blueprints
Data science project development best practices and state-of-the-art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆18 Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints:
Users who are interested in quickstart-ml-blueprints are comparing it to the repositories listed below.
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML" ☆24 Updated 8 months ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more... ☆19 Updated this week
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 6 months ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- This repository contains code to build an MVP search engine with a Google-like interface. ☆15 Updated 4 years ago
- My speaker profile for events and conferences based on codepo8/presenter-terms ☆13 Updated last week
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆29 Updated 3 months ago
- Explore tips and tricks to deploy machine learning models with Docker. ☆13 Updated last year
- Recipes of publicly available Jupyter images ☆8 Updated 3 weeks ago
- Batteries included toolkit for data engineering. ☆33 Updated 3 months ago
- NiFi Processor for Apache Pulsar ☆10 Updated 4 months ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow. ☆28 Updated 3 weeks ago
- Foundational tools for BCG X's data science packages. ☆36 Updated 8 months ago
- ☆17 Updated 7 months ago
- Projects developed by Domino's R&D team ☆76 Updated 2 years ago
- Events about the open source data stack ☆13 Updated 2 years ago
- ☆17 Updated 2 years ago
- A framework of open-source technologies to design real-time machine learning systems ☆28 Updated 2 years ago
- Playground site for creating/validating data contracts ☆9 Updated 6 months ago
- Dask integration for Snowflake ☆30 Updated 4 months ago
- A tool for generating docker-compose environments ☆22 Updated 2 weeks ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated last week
- ☆13 Updated last year
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and … ☆37 Updated 2 years ago
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆74 Updated 3 months ago
- ☆30 Updated 3 years ago
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated 3 weeks ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 7 months ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- Demo repository to lambda-fy your dbt runs ☆11 Updated last year