getindata / quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆16Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints:
Users that are interested in quickstart-ml-blueprints are comparing it to the libraries listed below
- Receipes of publicly-available Jupyter images☆9Updated 3 months ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow.☆28Updated last year
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆37Updated 5 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆54Updated 4 months ago
- NiFi Processor for Apache Pulsar☆10Updated 2 months ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆26Updated 6 months ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆18Updated this week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro☆68Updated last month
- Big Data Newsletter☆25Updated 9 months ago
- Deploy A/B testing infrastructure in a containerized microservice architecture for Machine Learning applications.☆40Updated last week
- Batteries included toolkit for data engineering.☆33Updated 2 weeks ago
- Test data management tool for any data source, batch or real-time. Generate, validate and clean up data all in one tool.☆44Updated last week
- Source code for the post Effortless deployments with MLFlow, showcasing how logging models using MLFLow can provide you want to easily de…☆16Updated last year
- A tutorial on how to use kedro-mlflow plugin (https://github.com/Galileo-Galilei/kedro-mlflow) to synchronize training and inference and …☆37Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆43Updated 11 months ago
- ☆13Updated last year
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution …☆32Updated last year
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and…☆28Updated last year
- O'Reilly book - Building Machine Learning Systems with a feature store: batch, real-time, and LLMs☆25Updated last week
- Events about the open source data stack☆13Updated 2 years ago
- Using the Parquet file format with Python☆15Updated last year
- A curated list of awesome open source tools and commercial products for monitoring data quality, monitoring model performance, and profil…☆69Updated 8 months ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 2 years ago
- ☆28Updated 3 months ago
- Demo on how to use Prefect with Docker☆25Updated 2 years ago
- Assessing whether data from database complies with reference information.☆42Updated this week
- A few end to end examples that use data-describe☆16Updated last year
- The repository that contains all source code for the ZenML UI.☆45Updated this week
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆26Updated last month