getindata / quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆18 Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints:
Users who are interested in quickstart-ml-blueprints are comparing it to the repositories listed below.
- NiFi Processor for Apache Pulsar ☆10 Updated 4 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to… ☆29 Updated 3 months ago
- Getting Great Expectations set up to run on Databricks with Spark Dataframes. ☆13 Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications ☆45 Updated last year
- Events about the open source data stack ☆13 Updated 2 years ago
- ☆30 Updated 3 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 Updated 6 months ago
- Kedro Plugin to support running pipelines on Kubernetes using Airflow. ☆28 Updated 3 weeks ago
- This repository contains code to build an MVP search engine with a Google-like interface. ☆15 Updated 4 years ago
- Using Polars and Pandas on AWS Lambda to process data. ☆9 Updated last year
- Apache Spark based framework for analyzing A/B experiments ☆13 Updated 4 months ago
- Personal Finance Project to automatically collect Swiss banking transactions into a DWH and visualise them ☆26 Updated last year
- A collection of MLflow custom flavors ☆15 Updated last year
- Kedro Plugin to support running pipelines on AWS SageMaker. ☆21 Updated last month
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆57 Updated 2 years ago
- rust-for-data ☆44 Updated last year
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 7 months ago
- ☆17 Updated 2 years ago
- Example project using DBT, Databricks and AdventureWorks sample database ☆11 Updated 2 years ago
- Integrating Apache Airflow, dbt, Great Expectations and Apache Superset to develop a modern open source data stack. ☆11 Updated 2 years ago
- A few end-to-end examples that use data-describe ☆16 Updated last year
- ☆18 Updated 8 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆28 Updated last week
- Plugins, extensions, case studies, articles, and video tutorials for Kedro ☆73 Updated 3 months ago
- Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/ ☆11 Updated 10 months ago
- A Data Mesh demo repository ☆13 Updated 5 months ago
- 📌 Track & manage metadata, visualize & compare Kedro pipelines in a nice UI. ☆18 Updated 7 months ago
- Explore tips and tricks to deploy machine learning models with Docker. ☆13 Updated last year
- Demo repository to lambda-fy your dbt runs ☆11 Updated last year
- ☆16 Updated last year