getindata / quickstart-ml-blueprints
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆18Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints:
Users that are interested in quickstart-ml-blueprints are comparing it to the libraries listed below
- NiFi Processor for Apache Pulsar☆10Updated 6 months ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆20Updated this week
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- A tool for generating docker-compose environments☆24Updated 2 weeks ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆25Updated 10 months ago
- ☆17Updated 9 months ago
- Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect☆13Updated 6 months ago
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 4 years ago
- rust-for-data☆45Updated last year
- Python package to monitor the power consumption of any algorithm☆46Updated 2 years ago
- Unity Catalog UI☆40Updated 8 months ago
- Batteries included toolkit for data engineering.☆34Updated 4 months ago
- ☆18Updated last year
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆36Updated this week
- Big Data Newsletter☆23Updated last year
- Getting Great Expectations setup to run on DataBricks with Spark Dataframes.☆13Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM☆46Updated 8 months ago
- CalData infrastructure☆15Updated this week
- ☆21Updated 5 months ago
- Demo repository to lambda-fy your dbt runs☆11Updated last year
- An open specification for data products in Data Mesh☆59Updated 6 months ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆53Updated 8 months ago
- New generation opensource data stack☆67Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Repository containing various utils related to Snowflake migration at Faire.☆12Updated 2 years ago
- Guide to data platforms and tools☆32Updated 3 years ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆28Updated last year
- Receipes of publicly-available Jupyter images☆8Updated last month
- Yet Another (Spark) ETL Framework☆21Updated last year