getindata / quickstart-ml-blueprintsLinks
Data science project development best practices and state of the art open-source tooling forged into a set of solved ML use cases to serve as blueprints for efficient prototyping.
☆19Updated last year
Alternatives and similar repositories for quickstart-ml-blueprints
Users that are interested in quickstart-ml-blueprints are comparing it to the libraries listed below
Sorting:
- NiFi Processor for Apache Pulsar☆10Updated 7 months ago
- Workshop "From zero to MLOps: An open source stack to fight spaghetti ML"☆25Updated 11 months ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 6 months ago
- Batteries included toolkit for data engineering.☆34Updated 5 months ago
- Receipes of publicly-available Jupyter images☆8Updated 3 months ago
- Demo for GDS 2022 February Webinar☆11Updated 3 years ago
- FLaNK AI Weekly covering Apache NiFi, Apache Flink, Apache Kafka, Apache Spark, Apache Iceberg, Apache Ozone, Apache Pulsar, and more...☆21Updated this week
- Using Polars and Pandas on AWS Lambda to process data.☆9Updated last year
- A tool for generating docker-compose environments☆25Updated 2 months ago
- Big Data Newsletter☆23Updated last year
- Demos of Materialize, the operational data warehouse.☆51Updated 3 months ago
- Firefox extension that shows parquet schema when going over GCP cloud storage. Use DuckDB WASM☆12Updated last year
- ☆17Updated 2 years ago
- rust-for-data☆45Updated last year
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it☆26Updated last year
- Multi-docker container data science / engineering playground (w/ Kafka, Airflow, MLFlow, Tensorflow-Keras / SKLearn) for simulating a mic…☆11Updated 2 years ago
- Predicting Car Prices with FastAPI, Streamlit, MLflow, Kafka, and Debezium: A Practical Demonstration☆20Updated 7 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- This repository contains code to build an MVP search engine with google like interface.☆15Updated 2 weeks ago
- This repository contains recipes for Apache Pinot.☆30Updated 4 months ago
- A repository that showcases how you can use ZenML with Git☆69Updated last month
- A framework of open-source technologies to design real-time machine learning systems☆28Updated 2 years ago
- Delta reader for the Ray open-source toolkit for building ML applications☆46Updated last year
- A collection of my favorite tech-related blog posts.☆10Updated last month
- Templates for your Kedro projects.☆76Updated this week
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- Kedro plugin to support running workflows on Microsoft Azure ML Pipelines☆37Updated this week
- Foundational tools for BCG X's data science packages.☆35Updated 11 months ago
- Full stack data engineering tools and infrastructure set-up☆53Updated 4 years ago
- Dataset for training ML ranking models☆20Updated 2 years ago