vincentclaes / datajobLinks
Build and deploy a serverless data pipeline on AWS with no effort.
☆111Updated 2 years ago
Alternatives and similar repositories for datajob
Users that are interested in datajob are comparing it to the libraries listed below
Sorting:
- This repo will teach you how to deploy an ML-powered web app to AWS Fargate from start to finish using Streamlit and AWS CDK☆109Updated 4 years ago
- Example templates for the delivery of custom ML solutions to production so you can get started quickly without having to make too many de…☆74Updated last year
- You're one command away from deploying your Streamlit app on AWS Fargate!☆48Updated 4 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.☆81Updated last year
- Tools to run Jupyter notebooks as jobs in Amazon SageMaker - ad hoc, on a schedule, or in response to events☆145Updated 2 years ago
- Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS☆292Updated 7 months ago
- This sample demonstrates how to setup an Amazon SageMaker MLOps end-to-end pipeline for Drift detection☆63Updated 2 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆114Updated 4 months ago
- This is a repository for the Duke University Cloud Computing course project on Serveless Data Engineering Pipeline. For this project, I r…☆20Updated 4 years ago
- This repository shows a sample example to build, manage and orchestrate Machine Learning workflows using Amazon Sagemaker and Apache Airf…☆138Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou…☆114Updated 3 weeks ago
- ☆95Updated 4 years ago
- Docker images that replicate the Amazon SageMaker Notebook instance.☆57Updated 4 years ago
- A simple and easy to use Data Quality (DQ) tool built with Python.☆50Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆39Updated 9 months ago
- Write python locally, execute SQL in your data warehouse☆269Updated 3 years ago
- ☆73Updated last year
- Snowflake Guide: Building a Recommendation Engine Using Snowflake & Amazon SageMaker☆32Updated 4 years ago
- Streamlit EDA Dashboard Powered by AWS Cloud☆84Updated 5 months ago
- mlctl is the control plane for MLOps. It provides a CLI and a Python SDK for supporting key operations related to MLOps, such as "model t…☆25Updated 4 years ago
- Ingesting data with Pulumi, AWS lambdas and Snowflake in a scalable, fully replayable manner☆71Updated 3 years ago
- Projects developed by Domino's R&D team☆77Updated 3 years ago
- Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy.☆105Updated 3 years ago
- ∞ Priceloop Engineering Conventions for Scala, Python, Git Workflow etc☆100Updated 3 years ago
- Build your feature store with macros right within your dbt repository☆39Updated 2 years ago
- Playground for using large language models into the Modern Data Stack for entity matching☆108Updated 2 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Updated 7 months ago
- Open innovation with 60 minute cloud experiments on AWS☆87Updated last year
- ☆31Updated last year
- This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.☆52Updated 2 years ago