vincentclaes / datajob
Build and deploy a serverless data pipeline on AWS with no effort.
☆111 Updated 2 years ago
Alternatives and similar repositories for datajob
Users who are interested in datajob are comparing it to the repositories listed below.
- Example templates for the delivery of custom ML solutions to production so you can get started quickly without having to make too many de… ☆72 Updated 11 months ago
- Demo for GitHub Universe 2022 ☆12 Updated 2 years ago
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR ☆37 Updated 3 months ago
- ☆61 Updated 3 years ago
- Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS ☆290 Updated last month
- This repository shows a sample example to build, manage and orchestrate Machine Learning workflows using Amazon Sagemaker and Apache Airf… ☆136 Updated 3 years ago
- Tools to run Jupyter notebooks as jobs in Amazon SageMaker - ad hoc, on a schedule, or in response to events ☆143 Updated last year
- You're one command away from deploying your Streamlit app on AWS Fargate! ☆47 Updated 4 years ago
- This repo will teach you how to deploy an ML-powered web app to AWS Fargate from start to finish using Streamlit and AWS CDK ☆108 Updated 4 years ago
- Tutorials for Fugue - A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark and Dask withou… ☆113 Updated last year
- ☆84 Updated last year
- ☆73 Updated last year
- Safe blue/green deployment of Amazon SageMaker endpoints using AWS CodePipeline, CodeBuild and CodeDeploy. ☆106 Updated 2 years ago
- Docker images that replicate the Amazon SageMaker Notebook instance. ☆58 Updated 3 years ago
- ☆89 Updated last year
- Automated data quality suggestions and analysis with Deequ on AWS Glue ☆85 Updated 2 years ago
- This repository contains the dbt-glue adapter ☆123 Updated this week
- Amazon SageMaker MLOps deployment pipeline for A/B Testing of machine learning models. ☆44 Updated 3 years ago
- ☆57 Updated 3 years ago
- Experiment tracking and metric logging for Amazon SageMaker notebooks and model training. ☆127 Updated last year
- Run dbt serverless in the Cloud (AWS) ☆42 Updated 5 years ago
- PySpark data-pipeline testing and CICD ☆28 Updated 4 years ago
- Using the Parquet file format with Python ☆15 Updated last year
- (project & tutorial) dag pipeline tests + ci/cd setup ☆88 Updated 4 years ago
- Redshift Python Connector. It supports Python Database API Specification v2.0. ☆212 Updated last week
- Open innovation with 60 minute cloud experiments on AWS ☆88 Updated last year
- ☆145 Updated 2 years ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆80 Updated last year
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR ☆18 Updated last month
- This repo provides an end-to-end example of using streaming feature aggregation with the Amazon SageMaker Feature Store. ☆46 Updated 3 years ago