aws / sagemaker-sparkml-serving-container
This code is used to build & run a Docker container for performing predictions against a Spark ML Pipeline.
☆53Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sagemaker-sparkml-serving-container
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated last year
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated last year
- Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.c…☆60Updated last year
- ☆64Updated 4 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆19Updated 4 years ago
- A library of additional estimators and SageMaker tools based on scikit-learn☆39Updated 9 months ago
- This repository shows a sample example to build, manage and orchestrate Machine Learning workflows using Amazon Sagemaker and Apache Airf…☆137Updated 3 years ago
- A Spark library for Amazon SageMaker.☆300Updated 2 weeks ago
- SageMaker specific extensions to TensorFlow.☆54Updated 4 months ago
- Supporting code, Dockerfile, and Jupyter notebook for an end to end tutorial on Amazon SageMaker and EMR.☆28Updated 5 years ago
- The open source version of the Amazon EMR Management Guide. You can submit feedback & requests for changes by submitting issues in this r…☆61Updated last year
- This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use th…☆125Updated last month
- Best practices and recommendations for getting started with Amazon EMR on EKS.☆61Updated last week
- AWS Quick Start Team☆18Updated last month
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆83Updated last year
- Scripts and instructions to facilitate running Deep Learning Tasks on Amazon EMR☆63Updated last year
- ☆30Updated last year
- A high performance data access library for machine learning tasks☆74Updated last year
- XGBoost GPU accelerated on Spark example applications☆52Updated 2 years ago
- Tools to run Jupyter notebooks as jobs in Amazon SageMaker - ad hoc, on a schedule, or in response to events☆142Updated last year
- Examples for using Amazon SageMaker components in Kubeflow Pipelines☆22Updated 4 years ago
- Sample Apache Beam pipeline that can be deployed to Amazon Managed Service for Apache Flink. It reads taxi events from a Kinesis data str…☆46Updated last year
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- Learn how to build an end-to-end streaming architecture to ingest, analyze, and visualize streaming data in near real-time☆34Updated 2 years ago
- The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.☆38Updated 4 months ago
- Distributed training using Kubeflow on Amazon EKS☆82Updated this week
- Reference Architectures for Datalakes on AWS☆79Updated 4 years ago
- A Data Platform built for AWS, powered by Kubernetes.☆127Updated last year
- [ARCHIVED] Moved to github.com/NVIDIA/spark-xgboost-examples☆70Updated 4 years ago