A Spark library for Amazon SageMaker.
☆301Mar 8, 2025Updated last year
Alternatives and similar repositories for sagemaker-spark
Users that are interested in sagemaker-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A library for training and deploying machine learning models on Amazon SageMaker☆2,233Updated this week
- SageMaker specific extensions to TensorFlow.☆54Jul 23, 2024Updated last year
- Toolkit for running TensorFlow training scripts on SageMaker. Dockerfiles used for building SageMaker TensorFlow Containers are at https:…☆271Jun 4, 2025Updated 9 months ago
- Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.☆10,888Feb 24, 2026Updated 3 weeks ago
- WARNING: This package has been deprecated. Please use the SageMaker Training Toolkit for model training and the SageMaker Inference Toolk…☆188Jun 22, 2020Updated 5 years ago
- Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS☆295Apr 15, 2025Updated 11 months ago
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.☆412Nov 20, 2023Updated 2 years ago
- The open source version of the Amazon SageMaker docs☆253Jun 15, 2023Updated 2 years ago
- Amazon SageMaker workshops: Introduction, TensorFlow in SageMaker, and more☆390Jan 14, 2026Updated 2 months ago
- A TensorFlow Serving solution for use in SageMaker. This repo is now deprecated.☆172Sep 13, 2023Updated 2 years ago
- A collection of sample scripts to customize Amazon SageMaker Notebook Instances using Lifecycle Configurations☆428Apr 2, 2024Updated last year
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB☆228Jan 15, 2026Updated 2 months ago
- Experiment tracking and metric logging for Amazon SageMaker notebooks and model training.☆127Nov 14, 2023Updated 2 years ago
- A Jupyter server extension to proxy requests with AWS SigV4 authentication☆22Jul 12, 2023Updated 2 years ago
- The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.☆41Updated this week
- A tutorial on how to build, train, and deploy advanced CNN architectures U-Net and ENet for per-pixel binary segmentation on SageMaker.☆19Mar 21, 2018Updated 8 years ago
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https…☆29Sep 13, 2023Updated 2 years ago
- A set of dockerfiles that provide Reinforcement Learning solutions for use in SageMaker.☆82Apr 8, 2024Updated last year
- Utility which provides a UI to do prompt engineering within SageMaker Studio.☆14Jul 5, 2023Updated 2 years ago
- Amazon SageMaker examples for prebuilt framework mode containers, a.k.a. Script Mode, and more (BYO containers and models etc.)☆169Dec 20, 2023Updated 2 years ago
- Creates a CloudFormation template that uses AWS StepFunctions to automate the building and training of Sagemaker custom models based on S…☆165Jan 15, 2020Updated 6 years ago
- Toolkit for running PyTorch training scripts on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at https://gith…☆205Aug 25, 2025Updated 6 months ago
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild.☆58Apr 18, 2022Updated 3 years ago
- Supporting code, Dockerfile, and Jupyter notebook for an end to end tutorial on Amazon SageMaker and EMR.☆28Jan 14, 2026Updated 2 months ago
- DynamoDB Key Diagnostics Library is a Java DynamoDB client wrapper that automatically logs your key usage information to Kinesis as your…☆15Oct 13, 2020Updated 5 years ago
- Jupyter magics and kernels for working with remote Spark clusters☆1,362Sep 9, 2025Updated 6 months ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆142Oct 7, 2024Updated last year
- This library contains various Apache Flink connectors to connect to AWS data sources and sinks.☆16Dec 5, 2023Updated 2 years ago
- Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker.☆534Jan 16, 2026Updated 2 months ago
- DynamoDB data source for Apache Spark☆95Sep 2, 2021Updated 4 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,596Mar 10, 2026Updated last week
- Integrating Amazon API Gateway private endpoints with on-premises networks☆12Jul 9, 2021Updated 4 years ago
- pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD…☆4,106Updated this week
- Notebooks for MXNet☆12Aug 24, 2017Updated 8 years ago
- LLMs and Machine Learning done easily☆442Feb 11, 2026Updated last month
- Tools to run Jupyter notebooks as jobs in Amazon SageMaker - ad hoc, on a schedule, or in response to events☆144Oct 7, 2023Updated 2 years ago
- MLeap: Deploy ML Pipelines to Production☆1,535Mar 10, 2026Updated last week
- Automated data quality suggestions and analysis with Deequ on AWS Glue☆91Dec 29, 2022Updated 3 years ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations.☆697Jan 13, 2026Updated 2 months ago