A Spark library for Amazon SageMaker.
☆301 · Mar 8, 2025 · Updated 11 months ago
Alternatives and similar repositories for sagemaker-spark
Users who are interested in sagemaker-spark are comparing it to the libraries listed below.
- A library for training and deploying machine learning models on Amazon SageMaker ☆2,228 · Updated this week
- SageMaker specific extensions to TensorFlow. ☆54 · Jul 23, 2024 · Updated last year
- Toolkit for running TensorFlow training scripts on SageMaker. Dockerfiles used for building SageMaker TensorFlow Containers are at https:… ☆271 · Jun 4, 2025 · Updated 8 months ago
- Docker container for running Chainer scripts to train and host Chainer models on SageMaker ☆19 · Apr 10, 2023 · Updated 2 years ago
- Step Functions Data Science SDK for building machine learning (ML) workflows and pipelines on AWS ☆295 · Apr 15, 2025 · Updated 10 months ago
- A TensorFlow Serving solution for use in SageMaker. This repo is now deprecated. ☆172 · Sep 13, 2023 · Updated 2 years ago
- The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker. ☆40 · Feb 11, 2026 · Updated 2 weeks ago
- The open source version of the Amazon SageMaker docs ☆253 · Jun 15, 2023 · Updated 2 years ago
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker. ☆412 · Nov 20, 2023 · Updated 2 years ago
- Toolkit for running MXNet training scripts on SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https://github.c… ☆60 · Feb 11, 2025 · Updated last year
- A collection of sample scripts to customize Amazon SageMaker Notebook Instances using Lifecycle Configurations ☆428 · Apr 2, 2024 · Updated last year
- Amazon SageMaker workshops: Introduction, TensorFlow in SageMaker, and more ☆391 · Jan 14, 2026 · Updated last month
- Toolkit for allowing inference and serving with MXNet in SageMaker. Dockerfiles used for building SageMaker MXNet Containers are at https… ☆29 · Sep 13, 2023 · Updated 2 years ago
- Experiment tracking and metric logging for Amazon SageMaker notebooks and model training. ☆127 · Nov 14, 2023 · Updated 2 years ago
- This library contains various Apache Flink connectors to connect to AWS data sources and sinks. ☆16 · Dec 5, 2023 · Updated 2 years ago
- Implementations of open source Apache Hadoop/Hive interfaces which allow for ingesting data from Amazon DynamoDB ☆228 · Jan 15, 2026 · Updated last month
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ☆58 · Apr 18, 2022 · Updated 3 years ago
- Toolkit for running PyTorch training scripts on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at https://gith… ☆205 · Aug 25, 2025 · Updated 6 months ago
- Amazon SageMaker examples for prebuilt framework mode containers, a.k.a. Script Mode, and more (BYO containers and models etc.) ☆170 · Dec 20, 2023 · Updated 2 years ago
- Jupyter magics and kernels for working with remote Spark clusters ☆1,362 · Sep 9, 2025 · Updated 5 months ago
- pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoD… ☆4,107 · Updated this week
- Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker. ☆534 · Jan 16, 2026 · Updated last month
- DynamoDB data source for Apache Spark ☆95 · Sep 2, 2021 · Updated 4 years ago
- A continuous integration (CI) system for 📓 Jupyter notebooks, built using 🧠 Amazon SageMaker. ☆11 · Aug 5, 2025 · Updated 6 months ago
- ☆14 · Feb 23, 2021 · Updated 5 years ago
- The plugin-driven server agent for collecting & reporting metrics. ☆14 · Mar 21, 2025 · Updated 11 months ago
- SageMaker Experiments and DVC ☆17 · Aug 22, 2022 · Updated 3 years ago
- DynamoDB Key Diagnostics Library is a Java DynamoDB client wrapper that automatically logs your key usage information to Kinesis as your… ☆15 · Oct 13, 2020 · Updated 5 years ago
- Sample code supporting the `Generating REST APIs from data classes in Python` blog post ☆11 · May 20, 2024 · Updated last year
- A tutorial on how to build, train, and deploy advanced CNN architectures U-Net and ENet for per-pixel binary segmentation on SageMaker. ☆19 · Mar 21, 2018 · Updated 7 years ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets. ☆3,583 · Feb 17, 2026 · Updated last week
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆142 · Oct 7, 2024 · Updated last year
- The open source version of the AWS Glue docs. You can submit feedback & requests for changes by submitting issues in this repo or by maki… ☆201 · Jun 15, 2023 · Updated 2 years ago
- Samples and documentation for using the Amazon Neptune graph database service ☆369 · Updated this week
- AWS Glue code samples ☆1,536 · Nov 5, 2025 · Updated 3 months ago
- AWS Glue Libraries are additions and enhancements to Spark for ETL operations. ☆695 · Jan 13, 2026 · Updated last month
- Machine Learning Ops Workshop with SageMaker: lab guides and materials. ☆333 · Apr 15, 2021 · Updated 4 years ago
- This is the Docker container based on open source framework XGBoost (https://xgboost.readthedocs.io/en/latest/) to allow customers use th… ☆144 · Updated this week
- ☆13 · Jun 5, 2025 · Updated 8 months ago