★272 · Apr 23, 2025 · Updated 10 months ago
Alternatives and similar repositories for sagemaker-huggingface-inference-toolkit
Users who are interested in sagemaker-huggingface-inference-toolkit are comparing it to the libraries listed below.
- Serve machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker. ★412 · Nov 20, 2023 · Updated 2 years ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ★142 · Oct 7, 2024 · Updated last year
- Large Language Model Hosting Container ★91 · Oct 9, 2025 · Updated 4 months ago
- Deploy Stable Diffusion Model on Amazon SageMaker Endpoint ★38 · Feb 22, 2024 · Updated 2 years ago
- Sagemaker Studio Docker UI Extension ★11 · Apr 17, 2024 · Updated last year
- Train machine learning models within a 🐳 Docker container using 🧠 Amazon SageMaker. ★535 · Jan 16, 2026 · Updated last month
- Training and inference on AWS Trainium and Inferentia chips. ★261 · Updated this week
- In this repo, we show how to host two computer vision models trained using the TensorFlow framework under one SageMaker multi-model endpo… ★12 · Jun 8, 2021 · Updated 4 years ago
- A universal scalable machine learning model deployment solution ★248 · Feb 28, 2026 · Updated last week
- ★14 · Nov 1, 2024 · Updated last year
- Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker. ★10,884 · Feb 24, 2026 · Updated last week
- Deploy, launch and use LLMs on AWS ★16 · Jun 2, 2023 · Updated 2 years ago
- ★64 · Apr 25, 2025 · Updated 10 months ago
- Notebooks and sample code for Build On Trainium ★47 · Jan 14, 2026 · Updated last month
- Support code for building and running Amazon SageMaker compatible Docker containers based on the open source framework Scikit-learn (http… ★182 · Feb 12, 2026 · Updated 3 weeks ago
- Example code for AWS Neuron SDK developers building inference and training applications ★158 · Jan 15, 2026 · Updated last month
- Notebooks using the Hugging Face libraries 🤗 ★4,474 · Feb 25, 2026 · Updated last week
- Port of Detectron2 to train/deploy model on Amazon Sagemaker ★16 · Mar 5, 2021 · Updated 5 years ago
- YoloV5 on SageMaker, including bring your own container ★18 · Nov 23, 2020 · Updated 5 years ago
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ★3,305 · Feb 9, 2026 · Updated 3 weeks ago
- Multi Model Server is a tool for serving neural net models for inference ★1,025 · May 20, 2024 · Updated last year
- Enterprise Scale NLP with Hugging Face & SageMaker Workshop series ★242 · Jan 20, 2023 · Updated 3 years ago
- ★24 · Aug 26, 2024 · Updated last year
- A Jupyter server extension to proxy requests with AWS SigV4 authentication ★22 · Jul 12, 2023 · Updated 2 years ago
- A set of Docker images that include popular frameworks for machine learning, data science and visualization. ★147 · Updated this week
- How to use stable diffusion model on AWS Sagemaker ★38 · Feb 23, 2023 · Updated 3 years ago
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i… ★583 · Updated this week
- Amazon SageMaker Edge Manager Workshop ★36 · Jun 6, 2022 · Updated 3 years ago
- Compilation of examples of SageMaker inference options and other features. ★73 · Oct 28, 2025 · Updated 4 months ago
- Repository for training and deploying Generative AI models, including text-text, text-to-image generation and prompt engineering playgrou… ★202 · Feb 27, 2026 · Updated last week
- CLI for building Docker images in SageMaker Studio using AWS CodeBuild. ★58 · Apr 18, 2022 · Updated 3 years ago
- ★11 · Sep 20, 2021 · Updated 4 years ago
- Toolkit for running PyTorch training scripts on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at https://gith… ★205 · Aug 25, 2025 · Updated 6 months ago
- Use the two different methods (deepspeed and SageMaker model parallelism library) to fine tune llama model on Sagemaker. Then deploy the … ★24 · Aug 1, 2023 · Updated 2 years ago
- ★27 · Feb 25, 2022 · Updated 4 years ago
- Viewer for text datasets in formats like HuggingFace, JSONL, etc. ★15 · Feb 25, 2025 · Updated last year
- ★14 · Nov 22, 2023 · Updated 2 years ago
- ★12 · Sep 11, 2023 · Updated 2 years ago
- ★13 · Oct 9, 2023 · Updated 2 years ago