huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆229Updated last week
Alternatives and similar repositories for optimum-neuron
Users that are interested in optimum-neuron are comparing it to the libraries listed below
Sorting:
- ☆107Updated 3 months ago
- Example code for AWS Neuron SDK developers building inference and training applications☆143Updated 2 weeks ago
- Large Language Model Hosting Container☆88Updated 2 weeks ago
- ☆56Updated last month
- ☆256Updated 3 weeks ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆139Updated 7 months ago
- ☆62Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆15Updated this week
- A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell)☆242Updated 2 months ago
- Hands-on workshop for distributed training and hosting on SageMaker☆137Updated 3 weeks ago
- ☆70Updated 10 months ago
- ☆35Updated 4 months ago
- ☆24Updated last year
- ☆199Updated last year
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆510Updated this week
- A generative AI-powered framework for testing virtual agents.☆230Updated last month
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆300Updated this week
- ☆54Updated 5 months ago
- experiments with inference on llama☆104Updated 11 months ago
- ☆88Updated last year
- Command Line Interface for Hugging Face Inference Endpoints☆66Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆132Updated 4 months ago
- The package used to build the documentation of our Hugging Face repos☆112Updated 2 weeks ago
- ☆47Updated 2 weeks ago
- ☆56Updated 5 months ago
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆157Updated last year
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆186Updated this week
- ☆123Updated 6 months ago
- ☆96Updated last week
- Manage scalable open LLM inference endpoints in Slurm clusters☆256Updated 10 months ago