huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆221 Updated this week
Alternatives and similar repositories for optimum-neuron:
Users who are interested in optimum-neuron are comparing it to the libraries listed below:
- ☆103 Updated 2 months ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆140 Updated last month
- ☆250 Updated 5 months ago
- Large Language Model Hosting Container ☆84 Updated last week
- ☆53 Updated last month
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h… ☆137 Updated 5 months ago
- ☆61 Updated last week
- A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell) ☆234 Updated 3 weeks ago
- Hands-on workshop for distributed training and hosting on SageMaker ☆133 Updated last month
- experiments with inference on llama ☆104 Updated 9 months ago
- A generative AI-powered framework for testing virtual agents. ☆204 Updated last week
- ☆35 Updated 3 months ago
- Google TPU optimizations for transformers models ☆104 Updated 2 months ago
- ☆24 Updated 11 months ago
- ☆69 Updated 8 months ago
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i… ☆501 Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆12 Updated this week
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆267 Updated this week
- A universal scalable machine learning model deployment solution ☆213 Updated this week
- ☆40 Updated 4 months ago
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac… ☆230 Updated 2 weeks ago
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O… ☆289 Updated last month
- ☆22 Updated last year
- ☆88 Updated last year
- ☆120 Updated 4 months ago
- The package used to build the documentation of our Hugging Face repos ☆106 Updated this week
- ☆67 Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆306 Updated 2 years ago
- ☆43 Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆262 Updated 5 months ago