huggingface / optimum-neuron
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
☆193Updated this week
Related projects: ⓘ
- ☆94Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications☆120Updated 2 weeks ago
- ☆235Updated 3 weeks ago
- Large Language Model Hosting Container☆75Updated 2 weeks ago
- ☆39Updated this week
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆134Updated 3 months ago
- Hands-on workshop for distributed training and hosting on SageMaker☆118Updated this week
- ☆56Updated this week
- A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell)☆217Updated last week
- ☆201Updated 7 months ago
- ☆14Updated 5 months ago
- ☆62Updated 2 months ago
- experiments with inference on llama☆106Updated 3 months ago
- Foundation Model Evaluations Library☆184Updated 3 weeks ago
- ☆21Updated 5 months ago
- git extension for {collaborative, communal, continual} model development☆202Updated 3 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆177Updated this week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆175Updated this week
- Manage scalable open LLM inference endpoints in Slurm clusters☆217Updated 2 months ago
- ☆86Updated last year
- ☆89Updated 11 months ago
- Zero administration inference with AWS Lambda for 🤗☆62Updated 2 years ago
- batched loras☆327Updated last year
- ☆31Updated this week
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆442Updated this week
- 🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…☆231Updated last week
- ☆25Updated this week
- A generative AI-powered framework for testing virtual agents.☆86Updated 3 weeks ago
- ☆18Updated last year
- ☆38Updated this week