huggingface / optimum-neuronLinks

Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.

☆235

Alternatives and similar repositories for optimum-neuron

Users that are interested in optimum-neuron are comparing it to the libraries listed below

Sorting:

aws-neuron / transformers-neuronx
☆112Updated 6 months ago
aws-neuron / aws-neuron-samples
Example code for AWS Neuron SDK developers building inference and training applications
☆148Updated this week
aws / sagemaker-huggingface-inference-toolkit
☆264Updated 3 months ago
awslabs / llm-hosting-container
Large Language Model Hosting Container
☆89Updated last week
aws / sagemaker-pytorch-inference-toolkit
Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…
☆141Updated 9 months ago
aws-neuron / neuronx-distributed
☆60Updated last month
cohere-ai / cohere-aws
☆62Updated 3 months ago
aws-neuron / aws-neuron-sdk
Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…
☆533Updated this week
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated last year
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆117Updated 6 months ago
Preemo-Inc / text-generation-inference
☆199Updated last year
aws-samples / llm-evaluation-methodology
☆44Updated 9 months ago
aws-samples / sagemaker-distributed-training-workshop
Hands-on workshop for distributed training and hosting on SageMaker
☆144Updated this week
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆364Updated last month
aws-neuron / upstreaming-to-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆19Updated this week
NVIDIA-NeMo / Run
A tool to configure, launch and manage your machine learning experiments.
☆174Updated this week
huggingface / optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆191Updated this week
huggingface / competitions
☆124Updated 9 months ago
huggingface / doc-builder
The package used to build the documentation of our Hugging Face repos
☆122Updated this week
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆307Updated 2 months ago
huggingface / llm-swarm
Manage scalable open LLM inference endpoints in Slurm clusters
☆268Updated last year
awslabs / extending-the-context-length-of-open-source-llms
☆56Updated last month
huggingface / api-inference-community
☆171Updated 5 months ago
aws-samples / sagemaker-ssh-helper
A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell)
☆249Updated 3 weeks ago
coreweave / tensorizer
Module, Model, and Tensor Serialization/Deserialization
☆250Updated this week
aws-samples / aws-samples-for-ray
☆72Updated last year
aws-samples / awsome-distributed-training
Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.
☆331Updated this week
huggingface / data-is-better-together
Let's build better datasets, together!
☆260Updated 7 months ago
aws-samples / amazon-sagemaker-managed-spot-training
Amazon SageMaker Managed Spot Training Examples
☆51Updated last year
sgugger / torchdynamo-tests
☆19Updated 2 years ago