☆111Jan 16, 2025Updated last year
Alternatives and similar repositories for transformers-neuronx
Users that are interested in transformers-neuronx are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Example code for AWS Neuron SDK developers building inference and training applications☆158Apr 2, 2026Updated 3 weeks ago
- ☆39Dec 19, 2024Updated last year
- ☆64Apr 9, 2026Updated 3 weeks ago
- Training and inference on AWS Trainium and Inferentia chips.☆267Apr 16, 2026Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆25Mar 5, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆597Apr 22, 2026Updated last week
- ☆63Apr 22, 2026Updated last week
- ☆22Mar 27, 2023Updated 3 years ago
- Cluster doctor skills☆15Feb 20, 2026Updated 2 months ago
- A universal scalable machine learning model deployment solution☆252Apr 24, 2026Updated last week
- Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stac…☆255Apr 11, 2025Updated last year
- ☆24Nov 18, 2025Updated 5 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆409Apr 21, 2026Updated last week
- One stop shop for running AI/ML on AWS.☆1,152Updated this week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker☆14Dec 20, 2023Updated 2 years ago
- Learn how to use Transformer-based models for named-entity recognition (NER) tasks and how to analyze various model features, constraints…☆16Jun 29, 2022Updated 3 years ago
- ☆23Aug 21, 2025Updated 8 months ago
- A training framework for large-scale language models based on Megatron-Core, the COOM Training Framework is designed to efficiently handl…☆26Nov 14, 2025Updated 5 months ago
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- ☆43Jan 29, 2026Updated 3 months ago
- notebooks on langchain and llamaindex experiments☆11Nov 2, 2023Updated 2 years ago
- Large Language Model Hosting Container☆92Apr 13, 2026Updated 2 weeks ago
- Toolkit for allowing inference and serving with PyTorch on SageMaker. Dockerfiles used for building SageMaker Pytorch Containers are at h…☆142Oct 7, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Foundation Model Evaluations Library☆284Aug 7, 2025Updated 8 months ago
- Documenting my AI journey☆20Feb 22, 2023Updated 3 years ago
- ☆39Oct 3, 2022Updated 3 years ago
- Retrieval-Augmented Generation battle!☆66Apr 18, 2026Updated last week
- A template for the easy creation of a custom Dialogflow Webhook to handle fulfillment.☆12Jan 7, 2023Updated 3 years ago
- ☆34Apr 21, 2026Updated last week
- Google TPU optimizations for transformers models☆138Jan 23, 2026Updated 3 months ago
- Home for OctoML PyTorch Profiler☆113Apr 24, 2023Updated 3 years ago
- KubeFlow on AWS☆188Apr 13, 2026Updated 2 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs☆45Updated this week
- MLIR-based partitioning system☆180Updated this week
- ☆20Nov 23, 2022Updated 3 years ago
- ☆19Nov 8, 2024Updated last year
- ☆302Apr 23, 2026Updated last week
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- ☆89Aug 23, 2023Updated 2 years ago