aws-neuron / neuronx-nemo-megatron
☆31Updated this week
Related projects: ⓘ
- ☆39Updated this week
- ☆21Updated 5 months ago
- ☆94Updated this week
- ☆13Updated 3 years ago
- Example code for AWS Neuron SDK developers building inference and training applications☆120Updated 2 weeks ago
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆193Updated this week
- Distributed preprocessing and data loading for language datasets☆40Updated 5 months ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆40Updated last year
- ☆62Updated 2 months ago
- A high performance data access library for machine learning tasks☆74Updated 9 months ago
- SageMaker Studio Docker CLI Extension☆13Updated 5 months ago
- Various transformers for FSDP research☆31Updated last year
- Torch Distributed Experimental☆115Updated last month
- A performant, memory-efficient checkpointing library for PyTorch applications, designed with large, complex distributed workloads in mind…☆145Updated this week
- This repository contains example code to build models on TPUs☆30Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆76Updated 2 years ago
- Implementation of a Transformer, but completely in Triton☆242Updated 2 years ago
- ☆18Updated last year
- ☆40Updated this week
- ☆86Updated 2 years ago
- experiments with inference on llama☆106Updated 3 months ago
- Experiment of using Tangent to autodiff triton☆66Updated 7 months ago
- ☆234Updated last month
- This repository compiles prescriptive guidance and code samples for the operationalization of NVIDIA Merlin framework on Google Cloud Ver…☆34Updated 2 years ago
- ☆172Updated this week
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆152Updated this week
- ☆66Updated 3 months ago
- Sagemaker Studio Docker UI Extension☆11Updated 5 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS.☆177Updated this week
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆60Updated this week