aws-neuron / nki-samples
☆28Updated last month
Alternatives and similar repositories for nki-samples:
Users that are interested in nki-samples are comparing it to the libraries listed below
- ☆52Updated last month
- ☆35Updated 2 months ago
- ☆11Updated this week
- ☆102Updated last month
- ☆23Updated 11 months ago
- A schedule language for large model training☆145Updated 8 months ago
- Example code for AWS Neuron SDK developers building inference and training applications☆137Updated last month
- extensible collectives library in triton☆83Updated 5 months ago
- ☆23Updated 3 months ago
- ☆14Updated 3 years ago
- ☆44Updated last year
- ☆23Updated 3 months ago
- ☆73Updated 4 months ago
- Distributed preprocessing and data loading for language datasets☆39Updated 11 months ago
- ☆100Updated 6 months ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆42Updated last year
- ☆52Updated 9 months ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆38Updated 2 years ago
- Home for OctoML PyTorch Profiler☆107Updated last year
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆23Updated 3 months ago
- An IR for efficiently simulating distributed ML computation.☆28Updated last year
- ☆43Updated last month
- Python package for rematerialization-aware gradient checkpointing☆24Updated last year
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems☆227Updated last week
- Memory Optimizations for Deep Learning (ICML 2023)☆62Updated last year
- A resilient distributed training framework☆89Updated 11 months ago
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.☆219Updated this week
- A parallel framework for training deep neural networks☆56Updated this week