aws-neuron / nki-samples
☆34Updated last month
Alternatives and similar repositories for nki-samples:
Users that are interested in nki-samples are comparing it to the libraries listed below
- ☆13Updated last month
- ☆55Updated last month
- ☆14Updated this week
- ☆35Updated 4 months ago
- Python package for rematerialization-aware gradient checkpointing☆24Updated last year
- extensible collectives library in triton☆86Updated last month
- ☆24Updated last year
- A schedule language for large model training☆146Updated 10 months ago
- ☆79Updated 6 months ago
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆14Updated 4 years ago
- ☆23Updated 5 months ago
- ☆27Updated 3 months ago
- (NeurIPS 2022) Automatically finding good model-parallel strategies, especially for complex models and clusters.☆38Updated 2 years ago
- Example code for AWS Neuron SDK developers building inference and training applications☆143Updated last week
- ☆43Updated last year
- ☆72Updated 4 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆15Updated last week
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆11Updated 7 months ago
- A resilient distributed training framework☆95Updated last year
- ☆107Updated 3 months ago
- ☆27Updated 5 months ago
- Stateful LLM Serving☆65Updated last month
- PyTorch bindings for CUTLASS grouped GEMM.☆88Updated last week
- Effective transpose on Hopper GPU☆17Updated last week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers☆43Updated last year
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆23Updated 4 months ago
- Home for OctoML PyTorch Profiler☆113Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆41Updated last year
- LLM-Inference-Bench☆40Updated 4 months ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components.☆194Updated this week