awslabs / nki-autotune
☆15 · Updated this week
Alternatives and similar repositories for nki-autotune
Users who are interested in nki-autotune are comparing it to the libraries listed below.
- ☆47 · Updated this week
- ☆60 · Updated last week
- Project showing how to develop NKI kernels for Llama 3.2 1B inference ☆19 · Updated 4 months ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆149 · Updated this week
- ☆39 · Updated 9 months ago
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i… ☆545 · Updated last week
- ☆21 · Updated last week
- ☆111 · Updated 8 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆351 · Updated last week
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the … ☆223 · Updated last week
- ☆12 · Updated 4 months ago
- ☆55 · Updated 2 weeks ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark with Torch -> CUDA problems ☆581 · Updated 2 weeks ago
- A schedule language for large model training ☆151 · Updated last month
- ☆23 · Updated last year
- ☆13 · Updated last week
- ☆121 · Updated 9 months ago
- ☆177 · Updated last year
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆242 · Updated this week
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆44 · Updated 2 years ago
- ☆238 · Updated last year
- ☆242 · Updated this week
- ☆537 · Updated last year
- ☆118 · Updated 6 months ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆213 · Updated last week
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate. ☆328 · Updated this week
- A CLI tool that helps manage training jobs on the SageMaker HyperPod clusters orchestrated by Amazon EKS ☆30 · Updated this week
- Applied AI experiments and examples for PyTorch ☆296 · Updated last month
- Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core. ☆98 · Updated 2 weeks ago
- Perplexity GPU Kernels ☆476 · Updated 2 weeks ago