aws-neuron / nki-llama
Project showing how to develop NKI kernels for Llama 3.2 1B inference
☆16 Updated last month
Alternatives and similar repositories for nki-llama
Users who are interested in nki-llama compare it with the repositories listed below.
- ☆38 Updated this week
- ☆14 Updated this week
- ☆59 Updated 2 weeks ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp… ☆12 Updated 9 months ago
- ☆48 Updated last week
- Example code for AWS Neuron SDK developers building inference and training applications ☆148 Updated last month
- ☆14 Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆18 Updated 2 weeks ago
- ☆111 Updated 6 months ago
- Collection of best practices, reference architectures, model training examples and utilities to train large models on AWS. ☆323 Updated this week
- DGXC Benchmarking provides recipes in ready-to-use templates for evaluating performance of specific AI use cases across hardware and soft… ☆17 Updated 2 weeks ago
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications. ☆177 Updated this week
- A schedule language for large model training ☆149 Updated last year
- ☆10 Updated 5 months ago
- ☆56 Updated 9 months ago
- MLIR-based partitioning system ☆103 Updated this week
- Perplexity GPU Kernels ☆395 Updated last month
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the … ☆187 Updated last week
- Synthesizer for optimal collective communication algorithms ☆108 Updated last year
- FM-Leaderboard-er allows you to create leaderboard to find the best LLM/prompt for your own business use case based on your data, task, p… ☆18 Updated 8 months ago
- EFA/NCCL base AMI build Packer and CodeBuild/Pipeline files. Also base Docker build files to enable EFA/NCCL in containers ☆43 Updated last year
- Microsoft Collective Communication Library ☆64 Updated 7 months ago
- ☆62 Updated 5 months ago
- ☆12 Updated 8 months ago
- AI and Memory Wall ☆217 Updated last year
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for… ☆147 Updated last week
- ☆83 Updated 8 months ago
- ☆48 Updated last week
- ☆108 Updated this week
- A resilient distributed training framework ☆95 Updated last year