aws-neuron / nki-reasoningLinks
Project showing how to develop NKI kernels for Llama 3.2 1B inference
☆14Updated 3 weeks ago
Alternatives and similar repositories for nki-reasoning
Users that are interested in nki-reasoning are comparing it to the libraries listed below
Sorting:
- ☆37Updated 3 weeks ago
- ☆14Updated last week
- ☆58Updated last month
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆12Updated 9 months ago
- NVIDIA Resiliency Extension is a python package for framework developers and users to implement fault-tolerant features. It improves the …☆179Updated 2 weeks ago
- ☆47Updated last month
- Microsoft Collective Communication Library☆64Updated 7 months ago
- Ultra and Unified CCL☆165Updated this week
- Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)☆11Updated 3 months ago
- extensible collectives library in triton☆86Updated 2 months ago
- A schedule language for large model training☆149Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆18Updated this week
- ☆11Updated last month
- ☆62Updated last year
- ☆90Updated 5 months ago
- ☆81Updated 7 months ago
- LLM serving cluster simulator☆106Updated last year
- Example code for AWS Neuron SDK developers building inference and training applications☆149Updated 2 weeks ago
- MLIR-based partitioning system☆97Updated this week
- The ASPLOS 2025 / EuroSys 2025 Contest Track☆37Updated last month
- ☆159Updated this week
- FlexFlow Serve: Low-Latency, High-Performance LLM Serving☆42Updated last month
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆176Updated this week
- Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.☆67Updated 3 months ago
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems☆177Updated 8 months ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆144Updated last week
- ☆62Updated 4 months ago
- ☆47Updated 2 years ago
- A resilient distributed training framework☆95Updated last year
- ☆27Updated 6 months ago