☆58Feb 10, 2026Updated last month
Alternatives and similar repositories for nki-samples
Users that are interested in nki-samples are comparing it to the libraries listed below
Sorting:
- Project showing how to develop NKI kernels for Llama 3.2 1B inference☆21May 29, 2025Updated 9 months ago
- ☆63Updated this week
- ☆12Dec 20, 2025Updated 3 months ago
- Notebooks and sample code for Build On Trainium☆47Jan 14, 2026Updated 2 months ago
- ☆32Updated this week
- The ASPLOS 2025 / EuroSys 2025 Contest Track☆40Aug 7, 2025Updated 7 months ago
- Autocomp: AI-Driven Code Optimizer for Tensor Accelerators☆86Updated this week
- Example code for AWS Neuron SDK developers building inference and training applications☆158Mar 10, 2026Updated last week
- ☆13Dec 19, 2025Updated 3 months ago
- ☆47Feb 27, 2026Updated 2 weeks ago
- ☆24Oct 30, 2024Updated last year
- This repository features Amazon SageMaker Ground Truth and explains how to ingest raw 3D point cloud data, label it, train a 3D object de…☆13Jun 23, 2022Updated 3 years ago
- ☆43Jan 29, 2026Updated last month
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆583Updated this week
- MLSys competition for the best MOE NKI kernels☆39Updated this week
- ☆27Jan 22, 2026Updated last month
- ☆41Oct 9, 2025Updated 5 months ago
- ☆39Dec 19, 2024Updated last year
- ☆14Aug 29, 2023Updated 2 years ago
- ☆14May 29, 2024Updated last year
- ☆21Mar 12, 2026Updated last week
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆15Mar 19, 2023Updated 3 years ago
- AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia inst…☆21Feb 27, 2026Updated 3 weeks ago
- ☆10Mar 8, 2025Updated last year
- Streamlit MongoDB Connector: An efficient connector for interfacing MongoDB with Streamlit apps, developed for the Streamlit Connections …☆11Dec 19, 2023Updated 2 years ago
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- Dual-Core Out-of-Order MIPS CPU Design☆21May 8, 2025Updated 10 months ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- The repository guides you through generating a synthetic dataset for a QA-RAG application using the Bedrock API, Python and Langchain.☆20Sep 17, 2024Updated last year
- Hands-on workshop for distributed training and hosting on SageMaker☆153Nov 4, 2025Updated 4 months ago
- ☆15Apr 20, 2022Updated 3 years ago
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆17Nov 20, 2025Updated 3 months ago
- Program synthesis tools and utilities for LLVM.☆20Jul 6, 2023Updated 2 years ago
- A RISC-V Symmetric Multiprocessor(SMP) based on TileLink and can run Linux OS☆35Oct 23, 2025Updated 4 months ago
- TVM for Tenstorrent ASICs☆28Sep 8, 2025Updated 6 months ago
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan.☆14Feb 4, 2025Updated last year
- Evaluating language models on word puzzle games☆10Oct 25, 2024Updated last year
- A new DRAM substrate that mitigates the excessive energy consumption from both (i) transmitting unused data on the memory channel and (i…☆14Aug 23, 2024Updated last year