☆68Jun 22, 2026Updated last week
Alternatives and similar repositories for nki-samples
Users that are interested in nki-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Updated this week
- Project showing how to develop NKI kernels for Llama 3.2 1B inference☆21May 29, 2025Updated last year
- ☆13Dec 20, 2025Updated 6 months ago
- Notebooks and sample code for Build On Trainium☆48Jan 14, 2026Updated 5 months ago
- Example code for AWS Neuron SDK developers building inference and training applications☆161May 20, 2026Updated last month
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆37May 14, 2026Updated last month
- The ASPLOS 2025 / EuroSys 2025 Contest Track☆41Jun 4, 2026Updated 3 weeks ago
- ☆18May 9, 2024Updated 2 years ago
- ☆13Dec 19, 2025Updated 6 months ago
- Autocomp: Optimize any AI kernel, anywhere.☆139Updated this week
- This repository features Amazon SageMaker Ground Truth and explains how to ingest raw 3D point cloud data, label it, train a 3D object de…☆13Jun 23, 2022Updated 4 years ago
- ☆43Jan 29, 2026Updated 5 months ago
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆611Jun 18, 2026Updated last week
- ☆64Jun 2, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A high-throughput and memory-efficient inference and serving engine for LLMs☆26Jun 9, 2026Updated 3 weeks ago
- Accelerator Zoo☆20Oct 14, 2025Updated 8 months ago
- ☆19Apr 24, 2022Updated 4 years ago
- ☆39Dec 19, 2024Updated last year
- ☆14Aug 29, 2023Updated 2 years ago
- ☆22Apr 17, 2025Updated last year
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- AWS Neuron Deep Learning Containers (DLCs) are a set of Docker images for training and serving models on AWS Trainium and Inferentia inst…☆22Jun 19, 2026Updated last week
- ☆15Apr 20, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python package for rematerialization-aware gradient checkpointing☆27Oct 31, 2023Updated 2 years ago
- Using FlexAttention to compute attention with different masking patterns☆47Sep 22, 2024Updated last year
- Hands-on workshop for distributed training and hosting on SageMaker☆152Jun 16, 2026Updated 2 weeks ago
- SIMPLER MAGIC: Synthesis and In-memory MaPping of Logic Execution in a single Row for Memristor Aided loGIC☆13Dec 5, 2019Updated 6 years ago
- Fast and simple C++ DSP engine with high-quality effects. Originally built for PhantomAmp, an Android app for rootless system-wide audio…☆17Aug 21, 2023Updated 2 years ago
- A DL compiler fuzzer☆14Nov 1, 2024Updated last year
- In this repository, we will present techniques to detect covariate drift, and demonstrate how to incorporate your own custom drift detect…☆12May 26, 2021Updated 5 years ago
- TVM for Tenstorrent ASICs☆31Apr 29, 2026Updated 2 months ago
- This repository will soon contain all scripts and links to the annotated corpora of Tibetan.☆14Feb 4, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆62Feb 5, 2026Updated 4 months ago
- Mallacc: Accelerating Memory Allocation☆13Jan 2, 2018Updated 8 years ago
- A new DRAM substrate that mitigates the excessive energy consumption from both (i) transmitting unused data on the memory channel and (i…☆14Aug 23, 2024Updated last year
- Small sample programs that use LLVM and Clang APIs.☆52Jan 14, 2019Updated 7 years ago
- ☆22Dec 11, 2024Updated last year
- A retargetable and extensible synthesis-based compiler for modern hardware architectures☆18Nov 20, 2025Updated 7 months ago
- Samoyeds: Accelerating MoE Models with Structured Sparsity Leveraging Sparse Tensor Cores (EuroSys'25)☆15Jul 17, 2025Updated 11 months ago