abdelfattah-lab/nitro

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/abdelfattah-lab/nitro)

abdelfattah-lab / nitro

Lightweight Python Wrapper for OpenVINO, enabling LLM inference on NPUs

☆29

Alternatives and similar repositories for nitro

Users that are interested in nitro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sgl-project / sgl-kernel-xpu
View on GitHub
SGLang kernel library for Intel XPU
☆27Updated this week
chengzeyi / piflux
View on GitHub
(WIP) Parallel inference for black-forest-labs' FLUX model.
☆19Nov 18, 2024Updated last year
xlite-dev / flux-faster
View on GitHub
A forked version of flux-fast that makes flux-fast even faster with cache-dit, 3.3x speedup on NVIDIA L20.
☆24Jul 18, 2025Updated last year
weishengying / cute_gemm
View on GitHub
☆23Aug 14, 2024Updated last year
Idein / onnigiri
View on GitHub
☆13Jul 10, 2026Updated 2 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
cpldcpu / llmbenchmark
View on GitHub
Various LLM Benchmarks
☆26Feb 20, 2026Updated 5 months ago
zaydzuhri / pythia-mlkv
View on GitHub
Multi-Layer Key-Value sharing experiments on Pythia models
☆34Jun 14, 2024Updated 2 years ago
aws-samples / awsame-distributed-ai-samples
View on GitHub
Cluster doctor skills
☆14May 23, 2026Updated 2 months ago
uw-mad-dash / decoding-speculative-decoding
View on GitHub
☆16Aug 19, 2024Updated last year
inkarkat / vim-SearchHighlighting
View on GitHub
Highlighting of searches via star, auto-highlighting.
☆16Mar 14, 2023Updated 3 years ago
ParCIS / Magicube
View on GitHub
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆92Nov 23, 2022Updated 3 years ago
ferdinandyb / bibtexcite.vim
View on GitHub
Bib(la)tex citations for vim
☆16Oct 16, 2024Updated last year
codeplaysoftware / lldb-msp430
View on GitHub
Codeplay's tutorial LLDB-MSP430 - as presented at the 2016 EuroLLVM Developers' Meeting in Barcelona.
☆10Mar 15, 2016Updated 10 years ago
seifip / google-maps-rpa
View on GitHub
A script to reorganize 'Want to go' Saved places in Google Maps into separate lists by category.
☆11May 14, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Zolewit / TNNdemo
View on GitHub
很好用的tnn classify demo
☆11Mar 24, 2021Updated 5 years ago
canalcache / canal
View on GitHub
☆16May 20, 2019Updated 7 years ago
romain-jacob / triscale
View on GitHub
TriScale software
☆14Apr 23, 2024Updated 2 years ago
sgl-project / sgl-kernel-npu
View on GitHub
SGLang kernel library for NPU
☆170Updated this week
opendatahub-io / caikit-tgis-serving
View on GitHub
☆21Updated this week
specs-feup / lara-framework
View on GitHub
Tools and APIs to develop weavers for the LARA language (LARA Compiler, LARA Interpreter, Weaver Generator, etc...)
☆16Jul 15, 2026Updated last week
SJTU-IPADS / disb
View on GitHub
DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.
☆58Aug 21, 2024Updated last year
efeslab / Atom
View on GitHub
[MLSys'24] Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
☆344Jul 2, 2024Updated 2 years ago
seanzw / gem5-avx
View on GitHub
This adds partial support of AVX2 and AVX-512 to gem5.
☆15Dec 19, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
junmokane / spatially-aware-transformer
View on GitHub
☆10Dec 10, 2024Updated last year
c0x0o / bipbuffer
View on GitHub
a C implementation of Simon Cooke's bipbuffer
☆18Aug 11, 2017Updated 8 years ago
ysc3839 / SingleExeXamlIsland
View on GitHub
☆15May 13, 2024Updated 2 years ago
microsoft / TileFusion
View on GitHub
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆115Jun 28, 2025Updated last year
hpc-ulisboa / UVE
View on GitHub
Unlimited Vector Extension with Data Streaming Support
☆12Nov 25, 2024Updated last year
tud-ccc / RTSim
View on GitHub
☆15Jan 28, 2025Updated last year
flashinfer-ai / cutlass-viz
View on GitHub
☆65Apr 26, 2025Updated last year
siyan-zhao / prepacking
View on GitHub
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆62Oct 11, 2024Updated last year
liuzuyan / ElasticCache
View on GitHub
[ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache
☆43Jul 26, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
slowriot / anubis
View on GitHub
Anonymous, decentralised, git-based webpages for Akash and Radicle.
☆21Oct 19, 2021Updated 4 years ago
RISCV-on-Microsemi-FPGA / CPUs
View on GitHub
CPUs
☆17Dec 21, 2020Updated 5 years ago
dropbox / gemlite
View on GitHub
Fast low-bit matmul kernels in Triton
☆477Jul 15, 2026Updated last week
ChuangtaoChen-TUM / LiveMind
View on GitHub
☆15Apr 15, 2026Updated 3 months ago
Tencent-YouTu / Cplusplus_sdk
View on GitHub
腾讯优图开放平台 c++ sdk
☆15Apr 10, 2019Updated 7 years ago
nomonosound / numpy-minmax
View on GitHub
A fast function (SIMD-accelerated) for finding the minimum and maximum value in a NumPy array
☆15Updated this week