kuterd/nv_isa_solver

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kuterd/nv_isa_solver)

kuterd / nv_isa_solver

Nvidia Instruction Set Specification Generator

☆321

Alternatives and similar repositories for nv_isa_solver

Users that are interested in nv_isa_solver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cloudcores / CuAssembler
View on GitHub
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆582Apr 20, 2023Updated 3 years ago
0xD0GF00D / DocumentSASS
View on GitHub
Unofficial description of the CUDA assembly (SASS) instruction sets.
☆211Jul 18, 2025Updated 9 months ago
daadaada / turingas
View on GitHub
Assembler for NVIDIA Volta and Turing GPUs
☆241Jan 13, 2022Updated 4 years ago
ademeure / QuickRunCUDA
View on GitHub
☆20Apr 24, 2026Updated last week
dramforever / ixayoi
View on GitHub
(WIP) A relatively simple pipelined RISC-V core, written in Bluespec SystemVerilog
☆12Sep 9, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
pku-liang / Cement
View on GitHub
The Next-gen Language & Compiler Powering Efficient Hardware Design
☆37Jan 16, 2025Updated last year
RRZE-HPC / gpu-benches
View on GitHub
collection of benchmarks to measure basic GPU capabilities
☆517Oct 24, 2025Updated 6 months ago
NVlabs / ptxmemorymodel
View on GitHub
☆73May 29, 2019Updated 6 years ago
vortexgpgpu / NVPTX-SPIRV-Translator
View on GitHub
The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.
☆45Oct 25, 2021Updated 4 years ago
Qazalin / remu
View on GitHub
RDNA3 emulator
☆61Apr 16, 2026Updated 2 weeks ago
ConvolutedDog / gpgpu-sim-comments
View on GitHub
GPGPU-Sim 中文注释版代码，包含 GPGPU-Sim 模拟器的最新版代码，经过中文注释，以帮助中文用户更好地理解和使用该模拟器。
☆27Dec 18, 2024Updated last year
dglai / FeatGraph
View on GitHub
Sparse kernels for GNNs based on TVM
☆17Nov 18, 2020Updated 5 years ago
geohot / tt-twitch
View on GitHub
tenstorrent kernel from twitch
☆28Mar 16, 2024Updated 2 years ago
NVIDIA / nvbench
View on GitHub
CUDA Kernel Benchmarking Library
☆858Apr 22, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ColfaxResearch / cfx-article-src
View on GitHub
☆186May 7, 2025Updated 11 months ago
HazyResearch / ThunderKittens
View on GitHub
Tile primitives for speedy kernels
☆3,336Updated this week
gpuocelot / gpuocelot
View on GitHub
GPUOcelot: A dynamic compilation framework for PTX
☆223Feb 9, 2025Updated last year
ademeure / cuda-side-boost
View on GitHub
☆57Feb 24, 2026Updated 2 months ago
hkust-adsl / gass
View on GitHub
☆41Apr 3, 2022Updated 4 years ago
IST-DASLab / FP-Quant
View on GitHub
☆107Feb 26, 2026Updated 2 months ago
IBM / triton-dejavu
View on GitHub
Framework to reduce autotune overhead to zero for well known deployments.
☆99Sep 19, 2025Updated 7 months ago
flashinfer-ai / cutlass-viz
View on GitHub
☆66Apr 26, 2025Updated last year
sdiehl / mlir-egglog
View on GitHub
A toy compiler for NumPy array expressions that uses e-graphs and MLIR
☆120Apr 27, 2026Updated last week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dropbox / gemlite
View on GitHub
Fast low-bit matmul kernels in Triton
☆446Apr 27, 2026Updated last week
dougallj / applegpu
View on GitHub
Apple G13 GPU architecture docs and tools
☆657May 16, 2025Updated 11 months ago
kristerw / smtgcc
View on GitHub
Some experiments with SMT solvers and GIMPLE IR
☆79Apr 24, 2026Updated last week
philipturner / metal-benchmarks
View on GitHub
Apple GPU microarchitecture
☆601Sep 22, 2024Updated last year
microsoft / BitBLAS
View on GitHub
BitBLAS is a library to support mixed-precision matrix multiplications, especially for quantized LLM deployment.
☆762Aug 6, 2025Updated 8 months ago
NVIDIA / cccl
View on GitHub
CUDA Core Compute Libraries
☆2,297Apr 28, 2026Updated last week
mikex86 / LibreCuda
View on GitHub
☆1,087May 18, 2025Updated 11 months ago
HPMLL / NVIDIA-Hopper-Benchmark
View on GitHub
☆99May 31, 2025Updated 11 months ago
JuliaLLVM / llvm-downgrade
View on GitHub
Fork of LLVM with support for downgrading bitcode.
☆21May 31, 2025Updated 11 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
SoftSec-KAIST / SURI
View on GitHub
Towards Sound Reassembly of Modern x86-64 Binaries (ASPLOS'25)
☆21Apr 1, 2025Updated last year
brightlaboratory / polydl
View on GitHub
☆11Jun 29, 2021Updated 4 years ago
GVProf / GVProf
View on GitHub
GVProf: A Value Profiler for GPU-based Clusters
☆54Mar 24, 2024Updated 2 years ago
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆32Apr 2, 2025Updated last year
ademeure / DeeperGEMM
View on GitHub
DeeperGEMM: crazy optimized version
☆86May 5, 2025Updated 11 months ago
upc-arco / modern-gpu-simulator-micro-2025
View on GitHub
Simulator code of the paper "Dissecting and Modeling the Architecture of Modern GPU Cores"
☆86Oct 15, 2025Updated 6 months ago
ColfaxResearch / layout-categories
View on GitHub
This repository contains companion software for the Colfax Research paper "Categorical Foundations for CuTe Layouts".
☆130Sep 24, 2025Updated 7 months ago