hyqneuron/asfermi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hyqneuron/asfermi)

hyqneuron / asfermi

assembler for NVIDIA FERMI. Imported from Google Code

☆77

Alternatives and similar repositories for asfermi

Users that are interested in asfermi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xiuxiazhang / KeplerAs
View on GitHub
An Open Source Kepler GPU Assembler
☆22Jan 23, 2017Updated 9 years ago
PAA-NCIC / PPoPP2017_artifact
View on GitHub
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆86Oct 8, 2019Updated 6 years ago
daadaada / turingas
View on GitHub
Assembler for NVIDIA Volta and Turing GPUs
☆246Jan 13, 2022Updated 4 years ago
cloudcores / CuAssembler
View on GitHub
An unofficial cuda assembler, for all generations of SASS, hopefully ：）
☆609Apr 20, 2023Updated 3 years ago
gpgpu-sim / cutlass-gpgpu-sim
View on GitHub
☆28Oct 26, 2019Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Stefan20162016 / maxas-explained
View on GitHub
maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas
☆17Dec 22, 2018Updated 7 years ago
daadaada / gas
View on GitHub
☆49Dec 11, 2020Updated 5 years ago
decodecudabinary / Decoding-CUDA-Binary
View on GitHub
☆55Nov 21, 2019Updated 6 years ago
CMU-SAFARI / Mosaic
View on GitHub
Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…
☆49Aug 21, 2018Updated 7 years ago
NVlabs / SASSI
View on GitHub
Flexible GPGPU instrumentation
☆91Oct 10, 2019Updated 6 years ago
NVlabs / ptxmemorymodel
View on GitHub
☆77May 29, 2019Updated 7 years ago
codyjrivera / tsm2x-imp
View on GitHub
Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA
☆35Jul 28, 2020Updated 5 years ago
adwaitjog / mafia
View on GitHub
MAFIA: Multiple Application Framework for GPU architectures
☆28Jan 21, 2022Updated 4 years ago
ekondis / gpumembench
View on GitHub
A GPU benchmark suite for assessing on-chip GPU memory bandwidth
☆113Aug 12, 2017Updated 8 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
connorjan / llvm-cjg
View on GitHub
An LLVM backend for my custom 32-bit RISC CPU https://scholarworks.rit.edu/theses/9550/
☆14Aug 16, 2017Updated 8 years ago
gpgpu-sim / gpgpu-sim_distribution
View on GitHub
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for…
☆1,675Feb 15, 2025Updated last year
knotman90 / cuStreamComp
View on GitHub
Efficient CUDA Stream Compaction Library
☆34Jun 9, 2023Updated 3 years ago
MooreThreads / MT-flashMLA
View on GitHub
Fork from https://github.com/deepseek-ai/FlashMLA
☆17Feb 26, 2025Updated last year
ROCm / ROCdbgapi
View on GitHub
The AMD Debugger API is a library that provides all the support necessary for a debugger and other tools to perform low level control of …
☆19Updated this week
cudpp / cudpp
View on GitHub
CUDA Data Parallel Primitives Library
☆438Nov 9, 2018Updated 7 years ago
NVlabs / NVBit
View on GitHub
☆341Apr 6, 2026Updated 3 months ago
csl-iisc / iGUARD-SOSP21
View on GitHub
Race detector for NVIDIA GPUs, published in SOSP 2021.
☆19Feb 22, 2025Updated last year
sderek / CUDAAdvisor
View on GitHub
CUDAAdvisor: a GPU profiling tool
☆53Aug 24, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CNugteren / CLCudaAPI
View on GitHub
A portable high-level API with CUDA or OpenCL back-end
☆56Oct 8, 2017Updated 8 years ago
CMU-SAFARI / Cache-Memory-Hog
View on GitHub
Cache and main memory hog programs. These are programs with specific access patterns to evict the already existing cache blocks of variou…
☆19Nov 2, 2016Updated 9 years ago
balidani / gcnasm
View on GitHub
GCN ISA assembler tool for my GSoC project at Openwall
☆35Jan 4, 2016Updated 10 years ago
hughperkins / neonCl-underconstruction
View on GitHub
experimental port of nervana neon kernels in OpenCL
☆11Jul 24, 2016Updated 9 years ago
llnl / pLiner
View on GitHub
pLiner is a framework that helps programmers identify locations in the source of numerical code that are highly affected by compiler opti…
☆17Oct 27, 2023Updated 2 years ago
lwakefield / libgpucrypto
View on GitHub
☆26May 13, 2015Updated 11 years ago
GeorgOfenbeck / perfplot
View on GitHub
tools to create performance and roofline plots from measured data
☆61Jun 10, 2014Updated 12 years ago
PAA-NCIC / DeepPerf
View on GitHub
DeepPerf is a set of cuda assembling developing tools
☆11Dec 19, 2018Updated 7 years ago
ekondis / gpuroofperf-toolkit
View on GitHub
A GPU performance prediction toolkit for CUDA programs
☆18Mar 25, 2019Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lanl / PPT
View on GitHub
Performance Prediction Toolkit
☆58Sep 13, 2025Updated 10 months ago
XiuYuLi / flexible-gemm
View on GitHub
flexible-gemm conv of deepcore
☆17Dec 2, 2019Updated 6 years ago
tallendev / uvm-eval
View on GitHub
This serves as a repository for reproducibility of the SC21 paper "In-Depth Analyses of Unified Virtual Memory System for GPU Accelerated…
☆37Sep 25, 2023Updated 2 years ago
csl-iisc / SUV-MICRO24
View on GitHub
☆13Oct 6, 2024Updated last year
GraphStreamingProject / GraphZeppelin
View on GitHub
Open-source library for Graph Streaming. Solves the connected components problem using sub-linear space. Published in SIGMOD'22.
☆11Apr 6, 2026Updated 3 months ago
RRZE-HPC / gpu-benches
View on GitHub
collection of benchmarks to measure basic GPU capabilities
☆530Oct 24, 2025Updated 8 months ago
mcrl / DeepLearningTrainingScripts
View on GitHub
☆17Dec 28, 2021Updated 4 years ago