Source for Demystifying GPU Microarchitecture through Microbenchmarking
☆18May 29, 2023Updated 2 years ago
Alternatives and similar repositories for cudabmk
Users that are interested in cudabmk are comparing it to the libraries listed below.
- ☆24May 13, 2015Updated 10 years ago
- Benchmarks for locking algorithms as well as implementations of locking algorithms.☆25Mar 6, 2018Updated 8 years ago
- DeepPerf is a set of CUDA assembly development tools☆10Dec 19, 2018Updated 7 years ago
- ☆10Mar 24, 2022Updated 4 years ago
- Sparse matrix computation library for GPU☆59Jul 12, 2020Updated 5 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆124Oct 26, 2022Updated 3 years ago
- ☆24Mar 22, 2018Updated 8 years ago
- [ICLR 2021: Spotlight] Source code for the paper "A Panda? No, It's a Sloth: Slowdown Attacks on Adaptive Multi-Exit Neural Network Infer…☆14Feb 16, 2022Updated 4 years ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- Page Cache Side Channel Attacks (CVE-2019-5489) proof of concept for Linux☆10Oct 2, 2021Updated 4 years ago
- crc patcher for kernel modules☆12Apr 15, 2016Updated 10 years ago
- ☆12Oct 25, 2022Updated 3 years ago
- Source code of the U-TRR methodology presented in "Uncovering In-DRAM RowHammer Protection Mechanisms: A New Methodology, Custom RowHamme…☆18Nov 15, 2022Updated 3 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Apr 27, 2021Updated 5 years ago
- ☆15Mar 26, 2025Updated last year
- Proof-of-concept implementation of the Obelix software hardening framework, based on LLVM.☆12May 22, 2024Updated last year
- ☆14Feb 5, 2025Updated last year
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 10 months ago
- ECM Factorization on CUDA-GPUs☆15Sep 29, 2020Updated 5 years ago
- 3D telepresence (SBS) platform using Google Cardboard and Raspberry Pi 2☆12Nov 9, 2017Updated 8 years ago
- Generate Serialization Functions for C++ classes and structs using python and libclang☆12Feb 24, 2018Updated 8 years ago
- ☆10May 12, 2022Updated 3 years ago
- A configurable general purpose graphics processing unit for☆12May 18, 2019Updated 6 years ago
- ☆12Jul 13, 2018Updated 7 years ago
- Scott Gray's maxas assembler SGEMM, explaining the (for me) missing parts: https://github.com/NervanaSystems/maxas ☆17Dec 22, 2018Updated 7 years ago
- A repository of tools for verifying constant-timeness☆19Feb 4, 2026Updated 3 months ago
- http response/request parser for rust☆14Aug 10, 2015Updated 10 years ago
- Signal Flow Graph Solver in javascript☆16Aug 5, 2012Updated 13 years ago
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- ☆18May 8, 2021Updated 4 years ago
- Simple virtual machine for a stack-based assembler language.☆21Oct 15, 2017Updated 8 years ago
- distributed transaction processor☆16Apr 29, 2026Updated last week
- libuv bindings for D powering heaploop.io☆24Sep 27, 2016Updated 9 years ago
- dlang bindings for nanomsg☆14Nov 7, 2023Updated 2 years ago
- Specifications and safety proofs in different tools of a simple concurrent algorithm☆24May 24, 2020Updated 5 years ago
- a collection of .config files which work for certain machines☆10Apr 25, 2026Updated last week
- The implementation of our ICLR 2021 work: Targeted Attack against Deep Neural Networks via Flipping Limited Weight Bits☆19Jul 20, 2021Updated 4 years ago
- Code for "On the Trade-off between Adversarial and Backdoor Robustness" (NIPS 2020)☆17Nov 11, 2020Updated 5 years ago