Source for Demystifying GPU Microarchitecture through Microbenchmarking
☆18May 29, 2023Updated 2 years ago
Alternatives and similar repositories for cudabmk
Users that are interested in cudabmk are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆25Jun 24, 2022Updated 3 years ago
- A quick way to benchmark your CUDA compiler on a Linux environment☆27Mar 16, 2011Updated 15 years ago
- ☆24May 13, 2015Updated 10 years ago
- Benchmarks for locking algorithms as well as implementations of locking algorithms.☆27Mar 6, 2018Updated 8 years ago
- DeepPerf is a set of cuda assembling developing tools☆10Dec 19, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Slides for eMMC Hacking 2017☆16Apr 27, 2018Updated 7 years ago
- Sparse matrix computation library for GPU☆59Jul 12, 2020Updated 5 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- Function hook, code injection, monitoring☆14Jul 12, 2018Updated 7 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆122Oct 26, 2022Updated 3 years ago
- ☆24Mar 22, 2018Updated 8 years ago
- An implementation of the Latent Skill Embedding model☆10Feb 19, 2016Updated 10 years ago
- An EDM-enabled PHY + a rack-level network simulator☆14Dec 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A Swift library to write parsers for domain specific languages.☆15Nov 3, 2020Updated 5 years ago
- Page Cache Side Channel Attacks (CVE-2019-5489) proof of concept for Linux☆10Oct 2, 2021Updated 4 years ago
- crc patcher for kernel modules☆12Apr 15, 2016Updated 9 years ago
- ☆12Oct 25, 2022Updated 3 years ago
- Source code of the U-TRR methodology presented in "Uncovering In-DRAM RowHammer Protection Mechanisms: A New Methodology, Custom RowHamme…☆17Nov 15, 2022Updated 3 years ago
- Eliminating Keystroke Timing Attacks☆22Dec 12, 2017Updated 8 years ago
- Some examples of the Cpp target.☆10Nov 7, 2023Updated 2 years ago
- FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings☆13Apr 12, 2023Updated 2 years ago
- Tools for Cellular Exploitation on a Global Scale - Blackhat USA 2014☆24Feb 12, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- raccoon engine for kha [formerly lkl]☆11Sep 2, 2019Updated 6 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆14Apr 27, 2021Updated 4 years ago
- ☆15Mar 26, 2025Updated last year
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆11Jun 20, 2025Updated 9 months ago
- Datasets from CHES papers on random delays☆13Apr 13, 2021Updated 4 years ago
- EDL mode downloaded for Samsung A505FN based on smdk-tools-v0.20☆20Jun 16, 2020Updated 5 years ago
- A GPU performance prediction toolkit for CUDA programs☆19Mar 25, 2019Updated 7 years ago
- A SoC for DOOM☆20Apr 11, 2021Updated 4 years ago
- ECM Factorization on CUDA-GPUs☆14Sep 29, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- C++ type_name template utilities for pretty-printing type names☆13Feb 1, 2019Updated 7 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- A configurable general purpose graphics processing unit for☆12May 18, 2019Updated 6 years ago
- Implemented a two-level (L1 and L2) cache simulator in C++ with round robin eviction policy☆10Jan 4, 2017Updated 9 years ago
- maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas☆17Dec 22, 2018Updated 7 years ago
- IDApro idc and idapython script collection☆28Aug 22, 2023Updated 2 years ago
- A repository of tools for verifying constant-timeness☆19Feb 4, 2026Updated last month