Source for Demystifying GPU Microarchitecture through Microbenchmarking
☆18 · May 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for cudabmk
Users that are interested in cudabmk are comparing it to the libraries listed below.
- ☆25 · Jun 24, 2022 · Updated 3 years ago
- Benchmarks for locking algorithms as well as implementations of locking algorithms. ☆25 · Mar 6, 2018 · Updated 8 years ago
- DeepPerf is a set of cuda assembling developing tools ☆11 · Dec 19, 2018 · Updated 7 years ago
- ☆10 · Mar 24, 2022 · Updated 4 years ago
- Slides for eMMC Hacking 2017 ☆16 · Apr 27, 2018 · Updated 8 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆45 · Oct 25, 2021 · Updated 4 years ago
- How to use node-local MPI rank IDs to manually map MPI ranks to GPUs ☆14 · Apr 22, 2020 · Updated 6 years ago
- Function hook, code injection, monitoring ☆14 · Jul 12, 2018 · Updated 7 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆31 · Jul 7, 2020 · Updated 5 years ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators ☆124 · Oct 26, 2022 · Updated 3 years ago
- ☆24 · Mar 22, 2018 · Updated 8 years ago
- [ICLR 2021: Spotlight] Source code for the paper "A Panda? No, It's a Sloth: Slowdown Attacks on Adaptive Multi-Exit Neural Network Infer… ☆14 · Feb 16, 2022 · Updated 4 years ago
- An EDM-enabled PHY + a rack-level network simulator ☆14 · Dec 11, 2024 · Updated last year
- A Swift library to write parsers for domain specific languages. ☆15 · Nov 3, 2020 · Updated 5 years ago
- Page Cache Side Channel Attacks (CVE-2019-5489) proof of concept for Linux ☆10 · Oct 2, 2021 · Updated 4 years ago
- OpenVLA for AIRBOT ☆15 · Aug 15, 2024 · Updated last year
- Python bindings for the PMDK. Non-volatile memory for Python. ☆13 · Mar 22, 2023 · Updated 3 years ago
- ☆12 · Oct 25, 2022 · Updated 3 years ago
- Eliminating Keystroke Timing Attacks ☆22 · Dec 12, 2017 · Updated 8 years ago
- Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation ☆11 · Mar 23, 2021 · Updated 5 years ago
- FastSpec: Scalable Generation and Detection of Spectre Gadgets Using Neural Embeddings ☆13 · Apr 12, 2023 · Updated 3 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication ☆14 · Apr 27, 2021 · Updated 5 years ago
- ☆15 · Mar 26, 2025 · Updated last year
- Proof-of-concept implementation of the Obelix software hardening framework, based on LLVM. ☆12 · May 22, 2024 · Updated 2 years ago
- ECM Factorization on CUDA-GPUs ☆15 · Sep 29, 2020 · Updated 5 years ago
- 3d Telepresence(SBS) Platform using Google Cardboard and Raspberry Pi 2 ☆12 · Nov 9, 2017 · Updated 8 years ago
- A SoC for DOOM ☆20 · Apr 11, 2021 · Updated 5 years ago
- Generate Serialization Functions for C++ classes and structs using python and libclang ☆12 · Feb 24, 2018 · Updated 8 years ago
- ☆10 · May 12, 2022 · Updated 4 years ago
- Implemented a two-level (L1 and L2) cache simulator in C++ with round robin eviction policy ☆10 · Jan 4, 2017 · Updated 9 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · May 12, 2024 · Updated 2 years ago
- A configurable general purpose graphics processing unit for ☆12 · May 18, 2019 · Updated 7 years ago
- ☆12 · Jul 13, 2018 · Updated 7 years ago
- maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas ☆17 · Dec 22, 2018 · Updated 7 years ago
- IDApro idc and idapython script collection ☆28 · Aug 22, 2023 · Updated 2 years ago
- A field theory inspired xAct package for Mathematica ☆17 · Jul 28, 2016 · Updated 9 years ago
- A repository of tools for verifying constant-timeness ☆19 · Feb 4, 2026 · Updated 3 months ago
- Yet another Game Boy emulator ☆22 · Sep 4, 2022 · Updated 3 years ago
- http response/request parser for rust ☆15 · Aug 10, 2015 · Updated 10 years ago