gpgpu-sim / gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
☆1,140Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for gpgpu-sim_distribution
- This is the top-level repository for the Accel-Sim framework.☆305Updated last month
- ☆337Updated last year
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆307Updated this week
- A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, …☆585Updated last year
- ☆1,248Updated this week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆405Updated last year
- ☆580Updated 3 years ago
- ☆224Updated 2 months ago
- Rodinia benchmark☆169Updated last year
- BookSim 2.0☆278Updated 4 months ago
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆340Updated 2 weeks ago
- GPGPU processor supporting RISCV-V extension, developed with Chisel HDL☆635Updated this week
- Assembler for NVIDIA Maxwell architecture☆953Updated last year
- Berkeley's Spatial Array Generator☆818Updated this week
- DRAMSim2: A cycle accurate DRAM simulator☆256Updated 4 years ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆315Updated this week
- NVDLA SW☆489Updated 3 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆769Updated this week
- Assembler for NVIDIA Volta and Turing GPUs☆202Updated 2 years ago
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and …☆246Updated 4 months ago
- BLISlab: A Sandbox for Optimizing GEMM☆483Updated 3 years ago
- CUDA Kernel Benchmarking Library☆519Updated this week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆521Updated 2 weeks ago
- An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model☆408Updated 4 months ago
- ☆297Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆326Updated 7 months ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.☆1,356Updated this week
- Awesome resources for GPUs☆495Updated last year
- ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime☆224Updated this week
- ☆609Updated this week