gpgpu-sim / gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVision, and an integrated energy model, GPUWattch.
☆1,312 Updated 2 months ago
Alternatives and similar repositories for gpgpu-sim_distribution:
Users who are interested in gpgpu-sim_distribution are comparing it to the libraries listed below:
- This is the top-level repository for the Accel-Sim framework. ☆397 Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆493 Updated 2 years ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆387 Updated this week
- Assembler for NVIDIA Maxwell architecture ☆996 Updated 2 years ago
- A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, … ☆631 Updated last year
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures. ☆380 Updated last month
- ☆620 Updated 4 years ago
- Rodinia benchmark ☆178 Updated 2 years ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,080 Updated last month
- Berkeley's Spatial Array Generator ☆937 Updated 3 weeks ago
- ☆244 Updated 2 months ago
- CUDA Kernel Benchmarking Library ☆629 Updated this week
- collection of benchmarks to measure basic GPU capabilities ☆369 Updated 2 months ago
- Awesome resources for GPUs ☆567 Updated last year
- An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model ☆453 Updated 10 months ago
- ☆350 Updated last year
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆847 Updated this week
- A highly-flexible GPU simulator for AMD GPUs. ☆138 Updated this week
- NVDLA SW ☆497 Updated 4 years ago
- GPGPU processor supporting RISCV-V extension, developed with Chisel HDL ☆729 Updated 2 weeks ago
- An MLIR-based compiler framework that bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures). ☆585 Updated 3 weeks ago
- ☆326 Updated last week
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP) ☆397 Updated 3 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆697 Updated 2 months ago
- BLISlab: A Sandbox for Optimizing GEMM ☆515 Updated 3 years ago
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem. ☆1,517 Updated this week
- An open-source reimplementation of Google's Tensor Processing Unit (TPU). ☆429 Updated 7 years ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing ☆338 Updated last year
- HIPIFY: Convert CUDA to Portable C++ Code ☆574 Updated this week
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and … ☆325 Updated last month