gpgpu-sim / gpgpu-sim_distributionLinks
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
☆1,447Updated 7 months ago
Alternatives and similar repositories for gpgpu-sim_distribution
Users that are interested in gpgpu-sim_distribution are comparing it to the libraries listed below
Sorting:
- This is the top-level repository for the Accel-Sim framework.☆477Updated last week
- An unofficial cuda assembler, for all generations of SASS, hopefully :)☆539Updated 2 years ago
- A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, …☆657Updated 2 years ago
- ☆643Updated 4 years ago
- Assembler for NVIDIA Maxwell architecture☆1,033Updated 2 years ago
- Berkeley's Spatial Array Generator☆1,061Updated last month
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures.☆420Updated last week
- ☆278Updated last week
- NVDLA SW☆506Updated 4 years ago
- ☆1,677Updated last week
- GPGPU processor supporting RISCV-V extension, developed with Chisel HDL☆808Updated last week
- A highly-flexible GPU simulator for AMD GPUs.☆189Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆485Updated last week
- Rodinia benchmark☆188Updated 2 years ago
- An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model☆492Updated last year
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆423Updated 8 months ago
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and …☆400Updated 2 months ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster☆806Updated last week
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆634Updated this week
- collection of benchmarks to measure basic GPU capabilities☆422Updated 7 months ago
- ☆365Updated 2 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure☆921Updated this week
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing☆341Updated last year
- The official repository for the gem5 computer-system architecture simulator.☆2,215Updated this week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology☆1,228Updated last month
- Hands-On Practical MLIR Tutorial☆603Updated last year
- BookSim 2.0☆368Updated last year
- A list of tutorials, paper, talks, and open-source projects for emerging compiler and architecture☆501Updated 8 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆471Updated this week
- CUDA Kernel Benchmarking Library☆728Updated 2 weeks ago