gpgpu-sim / gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism, as well as a performance visualization tool, AerialVision, and an integrated energy model, GPUWattch.
☆1,483 · Updated 8 months ago
Alternatives and similar repositories for gpgpu-sim_distribution
Users who are interested in gpgpu-sim_distribution are comparing it to the repositories listed below.
- This is the top-level repository for the Accel-Sim framework. ☆503 · Updated 2 weeks ago
- An unofficial cuda assembler, for all generations of SASS, hopefully :) ☆553 · Updated 2 years ago
- A Fast and Extensible DRAM Simulator, with built-in support for modeling many different DRAM technologies including DDRx, LPDDRx, GDDRx, … ☆667 · Updated 2 years ago
- ☆647 · Updated 4 years ago
- GPGPU processor supporting RISCV-V extension, developed with Chisel HDL ☆822 · Updated 3 weeks ago
- ☆1,742 · Updated this week
- Assembler for NVIDIA Maxwell architecture ☆1,049 · Updated 2 years ago
- Berkeley's Spatial Array Generator ☆1,105 · Updated this week
- An MLIR-based toolchain for AMD AI Engine-enabled devices. ☆522 · Updated this week
- Timeloop performs modeling, mapping and code-generation for tensor algebra workloads on various accelerator architectures. ☆429 · Updated last month
- ☆288 · Updated last month
- Rodinia benchmark ☆192 · Updated 2 years ago
- An integrated cache and memory access time, cycle time, area, leakage, and dynamic power model ☆505 · Updated last year
- A highly-flexible GPU simulator for AMD GPUs. ☆196 · Updated last week
- The official repository for the gem5 computer-system architecture simulator. ☆2,290 · Updated this week
- Awesome resources for GPUs ☆600 · Updated 2 years ago
- NVDLA SW ☆508 · Updated 4 years ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,264 · Updated 2 months ago
- Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and … ☆431 · Updated 3 weeks ago
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures). ☆654 · Updated this week
- ChampSim is an open-source trace based simulator maintained at Texas A&M University and through the support of the computer architecture … ☆646 · Updated last week
- collection of benchmarks to measure basic GPU capabilities ☆451 · Updated 3 weeks ago
- HeteroCL: A Multi-Paradigm Programming Infrastructure for Software-Defined Heterogeneous Computing (FPGA'19 Best Paper) ☆341 · Updated last year
- ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale ☆459 · Updated last week
- BookSim 2.0 ☆380 · Updated last year
- DRAMsim3: a Cycle-accurate, Thermal-Capable DRAM Simulator ☆418 · Updated last year
- CUDA Kernel Benchmarking Library ☆762 · Updated 3 weeks ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators ☆484 · Updated last week
- Hands-On Practical MLIR Tutorial ☆649 · Updated 2 years ago
- Examples demonstrating available options to program multiple GPUs in a single node or a cluster ☆826 · Updated last month