romnn / microgpusim
Cycle-level, trace-driven, parallel GPU simulator for NVIDIA Pascal.
☆12Updated 11 months ago
Alternatives and similar repositories for microgpusim:
Users that are interested in microgpusim are comparing it to the libraries listed below
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆16Updated 2 years ago
- ☆28Updated 2 years ago
- Verilog AST☆21Updated last year
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.)☆31Updated 10 months ago
- Embedded Universal DSL: a good DSL for us, by us☆32Updated this week
- A binary instrumentation tool to analyze load instructions in any off-the-shelf x86(-64) program. Described by Bera et al. in https://arx…☆20Updated 8 months ago
- Meta-Repository for Bespoke Silicon Group's Manycore Architecture (A.K.A HammerBlade)☆39Updated this week
- ☆40Updated this week
- The Splash-3 benchmark suite☆43Updated last year
- ☆33Updated 8 months ago
- DRAM error-correction code (ECC) simulator incorporating statistical error properties and DRAM design characteristics for inferring pre-c…☆10Updated last year
- Ocolos is the first online code layout optimization system for unmodified applications written in unmanaged languages.☆52Updated last year
- GenStore is the first in-storage processing system designed for genome sequence analysis that greatly reduces both data movement and comp…☆13Updated 2 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆37Updated 3 years ago
- ☆53Updated 5 years ago
- Branch predictor simulation framework for the Last-Level Branch Predictor☆21Updated 7 months ago
- Bridging polyhedral analysis tools to the MLIR framework☆109Updated last year
- Unit benchmarks of CUDA event APIs.☆17Updated 11 months ago
- A Hardware Pipeline Description Language☆44Updated last year
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆24Updated 2 weeks ago
- A framework that support executing unmodified CUDA source code on non-NVIDIA devices.☆116Updated 2 months ago
- compiling DSLs to high-level hardware instructions☆22Updated 2 years ago
- A collection of RISC-V Vector (RVV) benchmarks to help developers write portably performant RVV code☆107Updated this week
- Website for CS 265☆28Updated 2 months ago
- Example for running IREE in a bare-metal Arm environment.☆32Updated 3 weeks ago
- A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://g…☆71Updated 9 months ago
- Tools and experiments for 0sim. Simulate system software behavior on machines with terabytes of main memory from your desktop.☆21Updated 4 years ago
- gem5 simulator with a gpgpu+graphics GPU model☆54Updated 4 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.☆38Updated 3 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.☆46Updated 5 years ago