sunggg / SRTuner
SRTuner is a Python library that provides efficient auto-tuning building blocks.
☆7 Updated 3 years ago
Alternatives and similar repositories for SRTuner
Users who are interested in SRTuner are comparing it to the libraries listed below.
- ☆17 Updated last year
- Architecture-level Fault Injection Tool for GPU Application Resilience Evaluation ☆67 Updated last year
- PCMCsim: An Accurate Phase-Change Memory Controller Simulator and its Performance Analysis (ISPASS 2022) ☆9 Updated 11 months ago
- ☆18 Updated 5 years ago
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git) ☆39 Updated 3 years ago
- CUDAAdvisor: a GPU profiling tool ☆49 Updated 6 years ago
- ☆40 Updated last week
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorch ☆17 Updated 2 years ago
- ☆12 Updated 3 years ago
- ☆31 Updated 3 years ago
- DiscoPoP - Discovery of Potential Parallelism ☆45 Updated last week
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware" ☆29 Updated 3 years ago
- Data-Centric MLIR dialect ☆42 Updated last year
- ☆30 Updated 2 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loops… ☆94 Updated 2 years ago
- Integer Set Library (source repository: http://repo.or.cz/w/isl.git) ☆71 Updated 5 months ago
- SST Macro Element Library ☆37 Updated 3 weeks ago
- GPTPU for SC 2021 ☆52 Updated 2 years ago
- ☆18 Updated 3 years ago
- A translation validation framework for MLIR ☆88 Updated 4 months ago
- Race detector for NVIDIA GPUs, published in SOSP 2021. ☆18 Updated 5 months ago
- GPU Performance Advisor ☆65 Updated 3 years ago
- Torch Frontend for IREE ☆25 Updated last year
- Python wrapper for isl, an integer set library ☆77 Updated last week
- ☆38 Updated 3 years ago
- ☆15 Updated 2 years ago
- Library to interface Compilers and ML models for ML-Enabled Compiler Optimizations ☆18 Updated last month
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆15 Updated 6 years ago
- A list of benchmark suites used in the research related to compilers, program performance, scientific computations etc. ☆52 Updated last year
- Persistent Collectives X - A collective communication library for high performance, low cost persistent collectives over RDMA devices. ☆14 Updated 6 years ago