ChrisCummins / paper-synthesizing-benchmarks
π "Synthesizing Benchmarks for Predictive Modeling" (π₯ CGO'17 Best Paper)
β22Updated 2 years ago
Alternatives and similar repositories for paper-synthesizing-benchmarks:
Users that are interested in paper-synthesizing-benchmarks are comparing it to the libraries listed below
- π "End-to-end Deep Learning of Optimization Heuristics" (π₯ PACT'17 Best Paper)β73Updated 2 years ago
- A framework that helps implementing swizzle GPU kernelsβ42Updated 5 years ago
- SMT-LIB benchmarks for shape computations from deep learning models in PyTorchβ18Updated 2 years ago
- Haystack is an analytical cache model that given a program computes the number of cache misses.β46Updated 5 years ago
- Code released to accompany the ISCA paper: "T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware"β28Updated 3 years ago
- an approximate compilerβ38Updated 4 years ago
- Artifact Evaluation Reproduction for "Software Prefetching for Indirect Memory Accesses", CGO 2017, using CK.β38Updated 3 years ago
- Automata Benchmark Suiteβ20Updated last year
- Deep learning program generatorβ106Updated last year
- Polyhedral Extraction Tool (source repository: http://repo.or.cz/w/pet.git)β39Updated 2 years ago
- NeuroVectorizer is a framework that uses deep reinforcement learning (RL) to predict optimal vectorization compiler pragmas for for loopsβ¦β92Updated 2 years ago
- a Pin tool for collecting microarchitecture-independent workload characteristicsβ60Updated last year
- CUDAAdvisor: a GPU profiling toolβ48Updated 6 years ago
- COBAYN: Compiler Autotuning Framework Using Bayesian Networksβ20Updated 2 years ago
- outline and links for PLDI 2022 tutorialβ17Updated 2 years ago
- β29Updated 3 years ago
- A Comprehensive Benchmark Suite for Graph Computingβ67Updated 6 years ago
- Library to plot integer sets and mapsβ49Updated 8 years ago
- High-performance automata-processing engines are traditionally evaluated using a limited set of regular expression rulesets. While regulaβ¦β32Updated last year
- A system for programming formally-verified loop transformations.β16Updated 6 years ago
- Alloy models for automatic synthesis of memory model litmus test suites (from ASPLOS 2017)β16Updated last year
- A Distributed Multi-GPU System for Fast Graph Processingβ65Updated 6 years ago
- Rigorous Floating-Point Mixed-Precision Tunerβ14Updated 4 years ago
- ComPy-Learn is a framework for exploring program representations for ML4CODE tasks.β23Updated last year
- Enabling on-the-fly manipulations with LLVM IR code of CUDA sourcesβ110Updated 2 years ago
- β19Updated 2 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUsβ34Updated 5 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUsβ28Updated 6 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launchesβ15Updated 5 years ago
- β17Updated 3 years ago