pssrawat / ppopp-artifact
Artifact for 'Register Optimizations for Stencils on GPUs'
☆10Sep 18, 2018Updated 7 years ago
Alternatives and similar repositories for ppopp-artifact
Users who are interested in ppopp-artifact are comparing it to the repositories listed below
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆25Sep 27, 2019Updated 6 years ago
- ☆10Aug 4, 2020Updated 5 years ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation☆14Jul 15, 2016Updated 9 years ago
- Artifact repository for paper Automatic Generation of High-Performance Quantized Machine Learning Kernels☆17Oct 13, 2020Updated 5 years ago
- The LLVM DOE Fork is a fork of upstream LLVM (https://github.com/llvm/llvm-project/) that hosts multiple DOE-funded projects. Contact in…☆25Updated this week
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆26Feb 6, 2026Updated last week
- ☆13Aug 28, 2025Updated 5 months ago
- development repository for the open earth compiler☆82Feb 19, 2021Updated 4 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- A place to chronicle performance of various hardware and software solutions for using AI at the edge for object detection☆10Sep 11, 2022Updated 3 years ago
- ☆12Aug 15, 2023Updated 2 years ago
- Computes the Henry coefficient of methane in IRMOF-1☆10Oct 5, 2021Updated 4 years ago
- SQL Optimizations using MLIR☆12Apr 5, 2020Updated 5 years ago
- Time based theme switching for the alacritty terminal☆12Feb 9, 2022Updated 4 years ago
- NeonGoby alias analysis checker☆14Jul 2, 2013Updated 12 years ago
- NVIDIA Compute Unified Device Architecture Toolkit☆15Feb 2, 2026Updated last week
- Benchmark scripts for comparing tutorials in PyTorch and JAX☆14Aug 25, 2022Updated 3 years ago
- A tool for checking tool output inspired by LLVM's FileCheck☆12Aug 29, 2025Updated 5 months ago
- Gray-Scott reaction-diffusion system in 3D using CUDA☆12Jun 8, 2019Updated 6 years ago
- An implementation of the parallel marching cubes algorithm in CUDA☆10Sep 23, 2021Updated 4 years ago
- Performance Prediction Toolkit☆56Sep 13, 2025Updated 5 months ago
- ASM methods to test small loop performance on x86☆13Jun 11, 2019Updated 6 years ago
- ☆11Dec 13, 2014Updated 11 years ago
- Fast Synchronization-Free Algorithms for Parallel Sparse Triangular Solves with Multiple Right-Hand Sides (SpTRSM)☆14Feb 14, 2020Updated 5 years ago
- Files used for the evaluation of uiCA☆18Dec 14, 2022Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Feb 20, 2025Updated 11 months ago
- DeepPerf is a set of CUDA assembly development tools☆10Dec 19, 2018Updated 7 years ago
- Library for exact linear algebra, a C++ template-library based originally on LinBox intended for F4-like implementations☆18Dec 15, 2012Updated 13 years ago
- Code for reproducing experiments performed for Accordion☆13Jun 11, 2021Updated 4 years ago
- RL-Scope: Cross-Stack Profiling for Deep Reinforcement Learning Workloads☆47Apr 7, 2021Updated 4 years ago
- Finite Field Operations on GPGPU☆15Jul 23, 2023Updated 2 years ago
- ☆15Oct 20, 2020Updated 5 years ago
- ☆12Oct 9, 2020Updated 5 years ago
- Build your own Zoom Client RPM repository☆11Dec 30, 2025Updated last month
- Mirror of official llvm git repository located at http://llvm.org/git/llvm. Updated hourly.☆13Jun 14, 2014Updated 11 years ago
- A dataflow runtime simulator.☆12Jul 18, 2019Updated 6 years ago
- ☆13Jan 7, 2023Updated 3 years ago
- Emacs major mode for Alloy☆13Jul 14, 2018Updated 7 years ago
- A set of cog recipes for C++ reflection☆15Aug 21, 2011Updated 14 years ago