A GPU cache model for research purposes
☆32 · Nov 4, 2013 · Updated 12 years ago
Alternatives and similar repositories for gpu-cache-model
Users that are interested in gpu-cache-model are comparing it to the libraries listed below.
- Research compiler based on algorithmic skeletons ☆23 · Oct 18, 2014 · Updated 11 years ago
- Simulator for Heterogeneous Architecture ☆12 · Jan 12, 2016 · Updated 10 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block execution frequencies of compute kernels ☆33 · Mar 15, 2021 · Updated 5 years ago
- Multiple 1-stencil implementations using NVIDIA CUDA. ☆12 · Dec 2, 2017 · Updated 8 years ago
- Collection of full, mini, proxy, and benchmark apps. ☆11 · Feb 14, 2020 · Updated 6 years ago
- ☆30 · Mar 20, 2013 · Updated 13 years ago
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference ☆17 · Mar 30, 2025 · Updated last year
- Artifact for "DX100: A Programmable Data Access Accelerator for Indirection (ISCA 2025)" paper ☆18 · Nov 6, 2025 · Updated 6 months ago
- ☆327 · Apr 6, 2026 · Updated last month
- Julia wrapper of CLBlast, a "tuned OpenCL BLAS library". ☆14 · Aug 23, 2023 · Updated 2 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang… ☆13 · Aug 12, 2022 · Updated 3 years ago
- LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels ☆10 · Jun 8, 2020 · Updated 5 years ago
- ☆33 · Sep 9, 2020 · Updated 5 years ago
- ☆19 · Jul 23, 2025 · Updated 9 months ago
- A Valgrind extension for CUDA, unofficial mirror for https://www.hlrs.de/organization/av/spmt/research/cudagrind/ ☆10 · Aug 5, 2015 · Updated 10 years ago
- Iodine: Verifying Constant-Time Execution of Hardware ☆18 · Mar 29, 2021 · Updated 5 years ago
- A repository holding the slides and short information from my presentations at different events ☆11 · Jul 25, 2025 · Updated 9 months ago
- GKLEE is a symbolic analyser and test generator tailored for CUDA C++ programs ☆16 · Dec 12, 2014 · Updated 11 years ago
- ☆10 · Oct 3, 2018 · Updated 7 years ago
- A template for developing custom FIRRTL transforms ☆10 · Jan 30, 2020 · Updated 6 years ago
- ☆14 · Feb 26, 2026 · Updated 2 months ago
- The open-source platform for managing citizen reports. ☆19 · Jul 18, 2017 · Updated 8 years ago
- High performance C++ Linear Algebra Library ☆16 · Oct 12, 2020 · Updated 5 years ago
- ☆16 · Updated this week
- A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0 ☆10 · Nov 26, 2020 · Updated 5 years ago
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering ☆62 · Aug 11, 2024 · Updated last year
- This is no longer maintained. Please visit StreamHPC's fork https://github.com/StreamHPC/FinanceBench ☆43 · Apr 20, 2018 · Updated 8 years ago
- Bioinformatics benchmarking package, based on the original BioBench developed by Albayraktaroglu et al, 2005 ☆13 · Dec 13, 2018 · Updated 7 years ago
- A portable high-level API with CUDA or OpenCL back-end ☆56 · Oct 8, 2017 · Updated 8 years ago
- Application for displaying the bills issued by Congress ☆11 · Jul 26, 2020 · Updated 5 years ago
- A computing kernel implementation in an ML inference framework, aiming at the theoretical limit ☆12 · Dec 18, 2019 · Updated 6 years ago
- Some materials for "The taste of probabilistic programming and modeling" by Oleg Kiselyov at FLOLAC'16 ☆16 · Jul 14, 2016 · Updated 9 years ago
- Automatic generation of architecture-level models for hardware from its RTL design. ☆16 · Apr 12, 2023 · Updated 3 years ago
- OpenCL tool to detect buffer overflows in GPU kernels ☆23 · Jan 7, 2019 · Updated 7 years ago
- An LLVM IR Editor plugin for Eclipse ☆53 · Jan 22, 2014 · Updated 12 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations" ☆42 · Nov 16, 2021 · Updated 4 years ago
- Benchmarks of Deep Neural Networks ☆39 · May 19, 2021 · Updated 5 years ago
- ☆82 · Nov 16, 2020 · Updated 5 years ago
- Run OpenCL programs on mobile GPUs (Qualcomm & ARM)! ☆18 · Jun 27, 2018 · Updated 7 years ago