A GPU cache model for research purposes
☆32Nov 4, 2013Updated 12 years ago
Alternatives and similar repositories for gpu-cache-model
Users that are interested in gpu-cache-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Research compiler based on algorithmic skeletons☆23Oct 18, 2014Updated 11 years ago
- Simulator for Heterogeneous Architecture☆12Jan 12, 2016Updated 10 years ago
- CUDA Flux is a profiler for GPU applications which reports the basic block executions frequencies of compute kernels☆33Mar 15, 2021Updated 5 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- Collection of full, mini, proxy, and benchmark apps.☆11Feb 14, 2020Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference☆17Mar 30, 2025Updated last year
- ☆332Apr 6, 2026Updated 2 months ago
- Julia wrapper of CLBlast, a "tuned OpenCL BLAS library".☆14Aug 23, 2023Updated 2 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- LLVM Plugin to Instrument Global Memory Accesses in CUDA Kernels☆10Jun 8, 2020Updated 6 years ago
- ☆33Sep 9, 2020Updated 5 years ago
- ☆19Jul 23, 2025Updated 10 months ago
- A Valgrind extension for CUDA, unofficial mirror for https://www.hlrs.de/organization/av/spmt/research/cudagrind/☆10Aug 5, 2015Updated 10 years ago
- A simple utility to create user-specified git commit hashes☆15Nov 24, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A benchmarking suite for heterogeneous systems. The primary goal of this project is to improve and update aspects of existing benchmarkin…☆43Jan 30, 2026Updated 4 months ago
- A repository holding the slides and short information from my presentations at different events☆11Jul 25, 2025Updated 10 months ago
- GKLEE is a symbolic analyser and test generator tailored for CUDA C++ programs☆16Dec 12, 2014Updated 11 years ago
- ☆10Oct 3, 2018Updated 7 years ago
- A High-Performance Side-Channel-Resistant AES on GPUs☆13May 9, 2019Updated 7 years ago
- A template for developing custom FIRRTL transforms☆10Jan 30, 2020Updated 6 years ago
- ☆14Feb 26, 2026Updated 3 months ago
- La plataforma de código abierto para la gestión de reportes ciudadanos.☆19Jul 18, 2017Updated 8 years ago
- GPU Static Modeling using PTX and Deep Structured Learning☆19Apr 1, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆16Updated this week
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆63Aug 11, 2024Updated last year
- This is no longer maintained. Please visit StreamHPC's fork https://github.com/StreamHPC/FinanceBench☆43Apr 20, 2018Updated 8 years ago
- A portable high-level API with CUDA or OpenCL back-end☆56Oct 8, 2017Updated 8 years ago
- Aplicación para mostrar los proyectos de ley emitidos por el Congreso☆11Jul 26, 2020Updated 5 years ago
- a computing kernel implementation in ML inference framework aiming at theoretical limit☆12Dec 18, 2019Updated 6 years ago
- Some materials for "The taste of probabilistic programming and modeling" by Oleg Kiselyov at FLOLAC'16☆16Jul 14, 2016Updated 9 years ago
- Automatic generation of architecture-level models for hardware from its RTL design.☆16Apr 12, 2023Updated 3 years ago
- ⛔️ DEPRECATED - System for AUtomated Code Evaluation☆26Jun 18, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- OpenCL tool to detect buffer overflows in GPU kernels☆23Jan 7, 2019Updated 7 years ago
- An LLVM IR Editor plugin for Eclipse☆53Jan 22, 2014Updated 12 years ago
- Benchmarks of Deep Neural Networks☆39May 19, 2021Updated 5 years ago
- Processing in Memory Emulation☆27Feb 24, 2023Updated 3 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆35Jul 28, 2020Updated 5 years ago
- ☆83Nov 16, 2020Updated 5 years ago
- Run OpenCL program on MOBILE GPU (Qualcomm & ARM) !☆17Jun 27, 2018Updated 7 years ago