A quick way to benchmark your CUDA compiler on a Linux environment
☆27 · Mar 16, 2011 · Updated 15 years ago
Alternatives and similar repositories for Benchmarking-CUDA
Users that are interested in Benchmarking-CUDA are comparing it to the libraries listed below.
- Source for Demystifying GPU Microarchitecture through Microbenchmarking ☆18 · May 29, 2023 · Updated 2 years ago
- Benchmarks used in the gpgpu-sim ispass 2009 paper ☆31 · May 7, 2015 · Updated 11 years ago
- Benchmarks for locking algorithms as well as implementations of locking algorithms. ☆25 · Mar 6, 2018 · Updated 8 years ago
- A Coq framework to support structural design and proof of hardware cache-coherence protocols ☆14 · May 7, 2022 · Updated 4 years ago
- Typst template for thesis submitted to University of Waterloo ☆11 · Feb 21, 2026 · Updated 3 months ago
- Fast GPU based tensor core reductions ☆13 · Jan 13, 2023 · Updated 3 years ago
- ☆23 · Apr 25, 2023 · Updated 3 years ago
- Sample for ICS-20 transfer between Hyperledger Fabric and Cosmos-based Blockchain ☆10 · Apr 27, 2022 · Updated 4 years ago
- This is a personal archive. Please refer to github.com/UCLA-VAST/RapidStream ☆15 · May 31, 2022 · Updated 3 years ago
- Set of OpenCL microbenchmarks ☆29 · Nov 19, 2025 · Updated 6 months ago
- The FCUDA CUDA-to-RTL compiler ☆22 · Jul 1, 2016 · Updated 9 years ago
- Evaluating different memory managers for dynamic GPU memory ☆26 · Dec 16, 2020 · Updated 5 years ago
- annotated bibliography on approximate computing ☆10 · Jan 16, 2016 · Updated 10 years ago
- ☆31 · Jun 15, 2022 · Updated 3 years ago
- GPU MemoryManager based on virtualized queues ☆27 · Jun 25, 2022 · Updated 3 years ago
- A Powerful AST Parser for Solidity ☆10 · May 12, 2026 · Updated 2 weeks ago
- KWANT is an open source C++ toolkit for computing scores and other metrics for object tracking systems. ☆11 · Jan 22, 2026 · Updated 4 months ago
- ☆10 · Dec 31, 2018 · Updated 7 years ago
- Fork of Hipacc generating code for Vivado HLS and Altera OpenCL ☆24 · Oct 8, 2018 · Updated 7 years ago
- ☆10 · Jul 18, 2024 · Updated last year
- This repo contains all the code, slides and other reference documents used in community sessions. ☆14 · Mar 29, 2023 · Updated 3 years ago
- Cross-Platform Annotation Tool for Person Search Datasets ☆11 · Aug 29, 2017 · Updated 8 years ago
- A high performance implementation of kmeans algorithm with cuda ☆18 · Sep 7, 2014 · Updated 11 years ago
- Wrapper for Allied Vision Technology cameras using their Vimba SDK ☆10 · Jul 13, 2016 · Updated 9 years ago
- The Simple OpenGL Image Library for Mac OS X ☆11 · Aug 19, 2011 · Updated 14 years ago
- Vietnamese spelling correction (ViSC) tool ☆12 · Dec 11, 2016 · Updated 9 years ago
- PIN-tool to produce multi-threaded atomic memory traces ☆36 · Oct 22, 2013 · Updated 12 years ago
- The wakatime daemon gives leaderboard notifications and analyses your coding data to give recommendations. ☆16 · Feb 9, 2016 · Updated 10 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆17 · Jun 3, 2024 · Updated last year
- The presentation at Spark Summit 2014 showing how 4Quant does production scale image processing and analysis using Spark ☆16 · Jul 29, 2014 · Updated 11 years ago
- ☆13 · Sep 22, 2025 · Updated 8 months ago
- notes on reading tensorflow source code ☆13 · Aug 18, 2018 · Updated 7 years ago
- PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework ☆124 · Updated this week
- Simulator for Heterogeneous Architecture ☆12 · Jan 12, 2016 · Updated 10 years ago
- ☆26 · Jun 21, 2016 · Updated 9 years ago
- process monitor for server, auto kill processes consuming high CPU after long time ☆17 · Sep 18, 2014 · Updated 11 years ago
- ☆17 · Nov 24, 2018 · Updated 7 years ago
- The project consists of an image processing application that is using distributed processors (MPI). The development language is C/C++ with… ☆13 · Mar 26, 2012 · Updated 14 years ago
- A media player built by wrapping and integrating VLCKit