A GPU benchmark suite for autotuners
☆19Feb 20, 2024Updated 2 years ago
Alternatives and similar repositories for BAT
Users that are interested in BAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Using C++ magic to capture CUDA kernels and tune them with Kernel Tuner☆21Sep 12, 2025Updated 9 months ago
- Kernel Tuner☆398Updated this week
- Instructions and templates for SC authors☆17Aug 22, 2021Updated 4 years ago
- ☆17Dec 8, 2023Updated 2 years ago
- A Symbolic Emulator for Shuffle Synthesis on the NVIDIA PTX Code☆16Mar 19, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- RISC-V vector extension ISA simulation☆18Jun 11, 2019Updated 7 years ago
- PIRA - Automatic Instrumentation Refinement☆17Mar 28, 2024Updated 2 years ago
- 🔮 High-performance kaleidoscope effects for real-time applications☆15Jun 1, 2026Updated last week
- Kernel Tuning Toolkit☆70Updated this week
- Base container for developing C++ and Fortran HPC applications☆18Jun 14, 2022Updated 3 years ago
- a lightweight, semi-automated setup guide for HashiStack: Consul + Vault + Nomad, on Footloose powered Docker "container VMs", with Ansib…☆11Jul 2, 2021Updated 4 years ago
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆77Feb 18, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Automated bottleneck detection and solution orchestration☆21Feb 24, 2026Updated 3 months ago
- Dark channel Haze removal algorithm with CUDA acceleration (typically 10x or more speedup using a Nvidia GPU)☆14Dec 7, 2017Updated 8 years ago
- A fast alternative to the standard C/C++ pow() function. With adjustable accuracy-space tradeoff.☆14Jul 12, 2013Updated 12 years ago
- ☆13Nov 1, 2021Updated 4 years ago
- UCAS网络登录☆13Nov 17, 2018Updated 7 years ago
- A quick way of spawning many batch jobs☆14Oct 24, 2022Updated 3 years ago
- Ansible config for Cluster in the Cloud☆11Apr 25, 2024Updated 2 years ago
- Terraform project to create a cli for drift detection☆19Jun 19, 2025Updated 11 months ago
- cuPC: CUDA-based Parallel PC Algorithm for Causal Structure Learning on GPU☆16Mar 19, 2021Updated 5 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆13May 18, 2024Updated 2 years ago
- CUDA solutions for the lab assignments in the UIUC-ECE408 Applied Parallel Programming course.☆19Apr 18, 2023Updated 3 years ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆72Sep 12, 2018Updated 7 years ago
- Scripts for running various benchmarks on Isambard and other systems.☆29May 13, 2021Updated 5 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆16Feb 28, 2019Updated 7 years ago
- WIP · CUDA compatibility for Blaze · https://bitbucket.org/blaze-lib/blaze☆21Nov 18, 2019Updated 6 years ago
- ☆20Sep 28, 2024Updated last year
- ☆13Sep 19, 2024Updated last year
- A simple profiler to count Nvidia PTX assembly instructions of OpenCL/SYCL/CUDA kernels for roofline model analysis.☆57Mar 20, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SIMULATeQCD is a multi-GPU Lattice QCD framework that makes it easy for physicists to implement lattice QCD formulas while still providin…☆39May 6, 2026Updated last month
- OrqueIO main source code repository☆37Updated this week
- Runs a single CUDA/OpenCL kernel, taking its source from a file and arguments from the command-line☆26Updated this week
- A cross-platform visualization prototyping framework☆56Oct 7, 2025Updated 8 months ago
- Tool to detect and report leaked MPI objects like MPI_Requests and MPI_Datatypes☆14Sep 17, 2014Updated 11 years ago
- ☆13Jan 7, 2025Updated last year
- Ansible role for managing Dell PowerConnect switches☆19Oct 31, 2023Updated 2 years ago