CUDA Template Functions
☆20Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for cutf
Users that are interested in cutf are comparing it to the libraries listed below
Sorting:
- An extension library of WMMA API (Tensor Core API)☆111Jul 12, 2024Updated last year
- ☆18Nov 19, 2024Updated last year
- The autoware diffusion planner package☆33Jul 24, 2025Updated 7 months ago
- Parallel selection on GPUs☆15Mar 23, 2021Updated 4 years ago
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆27Oct 13, 2024Updated last year
- a wavelet-based multifractal image analysis tool implementing the WTMM (Wavelet Transform Modulus Maxima) method.☆11Feb 1, 2020Updated 6 years ago
- Programmable JIT Compilation and Optimization for C/C++ using LLVM☆45Updated this week
- Please note OpenFPM project structure change in version 5.0.0. For details refer to the main repo and website☆11Jan 19, 2026Updated last month
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- A FORTRAN implementation of a Moving Finite Volume MHD code in three dimensions including self-gravity.☆12Jan 24, 2017Updated 9 years ago
- ☆53Updated this week
- Range-based for loops to iterate over a range of numbers or values☆34Nov 23, 2016Updated 9 years ago
- ☆18Feb 12, 2026Updated 3 weeks ago
- A program for rendering hexahedral meshes in the form of transparent volumes.☆15Feb 7, 2026Updated 3 weeks ago
- C++17 Wrapper for ScaLAPACK☆11Oct 5, 2023Updated 2 years ago
- EPOCH Input System Version 2☆10Jun 5, 2020Updated 5 years ago
- University of Vermont Mechanical Engineering Heat Transfer Course☆16Apr 6, 2023Updated 2 years ago
- GPU-accelerated RIME implementations. An offshoot of the BIRO projects, and one of the foothills of Mt Exaflop.☆10Dec 10, 2025Updated 2 months ago
- ☆10Mar 2, 2021Updated 5 years ago
- ☆11Jul 13, 2022Updated 3 years ago
- Repository for participants of the "Containers for HPC" training☆11Feb 11, 2026Updated 3 weeks ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- Performance portable routines for opacity, emissivity, and scattering☆13Jan 22, 2026Updated last month
- Deployed version of Tableaunoir. Do not modify this repository.☆11Updated this week
- ☆44Updated this week
- How to use CUDA with Python numpy☆41Dec 1, 2017Updated 8 years ago
- AMR code for compressible flow simulation☆15Mar 20, 2024Updated last year
- Simple microcanonical Molecular Dynamics simulation of a Lennard-Jones fluid in a periodic boundary☆10Jan 2, 2018Updated 8 years ago
- Unofficial implementation for Sigmoid Loss for Language Image Pre-Training☆11Sep 26, 2023Updated 2 years ago
- libSplash - Simple Parallel file output Library for Accumulating Simulation data using Hdf5☆16Apr 8, 2021Updated 4 years ago
- Nonequispaced FFTs on GPUs (based on NFFT: http://www.nfft.org)☆11Apr 30, 2018Updated 7 years ago
- linux内核异步内存回收的另一个思路:基于冷热文件的冷热区域精准的回收冷文件页page(可做成内核ko)☆12Jun 14, 2024Updated last year
- Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications☆10Apr 2, 2024Updated last year
- Continuum Dynamics Evaluation and Test Suite☆15Aug 29, 2017Updated 8 years ago
- AI Accelerators-SC23-tutorial Repository☆11Nov 12, 2023Updated 2 years ago
- 个人笔记☆15Feb 26, 2026Updated last week
- C/C++ Dynamic Memory Analyzer (CMA)☆18Jul 29, 2014Updated 11 years ago
- ☆11Dec 22, 2024Updated last year
- Demonstration of using Caffe2 inside an Android application.☆10Dec 23, 2018Updated 7 years ago