Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc.)
☆62 · Mar 23, 2025 · Updated last year
Alternatives and similar repositories for hpc
Users that are interested in hpc are comparing it to the libraries listed below.
- c++ implementation of mmpose inference, for pose estimation based on MNN ☆12 · Mar 9, 2021 · Updated 5 years ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU ☆17 · Feb 17, 2021 · Updated 5 years ago
- Parallel Solver for Large-Scale Sparse Matrix Computations (MPI) ☆20 · Jan 5, 2026 · Updated 3 months ago
- Hybrid CPU and GPU real-time dynamic digital image correlation engine and application ☆10 · Apr 26, 2023 · Updated 2 years ago
- Dynamic matrix type and algorithms for sparse matrices ☆23 · Feb 12, 2025 · Updated last year
- TensorRT deployment tutorial ☆11 · Aug 1, 2025 · Updated 8 months ago
- Prototype implementations of the orders 2 and 4 of the Runge-Kutta method in C++, CUDA and OpenCL applied to vector fields. ☆18 · Jan 10, 2017 · Updated 9 years ago
- Agent with Warm Start and Adaptive Dynamic Termination for Plane Localization in 3D Ultrasound ☆15 · Oct 5, 2022 · Updated 3 years ago
- For the paper 'A novel isogeometric coupling approach for assembled thin-walled structures', here is the core code and implementation tha… ☆14 · Aug 9, 2024 · Updated last year
- ☆29 · Aug 13, 2019 · Updated 6 years ago
- Four tracking algorithms organized with cascade classifier or YOLOv3 for object detection. ☆13 · Nov 20, 2018 · Updated 7 years ago
- REDM is an open-source DirectUI interface framework built to commercial standards. It provides a complete project-management scheme and a detailed documentation framework, helps with visual interface design, and the stability of its core library has been verified in several large internal commercial projects. http://hgy413.com/3426.html ☆14 · Aug 19, 2018 · Updated 7 years ago
- Deployment version of yolov8n: the ONNX model is exported with the official ONNX export script and deployment-tested on different platforms, making it easy to port across platforms (onnx, tensorRT, rknn, Horizon). ☆39 · May 26, 2023 · Updated 2 years ago
- ☆13 · Oct 8, 2018 · Updated 7 years ago
- OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers ☆23 · Jan 20, 2026 · Updated 2 months ago
- Live-stream pushing with a beautification filter, built by combining OpenCV and FFmpeg ☆31 · Dec 12, 2018 · Updated 7 years ago
- Code for NVIDIA's CUDA By Example Book. ☆48 · Apr 14, 2020 · Updated 5 years ago
- This repository contains the coding aspect of the project Leakage Detection in Smart Water Distribution Systems, where the collected data… ☆11 · Jul 22, 2024 · Updated last year
- ☆12 · Jan 25, 2023 · Updated 3 years ago
- Tool Kit for Lagrangian Grid Reconnection ☆24 · May 26, 2023 · Updated 2 years ago
- Concurrent CPU-GPU Programming using Task Models ☆106 · Dec 19, 2019 · Updated 6 years ago
- CUDA based parallel Image processing tool ☆23 · Jan 11, 2017 · Updated 9 years ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts ☆25 · Aug 29, 2022 · Updated 3 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures ☆11 · Dec 20, 2019 · Updated 6 years ago
- Deployment version of yolop: the post-processing is rewritten in Python in a C++ style, making it easy to port across platforms (onnx, tensorRT, rknn). ☆15 · Mar 2, 2023 · Updated 3 years ago
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform. ☆56 · Apr 25, 2017 · Updated 8 years ago
- ☆11 · Mar 3, 2020 · Updated 6 years ago
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite. ☆10 · Jul 1, 2020 · Updated 5 years ago
- Sn neutron transport written mostly in python. For university Sn transport theory class. ☆12 · Aug 23, 2017 · Updated 8 years ago
- Keras implementation of Morphological Convolutional Neural Networks for Hyperspectral Image Classification ☆10 · Nov 25, 2021 · Updated 4 years ago
- Around View Monitor ☆23 · Sep 1, 2022 · Updated 3 years ago
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall… ☆17 · Mar 14, 2018 · Updated 8 years ago
- CMake Tutorial ☆11 · Oct 8, 2022 · Updated 3 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser ☆13 · Nov 17, 2020 · Updated 5 years ago
- A rknn cpp/c++ inference codebase for yolov5. ☆31 · Aug 25, 2021 · Updated 4 years ago
- Sample project to integrate catch2, cmake and jenkins ☆12 · Dec 11, 2017 · Updated 8 years ago
- Deploy deep learning models on different hardware and frameworks. (TensorRT/ONNX/MNN/RKNN) ☆13 · Jan 2, 2022 · Updated 4 years ago
- N-dimensional Array Datastructure on CPU and GPU ☆20 · Mar 5, 2018 · Updated 8 years ago
- Vectorised data model base and helper classes. ☆20 · Updated this week