Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )
☆62Mar 23, 2025Updated last year
Alternatives and similar repositories for hpc
Users that are interested in hpc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- c++ implementation of mmpose inference, for pose estimation based on MNN☆13Mar 9, 2021Updated 5 years ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU☆17Feb 17, 2021Updated 5 years ago
- Parallel Solver for Large-Scale Sparse Matrix Computations (MPI)☆20Jan 5, 2026Updated 4 months ago
- Dynamic matrix type and algorithms for sparse matrices☆24Feb 12, 2025Updated last year
- Utilities to support interacting with multiple HPC clusters☆11Nov 21, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Jan 18, 2019Updated 7 years ago
- One simple way which to change two faces☆19Oct 28, 2018Updated 7 years ago
- ☆29Aug 13, 2019Updated 6 years ago
- Four tracking algorithms organized with cascade classifier or YOLOv3 for object detection.☆13Nov 20, 2018Updated 7 years ago
- Fast, matrix-free isogeometric Galerkin method for Karhunen-Loeve approximation of random fields.☆11Mar 22, 2021Updated 5 years ago
- detect UAV based on YOLOv5 and siamRPN☆10Oct 20, 2021Updated 4 years ago
- REDM是一套基于商业化标准的开源directui界面框架,不仅能提供完善的项目管理方案、详细的文档框架,也可轻松协助完成可视化界面设计,其核心库的稳定性已在内部多个大型商化项目中通过验证。 http://hgy413.com/3426.html☆14Aug 19, 2018Updated 7 years ago
- yolov8n 部署版,基于官方的导出onnx脚本导出onnx模型,在不同平台上进行部署测试,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆40May 26, 2023Updated 2 years ago
- OpenFPM: A scalable open framework for particle and particle-mesh codes on parallel computers☆23Apr 13, 2026Updated last month
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A thread safe simple C++ wrapper for FFTW & MKL☆17Sep 27, 2021Updated 4 years ago
- Python module for Environment Modules☆17Sep 7, 2017Updated 8 years ago
- Generate training sample images of TSN algorithm \ TSN异常行为检测训练样本生成C++ 代码☆10Nov 3, 2018Updated 7 years ago
- FEA project for EN2340 Computational Methods in Structural and Solid Mechanics, Brown University☆35Dec 2, 2017Updated 8 years ago
- It is an annoying thing of preparing the openCL environment, so I wapper the initialization part of OpenCL and setting parameters for ker…☆16May 16, 2018Updated 8 years ago
- OpenCV Universal Multi thread video Interface with neglectable latency.☆34Mar 21, 2024Updated 2 years ago
- Offload Eigen operations to GPUs☆20Feb 3, 2022Updated 4 years ago
- Tool Kit for Lagrangian Grid Reconnection☆24May 26, 2023Updated 2 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Concurrent CPU-GPU Programming using Task Models☆109Dec 19, 2019Updated 6 years ago
- CUDA based parallel Image processing tool☆21Jan 11, 2017Updated 9 years ago
- Examples of MPI and OpenMP (adapted from MPI Tutorial)☆45Apr 26, 2026Updated 3 weeks ago
- Matrix Multiplication on GPU using Shared Memory considering Coalescing and Bank Conflicts☆26Aug 29, 2022Updated 3 years ago
- CUDA and OpenCL SVM training benchmark☆16Jul 20, 2017Updated 8 years ago
- Intel HPC Containers using Singularity☆19Jan 7, 2023Updated 3 years ago
- HPC dashboards developed for SRCC systems☆20Dec 11, 2021Updated 4 years ago
- OCCA Python API: JIT Compilation for Multiple Architectures☆11Dec 20, 2019Updated 6 years ago
- yolop 部署版本,后处理用python语言以C++方式形式进行改写,便于移植不同平台(onnx、tensorRT、rknn)。☆15Mar 2, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Implementing CNN code in CUDA and OpenCL to evaluate its performance on NVIDIA GPUs, AMD GPUs, and an FPGA platform.☆56Apr 25, 2017Updated 9 years ago
- Lightweight face detectors with landmarks. Training code using pytorch and inference using pytorch/ncnn/tensorflow/tflite.☆10Jul 1, 2020Updated 5 years ago
- Sn neutron transport written mostly in python. For university Sn transport theory class.☆12Aug 23, 2017Updated 8 years ago
- Eigen Recursive Matrix Extension☆12Sep 18, 2019Updated 6 years ago
- 复合材料力学性能 层合板相关计算☆14Aug 21, 2024Updated last year
- CMake Tutorial☆10Oct 8, 2022Updated 3 years ago
- nVidia's CUDA accelerated Spin Transformations of Discrete Surfaces, based on the original code and paper by Keenan Crane, Ulrich Pinkall…☆17Mar 14, 2018Updated 8 years ago