☆270Jan 14, 2018Updated 8 years ago
Alternatives and similar repositories for OpenCUDA
Users that are interested in OpenCUDA are comparing it to the libraries listed below.
- ☆1,052Mar 13, 2024Updated 2 years ago
- Parallel algorithm based on CUDA☆60Nov 27, 2017Updated 8 years ago
- Sample codes for my CUDA programming book☆2,048Dec 14, 2025Updated 5 months ago
- Useful CUDA code.☆43Mar 11, 2022Updated 4 years ago
- Ultrasound image formation, processing, and analysis. Interfaces built off the ITKUltrasound library.☆27Jan 30, 2024Updated 2 years ago
- ENet segmentation on Android, based on ncnn☆17Mar 29, 2020Updated 6 years ago
- Developing a complete set of GPU-accelerated image processing tools, including convolution and morphology☆52Sep 28, 2010Updated 15 years ago
- This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several…☆1,300Jul 29, 2023Updated 2 years ago
- Otsu's method thresholding and image binarization on GPU in CUDA☆23Dec 3, 2022Updated 3 years ago
- High-performance programming notes☆170May 20, 2022Updated 4 years ago
- ☆121Apr 11, 2024Updated 2 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Aug 4, 2022Updated 3 years ago
- ☆26Aug 15, 2023Updated 2 years ago
- ☆2,742Jan 16, 2024Updated 2 years ago
- The Hybrid Task Graph Scheduler API☆40May 6, 2025Updated last year
- A GPU performance profiling tool for PyTorch models☆22Jul 5, 2022Updated 3 years ago
- ☆10Feb 17, 2017Updated 9 years ago
- FP8 flash attention implemented on the Ada architecture using the cutlass repository☆81Aug 12, 2024Updated last year
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- ☆19Jul 3, 2017Updated 8 years ago
- ONNX-TensorRT: TensorRT backend for ONNX☆3,202Mar 25, 2026Updated last month
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.☆417Jan 2, 2025Updated last year
- Small library for working with rotated rectangle shaped image regions.☆16Nov 7, 2017Updated 8 years ago
- ☆15Dec 1, 2023Updated 2 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,977Apr 13, 2026Updated last month
- CV-CUDA ™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,682Mar 31, 2026Updated last month
- How to optimize some algorithms in CUDA.☆2,981May 8, 2026Updated last week
- The CMake version of cuda_by_example☆148Jul 24, 2020Updated 5 years ago
- A Deep Learning Approach to Ultrasound Image Recovery☆54Sep 12, 2017Updated 8 years ago
- [ICASSP2024] This repo holds the code for work "Residual Dense Swin Transformer for Continuous Depth-Independent Ultrasound Imaging"☆19Apr 10, 2024Updated 2 years ago
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated 2 years ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆515Oct 30, 2024Updated last year
- A running record of pybind11 examples☆18Nov 19, 2022Updated 3 years ago
- A collection of my own miscellaneous code snippets☆18Jul 25, 2024Updated last year
- A simple high performance CUDA GEMM implementation.☆434Jan 4, 2024Updated 2 years ago
- ☆14Jul 23, 2025Updated 9 months ago
- Code for "Learning a Descriptor-Specific 3D Keypoint Detector" and "Learning to detect good 3d keypoints" -ICCV 2015, IJCV 2018☆27Feb 26, 2019Updated 7 years ago
- 🔴 Accelerated GStreamer utilities for NVIDIA Jetson Nano.☆10May 8, 2021Updated 5 years ago
- A practical way of learning Swizzle☆39Feb 3, 2025Updated last year