☆64Dec 26, 2022Updated 3 years ago
Alternatives and similar repositories for cuda-samples
Users that are interested in cuda-samples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tutorials for NVIDIA CUPTI samples☆68Nov 3, 2025Updated 8 months ago
- Differentiable scattering matrix computation for designing photonic devices☆12May 26, 2023Updated 3 years ago
- ☆12Apr 26, 2023Updated 3 years ago
- Sparse matrix-matrix multiplication on CPU+GPU systems.☆13Mar 17, 2014Updated 12 years ago
- A GPU version implementation of Guided Filter, using CUDA C/C++, calculates 1080P images in 10ms on 4090☆10Jun 21, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [TMLR 2026] GIOROM, sampling based model-order reduction for Lagrangian systems☆21Mar 12, 2026Updated 3 months ago
- Play-with-compiler sandbox based on PWD☆10Oct 22, 2020Updated 5 years ago
- ☆18Apr 19, 2020Updated 6 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- ☆49May 4, 2024Updated 2 years ago
- Official implementation of CVPR 2020 paper "Front2Back: Single View 3D Shape Reconstruction via Front to Back Prediction"☆12Aug 20, 2021Updated 4 years ago
- ☆10May 20, 2022Updated 4 years ago
- Examples for Monkey2 3D module☆13Jun 19, 2022Updated 4 years ago
- ☆15Jan 5, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- Catkinized version of the latest version of PCL (http://pointclouds.org/)☆13Apr 9, 2020Updated 6 years ago
- A Differential Monte Carlo Solver For the Poisson Equation☆28Jul 17, 2024Updated last year
- [WIP] Better (FP8) attention for Hopper☆33Feb 24, 2025Updated last year
- Harmonia is an algorithm that allows for the implementation of operations on B+ trees using parallelization. As a part of my GPU project,…☆31Aug 8, 2021Updated 4 years ago
- Official Pytorch implementation of Semantic Implicit Neural Scene Representations with Semi-Supervised Training☆13Jan 3, 2022Updated 4 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆21Jul 13, 2025Updated 11 months ago
- A graph coloring register allocator for LLVM.☆11Jan 23, 2017Updated 9 years ago
- Neural sentiment classification of text using the Stanford Sentiment Treebank (SST-2) movie reviews dataset, logistic regression, naive b…☆15Oct 7, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Benchmarks☆20Jun 24, 2026Updated last week
- Extended globbing in modern C++☆15Dec 24, 2025Updated 6 months ago
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- An llvm pass for counting global uncoalesced acceses for cuda code via dynamic analysis.☆14Nov 17, 2018Updated 7 years ago
- ThemeSupport is a small library that can be used to determine whether the operating system is using a light or dark theme.☆13Jun 13, 2024Updated 2 years ago
- Mathematical Software (Now MathMod)☆11Apr 10, 2018Updated 8 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Dec 9, 2019Updated 6 years ago
- Add an OpenGL renderer to older RPG Maker 2003 games☆13Mar 7, 2022Updated 4 years ago
- Musical Gestures Toolbox for Matlab☆10Dec 21, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Strassen's Algorithm for Tensor Contraction☆15Jul 7, 2017Updated 8 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆52Mar 1, 2018Updated 8 years ago
- AutodiffEngine☆13Apr 1, 2019Updated 7 years ago
- AI Agent eBPF optimization benchmark and framework☆20Updated this week
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- Parallel implementation of k-means clustering using MPI4PY and PyCUDA.☆10Mar 11, 2019Updated 7 years ago
- BLAS OpenCL implementation.☆17Apr 8, 2015Updated 11 years ago