☆93Nov 11, 2025Updated 5 months ago
Alternatives and similar repositories for GPU_Programming
Users that are interested in GPU_Programming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Step by step implementation of a fast softmax kernel in CUDA☆65Jan 6, 2025Updated last year
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- ☆91Feb 29, 2024Updated 2 years ago
- ☆12Dec 22, 2024Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- RAPIDS Deployment Documentation☆15Updated this week
- ☆17Updated this week
- IBM Spectrum LSF - IBM Cloud☆16Sep 30, 2024Updated last year
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆76Feb 18, 2026Updated last month
- An example of how to use the multiprocessing package along with PyTorch.☆21Jan 15, 2021Updated 5 years ago
- ☆15Feb 13, 2018Updated 8 years ago
- Comparing Deep Learning Inference of Pytorch models running on CPU, CUDA and TensorRT☆16Feb 20, 2022Updated 4 years ago
- ☆15Feb 23, 2025Updated last year
- NVIDIA tools guide☆164Jan 7, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Repository to host ROCm Developer Hub Notebook Tutorials☆67Mar 24, 2026Updated 2 weeks ago
- Planetary crustal and mantle properties, and lithospheric displacements, stress and strain, calculations in spherical harmonics.☆15Mar 26, 2026Updated 2 weeks ago
- Read custom dataset☆12Mar 31, 2023Updated 3 years ago
- ☆14Apr 10, 2023Updated 3 years ago
- A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do☆543Mar 2, 2026Updated last month
- Flash Attention in raw Cuda C beating PyTorch☆38May 14, 2024Updated last year
- Simple command line to get directory information☆13May 27, 2025Updated 10 months ago
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- ☆12Apr 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Finetuning BLOOM on a single GPU using gradient-accumulation☆31Mar 29, 2023Updated 3 years ago
- DECT NR+ Link-Level Simulation (ETSI TS 103 636)☆17Jan 25, 2026Updated 2 months ago
- Variational Autoencoder with non-euclidean (hyperbolic) latent space☆12Nov 25, 2022Updated 3 years ago
- torchcomms: a modern PyTorch communications API☆355Updated this week
- World's Simplest Todo, Just 1 checkbox per day, no bs!☆13Nov 22, 2024Updated last year
- Apply GPU in ML and DL☆67Mar 23, 2026Updated 2 weeks ago
- ☆23Feb 16, 2022Updated 4 years ago
- C++ version of Conway's Game of Life with raylib. This project is accompanied by a video tutorial that explains everything in detail.☆11Mar 14, 2024Updated 2 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆470Mar 10, 2025Updated last year
- Implementation from scratch in CUDA C++ of image processing algorithms.☆22Oct 26, 2020Updated 5 years ago
- Backtracking regular expression engine written in Python☆13Nov 4, 2022Updated 3 years ago
- Easy to use benchmarks for linear algebra frameworks☆24Jun 5, 2020Updated 5 years ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆32Feb 21, 2026Updated last month
- ☆23Apr 2, 2026Updated last week
- ☆94Updated this week