☆98May 30, 2026Updated last month
Alternatives and similar repositories for GPU_Programming
Users that are interested in GPU_Programming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A proof-of-concept implementation of Titans: models mixing long-term, short-term and persistent memories☆24Apr 9, 2025Updated last year
- General Matrix Multiplication using NVIDIA Tensor Cores☆28Jan 25, 2025Updated last year
- ☆92Feb 29, 2024Updated 2 years ago
- ☆13Dec 22, 2024Updated last year
- RAPIDS Deployment Documentation☆15Jun 10, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆20May 30, 2026Updated last month
- Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> O…☆33Aug 18, 2021Updated 4 years ago
- OpenShell is the safe, private runtime for autonomous AI agents.☆165May 29, 2026Updated last month
- A series of high-performance GEMM (General Matrix Multiply) implementations Iteratively optimised for H100 GPUs in Pure CUDA.☆79Feb 18, 2026Updated 4 months ago
- Homepage of Software Engineering for Machine Learning☆17May 25, 2026Updated last month
- My study notes and hands-on projects for CUDA-based GPU programming☆13Dec 11, 2025Updated 6 months ago
- ☆12Oct 31, 2021Updated 4 years ago
- ☆15Feb 13, 2018Updated 8 years ago
- Hugging Face Download (Cache) Manager☆22Aug 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NVIDIA tools guide☆166Jan 7, 2025Updated last year
- ☆27May 18, 2022Updated 4 years ago
- A set of helper classes to make simpy simulation simpler☆14May 21, 2023Updated 3 years ago
- Repository to host ROCm Developer Hub Notebook Tutorials☆86Updated this week
- ☆14Apr 10, 2023Updated 3 years ago
- Simple command line to get directory information☆13May 27, 2025Updated last year
- Reinforcement Learning example in Nim, playing tic tac toe. Based off original C version from the great Antirez☆15Apr 2, 2025Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆165Oct 19, 2023Updated 2 years ago
- Finetuning BLOOM on a single GPU using gradient-accumulation☆32Mar 29, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DECT NR+ Link-Level Simulation (ETSI TS 103 636)☆19May 20, 2026Updated last month
- Variational Autoencoder with non-euclidean (hyperbolic) latent space☆13Nov 25, 2022Updated 3 years ago
- World's Simplest Todo, Just 1 checkbox per day, no bs!☆13Nov 22, 2024Updated last year
- ☆14Mar 29, 2026Updated 3 months ago
- C++ version of Conway's Game of Life with raylib. This project is accompanied by a video tutorial that explains everything in detail.☆12Mar 14, 2024Updated 2 years ago
- ☆23Feb 16, 2022Updated 4 years ago
- Apply GPU in ML and DL☆68Mar 23, 2026Updated 3 months ago
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆491Mar 10, 2025Updated last year
- Implementation from scratch in CUDA C++ of image processing algorithms.☆23Oct 26, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A curriculum for learning about gpu performance engineering, from scratch to what the frontier AI labs do☆857Apr 27, 2026Updated 2 months ago
- c++ implementation of a simple-virtual-machine☆14Sep 19, 2014Updated 11 years ago
- ☆41Feb 14, 2026Updated 4 months ago
- Simple problems implemented in CUDA C☆39Apr 7, 2025Updated last year
- ring-attention experiments☆168Oct 17, 2024Updated last year
- Dynamically typed N-D expression system based on xtensor☆26Oct 20, 2021Updated 4 years ago
- Implement Neural Networks in Cuda from Scratch☆23May 17, 2024Updated 2 years ago