ashokyannam / GPU_Acceleration_Using_CUDA_C_CPP
Programming accelerated applications with CUDA C/C++, enough to be able to begin work accelerating your own CPU-only applications for performance gains, and for moving into novel computational territory.
☆92Updated 6 years ago
Alternatives and similar repositories for GPU_Acceleration_Using_CUDA_C_CPP:
Users that are interested in GPU_Acceleration_Using_CUDA_C_CPP are comparing it to the libraries listed below
- CUDA by Example, written by two senior members of the CUDA software platform team, shows programmers how to employ this new technology. …☆405Updated last year
- CUDA by practice☆125Updated 5 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆88Updated last year
- Hands-On GPU Programming with Python and CUDA, Second Edition, published by Packt☆36Updated 4 years ago
- A collection of awesome algorithms, implemented in CUDA.☆25Updated 7 years ago
- ☆433Updated 9 years ago
- A plugin for Jupyter Notebook to run CUDA C/C++ code☆219Updated 6 months ago
- Algorithms implemented in CUDA + resources about GPGPU☆55Updated 3 years ago
- Introduction to CUDA programming☆116Updated 7 years ago
- Learn OpenMP examples step by step☆91Updated 2 months ago
- Installing and Test PyTorch C++ API on Ubuntu with GPU enabled☆24Updated last year
- A set of hands-on tutorials for CUDA programming☆218Updated last year
- CUDA Matrix Multiplication Optimization☆178Updated 8 months ago
- CS344 - Introduction To Parallel Programming course (Udacity) proposed solutions☆54Updated 7 years ago
- ☆23Updated 5 years ago
- Implement Neural Networks in Cuda from Scratch☆22Updated 10 months ago
- NVIDIA tools guide☆123Updated 3 months ago
- Learning CUDA 10 Programming, published by Packt☆41Updated 2 years ago
- Implementing CNN for Digit Recognition (MNIST and SVHN dataset) using PyTorch C++ API☆24Updated 3 years ago
- Learning and practice of high performance computing (CUDA, Vulkan, OpenCL, OpenMP, TBB, SSE/AVX, NEON, MPI, coroutines, etc. )☆60Updated 2 weeks ago
- ResNet Implementation, Training, and Inference Using LibTorch C++ API☆39Updated 10 months ago
- Some CUDA design patterns and a bit of template magic for CUDA☆150Updated last year
- Implementation of a simple CNN using CUDA☆67Updated 7 years ago
- This is an archive of materials produced for an introductory class on CUDA programming at Stanford University in 2010☆217Updated 2 years ago
- Tutorial for Using Custom Layers with OpenVINO (Intel Deep Learning Toolkit)☆106Updated 5 years ago
- A full-fledged yet minimalistic CUDA-based convolutional neural network library from scratch in C++☆15Updated 5 years ago
- The CMake version of cuda_by_example☆148Updated 4 years ago
- ☆18Updated 8 years ago
- ☆34Updated 5 years ago
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.☆36Updated 7 years ago