Neural network from scratch in CUDA/C++
☆87Sep 8, 2025Updated 6 months ago
Alternatives and similar repositories for neural-network-cuda
Users that are interested in neural-network-cuda are comparing it to the libraries listed below
Sorting:
- Implement Neural Networks in Cuda from Scratch☆24May 17, 2024Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆598Aug 12, 2025Updated 7 months ago
- Simple neural network implementation using CUDA technology. It is an educational implementation.☆98Apr 12, 2018Updated 7 years ago
- A Boid Flocking Simulation Built with Rust☆14May 17, 2021Updated 4 years ago
- Convolutional Neural Network of vgg19 model using Cuda to accelerate☆12Jun 11, 2018Updated 7 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Small tool for profiling the performance of hardware-accelerated Rust code using OpenCL and CUDA☆15Aug 31, 2023Updated 2 years ago
- Play-with-compiler sandbox based on PWD☆10Oct 22, 2020Updated 5 years ago
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch☆941Jul 19, 2023Updated 2 years ago
- Record GPU memory accesses of a CUDA program and visualize the access pattern in a browser☆13Nov 17, 2020Updated 5 years ago
- Goal: a website to automatically train and certify compiler researchers and developers☆10Nov 24, 2019Updated 6 years ago
- Scaling RLLib for generic simulation environments on Theta☆20Feb 16, 2023Updated 3 years ago
- This is a c++ implementation of an LSTM Neural Network parallelized for a GPU using CUDA☆25Oct 29, 2017Updated 8 years ago
- Using TensorFlow for physics-informed neural networks for scientific machine learning (SciML)☆16Nov 30, 2020Updated 5 years ago
- [EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…☆14Oct 17, 2023Updated 2 years ago
- Render, select coordinates, export to video and more.☆13Apr 28, 2024Updated last year
- A graph coloring register allocator for LLVM.☆11Jan 23, 2017Updated 9 years ago
- Experiment of using Tangent to autodiff triton☆82Jan 22, 2024Updated 2 years ago
- A repo based on XiLin Li's PSGD repo that extends some of the experiments.☆14Oct 7, 2024Updated last year
- Multi-GPU (CUDA-MPI) baseline implementation of Heat Equation and the inviscid Burgers' equation☆12Oct 17, 2017Updated 8 years ago
- Mrzaizai2k Stock Assistant Bot: Your all-in-one stock analysis companion. Calculate payback time, find support/resistance, and receive ma…☆28Nov 19, 2025Updated 4 months ago
- Reinforcement learning Q-learning approach to OpenAI Gym's CartPole environment☆30Mar 25, 2023Updated 2 years ago
- [ICLR 2021] Group Equivariant Generative Adversarial Networks.☆14May 6, 2021Updated 4 years ago
- Notes and toy codes...☆11Jul 5, 2019Updated 6 years ago
- ☆105Mar 12, 2026Updated last week
- A Highly-Extensible Data Assimilation Testing Suite☆10Feb 24, 2019Updated 7 years ago
- Supplementary material for the DAFx23 paper Neural Grey-Box Guitar Amplifier Modelling with Limited Data.☆18Sep 14, 2023Updated 2 years ago
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆12Mar 4, 2024Updated 2 years ago
- Leong, Z. X., Zhu, T., & Sun, A. Y. (2024). Time-lapse seismic inversion for CO2 saturation with SeisCO2Net: An application to Frio-II si…☆10Aug 23, 2024Updated last year
- Teaching materials for the deep learning course.☆17Feb 2, 2026Updated last month
- This repository contains the test code developed by MITRE during our quantum software framework evaluation.☆12Jun 25, 2021Updated 4 years ago
- Quantized LLM training in pure CUDA/C++.☆241Mar 6, 2026Updated 2 weeks ago
- ☆10Nov 16, 2024Updated last year
- Exploit Auto-encoder for exploring and predict flow dynamic☆10Oct 4, 2019Updated 6 years ago
- The ALCF hosts a regular simulation, data, and learning workshop to help users scale their applications. This repository contains the exa…☆75Dec 17, 2025Updated 3 months ago
- A Deep Learning-based Real-time Object Detector for DJI Drones☆12Oct 5, 2018Updated 7 years ago
- A multilayer perceptron (for simple image classification), accelerated with CUDA☆17Oct 21, 2019Updated 6 years ago
- Playground to the famous book from Andrei Alexandrescu☆15Nov 18, 2018Updated 7 years ago
- ☆11Nov 21, 2023Updated 2 years ago