Neural network from scratch in CUDA/C++
☆94Sep 8, 2025Updated 9 months ago
Alternatives and similar repositories for neural-network-cuda
Users that are interested in neural-network-cuda are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆604May 13, 2026Updated last month
- ☆21Mar 26, 2024Updated 2 years ago
- Distributed Training of Bayesian Neural Networks at Scale☆11May 26, 2020Updated 6 years ago
- Convolutional Neural Network of vgg19 model using Cuda to accelerate☆12Jun 11, 2018Updated 8 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ChartOCR, based on original repo.☆14Mar 22, 2023Updated 3 years ago
- Tracking with Bounding Polygons☆24Nov 23, 2021Updated 4 years ago
- Code from the "CUDA Crash Course" YouTube series by CoffeeBeforeArch☆963Jul 19, 2023Updated 2 years ago
- Goal: a website to automatically train and certify compiler researchers and developers☆10Nov 24, 2019Updated 6 years ago
- ☆10May 20, 2022Updated 4 years ago
- We try to put source files of llvm tutorials here☆18Oct 6, 2020Updated 5 years ago
- CUB-200-2011 dataset by classes folder☆12Nov 7, 2024Updated last year
- Scaling RLLib for generic simulation environments on Theta☆20Feb 16, 2023Updated 3 years ago
- Convolutional neural networks C++ framework with CPU and GPU (CUDA) backends☆181Dec 7, 2018Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- official code for Dynamic Smooth Label Assignment☆12Oct 5, 2022Updated 3 years ago
- Using TensorFlow for physics-informed neural networks for scientific machine learning (SciML)☆16Nov 30, 2020Updated 5 years ago
- Code and pretrained models accompanying the paper "Ensembling geophysical models using Bayesian Neural Networks"☆10Jul 11, 2022Updated 3 years ago
- MNIST inference on i.MT RT1062 (Teensy 4.0) using TensorFlow Lite for Microcontrollers☆13May 30, 2020Updated 6 years ago
- Audio Masking Methods☆12Nov 15, 2019Updated 6 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆21Jul 13, 2025Updated 11 months ago
- Unsupervised Lifelong Person Re-identification via Contrastive Rehearsal☆11Apr 7, 2022Updated 4 years ago
- Fast domain-aware neural network emulation of a planetary boundary layer parameterization in a numerical weather forecast model☆12Mar 26, 2019Updated 7 years ago
- Rembg is a tool to remove images background.☆12Nov 29, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A parameter server implement with MPI.☆11Nov 15, 2017Updated 8 years ago
- Implement feedforward, convolution neural network in C++ with only built-in libraries. This is a tutorial style project which implements …☆30Apr 10, 2018Updated 8 years ago
- WWW2021: Interpreting and Unifying Graph Neural Networks with An Optimization Framework☆14Jun 23, 2021Updated 5 years ago
- Multi-GPU (CUDA-MPI) baseline implementation of Heat Equation and the inviscid Burgers' equation☆12Oct 17, 2017Updated 8 years ago
- Reinforcement learning Q-learning approach to OpenAI Gym's CartPole environment☆30Mar 25, 2023Updated 3 years ago
- Notes and toy codes...☆11Jul 5, 2019Updated 6 years ago
- ☆11Nov 6, 2019Updated 6 years ago
- A Highly-Extensible Data Assimilation Testing Suite☆10Feb 24, 2019Updated 7 years ago
- Supplementary material for the DAFx23 paper Neural Grey-Box Guitar Amplifier Modelling with Limited Data.☆18Sep 14, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official code for CVPR2024 “VideoMAC: Video Masked Autoencoders Meet ConvNets”☆15May 12, 2026Updated last month
- lshash for python3☆10Mar 21, 2018Updated 8 years ago
- Pure Java Llama2 inference with optional multi-GPU CUDA implementation☆13Sep 2, 2023Updated 2 years ago
- Quantized LLM training in pure CUDA/C++.☆248Jun 3, 2026Updated 3 weeks ago
- ☆10Nov 16, 2024Updated last year
- Exploit Auto-encoder for exploring and predict flow dynamic☆10Oct 4, 2019Updated 6 years ago
- Example of bazel python cpp binding☆10May 27, 2023Updated 3 years ago