cuDNN sample codes provided by Nvidia
☆47Feb 18, 2019Updated 7 years ago
Alternatives and similar repositories for cuDNN-sample
Users that are interested in cuDNN-sample are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- C++ framework for deep learning☆13Dec 1, 2022Updated 3 years ago
- Transparent Cudnn / Cublas / Eigen usage for the deep learning training using MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- notes on reading tensorflow source code☆13Aug 18, 2018Updated 7 years ago
- Simple Arm assembly kernels for testing the performance and functionality of Arm CPUs.☆16Dec 3, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- TA's implementation for the project of Computer Architecture and Intelligent Chip Design (23 Spring)☆10May 20, 2023Updated 3 years ago
- ☆14Feb 5, 2025Updated last year
- This repository provides the official implementation of QSVD, a method for efficient low-rank approximation that unifies Query-Key-Value …☆28May 16, 2026Updated last month
- My learning notes about AI, including Machine Learning and Deep Learning.☆18Jun 30, 2019Updated 7 years ago
- Detailed examples demonstrating cross-platform building with Bazel☆11Dec 1, 2017Updated 8 years ago
- Yinghan's Code Sample☆365Jul 25, 2022Updated 3 years ago
- Simple neural network implementation using CUDA technology. It is an educational implementation.☆98Apr 12, 2018Updated 8 years ago
- This is a c++ implementation of an LSTM Neural Network parallelized for a GPU using CUDA☆25Oct 29, 2017Updated 8 years ago
- Automotive S32 U-Boot☆10May 18, 2026Updated last month
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 4 years ago
- ☆18Jan 1, 2023Updated 3 years ago
- Source code examples from the Parallel Forall Blog☆1,331Sep 23, 2025Updated 9 months ago
- ROS driver for techman robot☆15Feb 19, 2020Updated 6 years ago
- PowerSwitch: a adaptive mode switch engine for distributed parrallel graph computation☆16Dec 23, 2013Updated 12 years ago
- Implementation from Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference☆11Feb 5, 2020Updated 6 years ago
- tb3_aprilTag simulation☆14Sep 2, 2023Updated 2 years ago
- Accelerating DNN Convolutional Layers with Micro-batches☆63Apr 30, 2020Updated 6 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆18Mar 25, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- C++ Interfaces for the nAG Library☆18Jul 24, 2025Updated 11 months ago
- Deploy YOLOv8 in Unity using Sentis☆21Apr 20, 2024Updated 2 years ago
- C# applications which can connect to ROS via RosBridge.☆17Apr 13, 2016Updated 10 years ago
- Adaptive floating-point based numerical format for resilient deep learning☆14Apr 11, 2022Updated 4 years ago
- ☆16Nov 2, 2022Updated 3 years ago
- Final Project for Parallel Computing at CMU (15-618/15-418)☆10May 13, 2016Updated 10 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆33Feb 10, 2025Updated last year
- Multi-Paradigm Programming with Modern C++, published by Packt☆25Jan 30, 2023Updated 3 years ago
- LOUDS-trie implementation example (C++)☆15Nov 27, 2019Updated 6 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- efficient header for third-party libs☆12May 27, 2022Updated 4 years ago
- Python Tools for the POP Metrics☆13Feb 16, 2022Updated 4 years ago
- ☆10Aug 30, 2017Updated 8 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- Handwritten Digit Recognition Using Neural Network by Python☆10May 10, 2018Updated 8 years ago
- ☆23Nov 2, 2022Updated 3 years ago
- Solutions to problems on Hackerearth☆12Jan 18, 2018Updated 8 years ago