Accelerating DNN Convolutional Layers with Micro-batches
☆63Apr 30, 2020Updated 5 years ago
Alternatives and similar repositories for ucudnn
Users that are interested in ucudnn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Dual-way gradient sparsification approach for async DNN training, based on PyTorch.☆11Dec 8, 2022Updated 3 years ago
- Python tools for NVIDIA Profiler☆21Dec 21, 2017Updated 8 years ago
- Script to check ONNX model compatibility against TensorRT versions using docker images☆12Nov 23, 2023Updated 2 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆268Jul 30, 2023Updated 2 years ago
- Implementation of vDNN++; an improvement over vDNN☆18Dec 7, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆10Oct 23, 2019Updated 6 years ago
- A GPU FP32 computation method with Tensor Cores.☆26Dec 8, 2025Updated 3 months ago
- Implementate super resolution in deep learning☆14May 17, 2017Updated 8 years ago
- C++ framework for deep learning☆13Dec 1, 2022Updated 3 years ago
- maskrcnn implementation using chainer☆14Jun 12, 2018Updated 7 years ago
- Parallel Tensor Infrastructure (ParTI!)☆33Aug 18, 2020Updated 5 years ago
- ☆13Oct 10, 2018Updated 7 years ago
- Data and devtools for the "Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video" paper.☆22Nov 2, 2018Updated 7 years ago
- Torch FFI-bindings for NNPACK☆31May 26, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Self-learning hands-on for Chainer by Jupyter notebook☆43Feb 14, 2017Updated 9 years ago
- Assembly-optimized Marvin32 hash function☆12Jan 17, 2024Updated 2 years ago
- Nonblocking data structures☆12Jan 25, 2015Updated 11 years ago
- Chunky Loop Interaction☆25Aug 13, 2019Updated 6 years ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆19Apr 14, 2020Updated 5 years ago
- Immix GC for LLVM based languages☆15Apr 2, 2025Updated 11 months ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- TP-PARSEC: A Task Parallel PARSEC Benchmark Suite☆11Nov 1, 2020Updated 5 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆33Feb 21, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆17Sep 15, 2021Updated 4 years ago
- The Core Paper Project of EVM☆15Nov 2, 2019Updated 6 years ago
- ☆17Feb 23, 2019Updated 7 years ago
- ☆10Dec 19, 2019Updated 6 years ago
- cuDNN sample codes provided by Nvidia☆47Feb 18, 2019Updated 7 years ago
- Data Dependence Analyzer in the Polyhedral Model☆21Nov 2, 2023Updated 2 years ago
- ☆62Mar 15, 2018Updated 8 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- A Convolutional Neural Network Cascade for Face Detection☆14May 29, 2016Updated 9 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- SKFAC Preconditioner for MindSpore☆12Jul 2, 2021Updated 4 years ago
- Automatic differentiation with uarray/unumpy.☆16Mar 7, 2021Updated 5 years ago
- Deep Reinforcement Learning framework based on TensorFlow and OpenAI Gym☆13Apr 30, 2018Updated 7 years ago
- ☆24Mar 22, 2018Updated 8 years ago
- Hyperoctree construction and manipulation