Accelerating DNN Convolutional Layers with Micro-batches
☆63Apr 30, 2020Updated 6 years ago
Alternatives and similar repositories for ucudnn
Users that are interested in ucudnn are comparing it to the libraries listed below.
- A Deep Learning Meta-Framework and HPC Benchmarking Library☆81May 23, 2022Updated 4 years ago
- Python tools for NVIDIA Profiler☆21Dec 21, 2017Updated 8 years ago
- Compiler toolchain to enable generation of high-level DSLs for geophysical fluid dynamics models☆29Mar 22, 2023Updated 3 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆269Jul 30, 2023Updated 2 years ago
- ONNX SEA-RAFT, optical flow☆14Jan 5, 2026Updated 5 months ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Jul 17, 2019Updated 6 years ago
- Implementation of vDNN++; an improvement over vDNN☆18Dec 7, 2018Updated 7 years ago
- C++ framework for deep learning☆13Dec 1, 2022Updated 3 years ago
- maskrcnn implementation using chainer☆14Jun 12, 2018Updated 7 years ago
- Parallel Tensor Infrastructure (ParTI!)☆34Aug 18, 2020Updated 5 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆12Nov 20, 2019Updated 6 years ago
- Streaming Message Interface: High-Performance Distributed Memory Programming on Reconfigurable Hardware☆15Mar 1, 2022Updated 4 years ago
- ☆13Oct 10, 2018Updated 7 years ago
- GPU Optimization and Memory Abstraction Framework☆33Oct 31, 2019Updated 6 years ago
- Self-learning hands-on for Chainer by Jupyter notebook☆43Feb 14, 2017Updated 9 years ago
- This is the open-source version of TinyTS. The code is dirty so far. We may clean the code in the future.☆21Aug 11, 2025Updated 9 months ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆21Apr 14, 2020Updated 6 years ago
- Question Dependent Recurrent Entity Network☆13Sep 21, 2017Updated 8 years ago
- TP-PARSEC: A Task Parallel PARSEC Benchmark Suite☆11Nov 1, 2020Updated 5 years ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.☆12Oct 20, 2020Updated 5 years ago
- Anomaly Detection in computer vision☆21May 21, 2020Updated 6 years ago
- The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github…☆33Feb 21, 2026Updated 3 months ago
- ☆17Sep 15, 2021Updated 4 years ago
- ☆17Feb 23, 2019Updated 7 years ago
- cuDNN sample codes provided by Nvidia☆47Feb 18, 2019Updated 7 years ago
- Data Dependence Analyzer in the Polyhedral Model☆21Nov 2, 2023Updated 2 years ago
- Strapdown Inertial Navigation System position estimator algorithm☆12Apr 14, 2018Updated 8 years ago
- ☆62Mar 15, 2018Updated 8 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning".☆12Oct 1, 2017Updated 8 years ago
- a model zoo☆11Jul 19, 2017Updated 8 years ago
- Prototype of OpenSHMEM for NVIDIA GPUs, developed as part of DoE Design Forward☆24Apr 26, 2018Updated 8 years ago
- An Agile Chisel-Based SoC Design Framework☆26Dec 29, 2021Updated 4 years ago
- Deep Reinforcement Learning framework based on TensorFlow and OpenAI Gym☆13Apr 30, 2018Updated 8 years ago
- ☆24Mar 22, 2018Updated 8 years ago
- Code for reproducing the results from "CrAM: A Compression-Aware Minimizer" accepted at ICLR 2023☆10Mar 1, 2023Updated 3 years ago
- ext_mpi_collectives☆11Mar 27, 2026Updated 2 months ago
- ☆20Apr 27, 2016Updated 10 years ago
- Based on Thompson sampling with the online bootstrap (Dean Eckles, Maurits Kaptein). http://arxiv.org/abs/1410.4009☆11Dec 30, 2014Updated 11 years ago
- Workflow management system for the automated and distributed analysis of large-scale experimental data.☆13Oct 3, 2024Updated last year