CUDA for MNIST training/inference
☆44Dec 30, 2023Updated 2 years ago
Alternatives and similar repositories for mnist-cudnn
Users that are interested in mnist-cudnn are comparing it to the libraries listed below.
- Transparent cuDNN / cuBLAS / Eigen usage for deep learning training using the MNIST dataset.☆18Sep 3, 2020Updated 5 years ago
- Simple CuDNN wrapper☆19Nov 29, 2015Updated 10 years ago
- Parallel cuckoo hashing on GPUs with CUDA☆12Sep 27, 2019Updated 6 years ago
- cuDNN sample codes provided by Nvidia☆47Feb 18, 2019Updated 7 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.☆30Feb 12, 2022Updated 4 years ago
- ☆11Sep 17, 2020Updated 5 years ago
- a Log-Structured Merged-Tree store engine☆16Sep 21, 2023Updated 2 years ago
- A python implementation for secure file transfer. Course project: "Cyber Security" (KFUPM-COE451)☆17Oct 18, 2021Updated 4 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆17Oct 20, 2021Updated 4 years ago
- llama INT4 cuda inference with AWQ☆54Jan 20, 2025Updated last year
- Assembler for NVIDIA Volta and Turing GPUs☆241Jan 13, 2022Updated 4 years ago
- ☆48Jan 30, 2026Updated 3 months ago
- Transforming Graphs for Efficient Irregular Graph Processing on GPUs☆50Nov 15, 2022Updated 3 years ago
- ebrowser, an energy-efficient and lightweight human interaction framework without degrading the user experience in mobile Web browsers.☆12Sep 7, 2023Updated 2 years ago
- ☆12Sep 20, 2023Updated 2 years ago
- iSpot is a lightweight and cost-effective instance provisioning framework for Directed Acyclic Graph (DAG)-style big data analytics, in …☆11Sep 7, 2023Updated 2 years ago
- CUDAAdvisor: a GPU profiling tool☆53Aug 24, 2018Updated 7 years ago
- Fast CUDA Kernels for ResNet Inference.☆183May 26, 2019Updated 6 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆11May 6, 2023Updated 3 years ago
- OpenPose network tensorrt optimizer☆39Aug 20, 2019Updated 6 years ago
- A C++/CUDA toolkit for Transformer (NMT) Translator (Decoder)☆17Jan 7, 2019Updated 7 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- GPUfs - File system support for NVIDIA GPUs☆104Nov 26, 2018Updated 7 years ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated last month
- ☆16Aug 18, 2015Updated 10 years ago
- DelayStage is a simple yet effective stage delay scheduling strategy to interleave the cluster resources across the parallel stages, so a…☆14Sep 7, 2023Updated 2 years ago
- ☆10Oct 31, 2022Updated 3 years ago
- A pure CUDA implementation of the DCNv2 (deformable convolution) forward pass, with no dependency on cuTorch (THC).☆10Dec 9, 2019Updated 6 years ago
- Deep learning for time-varying multi-entity datasets☆17May 12, 2018Updated 7 years ago
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 10 months ago
- Spark, Cassandra, Tessellation and ArcGIS☆10Jan 18, 2015Updated 11 years ago
- Prophet is a predictable communication scheduling strategy to schedule the gradient transfer in an adequate order, with the aim of maximi…☆16Sep 13, 2023Updated 2 years ago
- This repository contains the results and code for the MLPerf™ Training v2.1 benchmark.☆15Aug 9, 2023Updated 2 years ago
- C++ Example project using SQLiteCpp as a Git submodule / CMake subdirectory☆29Sep 30, 2022Updated 3 years ago
- Code for the paper "Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation" by Alexander Levine and Soheil Feizi.☆10Aug 22, 2022Updated 3 years ago
- ☆19Dec 8, 2013Updated 12 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- Run Tensorflow and Keras with GPU support on Kubernetes☆13Mar 21, 2017Updated 9 years ago
- Opara is a lightweight and resource-aware DNN Operator parallel scheduling framework to accelerate the execution of DNN inference on GPUs…☆23Dec 19, 2024Updated last year