SNU-ARC/OpenDNN

OpenDNN: An Open-source, cuDNN-like Deep Learning Primitive Library

☆29

Alternatives and similar repositories for OpenDNN

Users that are interested in OpenDNN are comparing it to the libraries listed below.

IronySuzumiya / NiuDianNao
A simple cycle-accurate DaDianNao simulator
☆13 · Mar 27, 2019 · Updated 7 years ago
FelixWinterstein / LEAP-HLS
Rapid system integration of high-level synthesis kernels using the LEAP FPGA framework
☆12 · Apr 17, 2016 · Updated 10 years ago
tsinghua-ideal / ANSMET
An accelerator for high-dimensional approximate nearest neighbor search
☆15 · May 17, 2025 · Updated last year
andreaskuster / black-parrot-branch-predictor
Branch Predictor Optimization for BlackParrot
☆15 · Mar 24, 2024 · Updated 2 years ago
cornell-brg / hb-pytorch
Repo to hold HammerBlade PyTorch port. Based on PyTorch v1.4.0
☆14 · Oct 4, 2022 · Updated 3 years ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆29 · Mar 6, 2024 · Updated 2 years ago
sld-columbia / esp-caches
SystemVerilog overhaul of ESP L2 and LLC caches with directory based protocol
☆19 · Feb 27, 2025 · Updated last year
zslwyuan / AutoCellLibX
AutoCellLibX: Automated Standard Cell Library Extension Based on Pattern Mining
☆20 · Nov 1, 2022 · Updated 3 years ago
cornell-brg / pymtl-tut-hls
Tutorial for integrating PyMTL and Vivado HLS
☆20 · Apr 17, 2016 · Updated 10 years ago
bakhi / GPUReplay
GPUReplay, ASPLOS 2022
☆42 · Feb 21, 2022 · Updated 4 years ago
jhson989 / cuda-ptx
Inline PTX Assembly in CUDA example
☆15 · May 7, 2022 · Updated 4 years ago
md2z34 / winograd_gpu
GPU implementation of Winograd convolution
☆10 · Oct 23, 2017 · Updated 8 years ago
Sys-Inventor-Lab / AI4System-OSML
☆14 · Feb 26, 2026 · Updated 4 months ago
Ryu1845 / hyena-jax
Implementation of Hyena Hierarchy in JAX
☆10 · Apr 30, 2023 · Updated 3 years ago
xxcclong / GNN-Computing
Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"
☆42 · Nov 16, 2021 · Updated 4 years ago
YulhwaKim / cutlass_tilesparse
CUDA templates for tile-sparse matrix multiplication based on CUTLASS.
☆52 · Mar 1, 2018 · Updated 8 years ago
songqun / speedup-aarch64-cpu
a computing kernel implementation in ML inference framework aiming at theoretical limit
☆12 · Dec 18, 2019 · Updated 6 years ago
shashankprasanna / torchserve-examples
Repository with torchserve examples
☆18 · Oct 6, 2021 · Updated 4 years ago
dwfault / CollAFLplusplus
Implement CollAFL using LLVM LTO pass on afl++.
☆12 · Sep 24, 2020 · Updated 5 years ago
lightsighter / CudaDMA
Emulating DMA Engines on GPUs for Performance and Portability
☆43 · May 17, 2015 · Updated 11 years ago
BradMcDanel / sdgp
☆10 · Feb 1, 2022 · Updated 4 years ago
cvanderwel / TurbulentFlows
Data and Code supporting the eBook by Castro and Vanderwel (2021)
☆20 · Feb 7, 2022 · Updated 4 years ago
mbalesni / deepspeed_llama
Finetuning LLaMA with DeepSpeed
☆10 · Apr 14, 2023 · Updated 3 years ago
WeiCheng14159 / bazel-android-opencl
Run OpenCL program on MOBILE GPU (Qualcomm & ARM) !
☆18 · Jun 27, 2018 · Updated 8 years ago
nzaocan / YYSYuhun
A calculator for Onmyoji Yuhun (soul) loadouts, based on dynamic programming and pruning
☆14 · Sep 3, 2018 · Updated 7 years ago
sunlex0717 / DissectingTensorCores
☆114 · Apr 19, 2024 · Updated 2 years ago
johnpzh / parallel_ANNS
Parallel Approximate Nearest Neighbor Search
☆14 · Nov 12, 2022 · Updated 3 years ago
sifive / riscv-gcc
☆19 · Feb 24, 2026 · Updated 4 months ago
jimmy-evo / opencl_kernels
An easy way to run, test, benchmark and tune OpenCL kernel files
☆24 · Aug 25, 2023 · Updated 2 years ago
apuaaChen / gcnLib
☆10 · Aug 2, 2021 · Updated 4 years ago
google / jax-recommenders
☆11 · Oct 29, 2022 · Updated 3 years ago
maufadel / EnergyMeter
A Python tool to measure the energy consumption of software
☆16 · Feb 5, 2026 · Updated 5 months ago
antiagainst / SM-G991U
Kernel code for Samsung Galaxy S21 (Snapdragon 888)
☆20 · Jul 4, 2021 · Updated 5 years ago
filvarga / srv6-mobile
SRv6 IETF 104 Hackathon
☆11 · Dec 8, 2022 · Updated 3 years ago
PKUZHOU / PetS-ATC-2022
☆10 · Sep 14, 2023 · Updated 2 years ago
mohshawky5193 / dog-breed-classifier
A CNN-based project that tries to predict the breed of the dog in an image and to detect human faces
☆12 · Dec 8, 2022 · Updated 3 years ago
Leonardo-Ding / gpu_sgemm
☆17 · Jul 1, 2020 · Updated 6 years ago
qgwang-hust / GraSU
A Fast Graph Update Library for FPGA-based Dynamic Graph Processing
☆10 · Dec 20, 2021 · Updated 4 years ago
dglai / FeatGraph
Sparse kernels for GNNs based on TVM
☆17 · Nov 18, 2020 · Updated 5 years ago