bcaine / nn_cpp

A minimalistic header-only C++11 Neural Network library based on Eigen::Tensor

☆20

Alternatives and similar repositories for nn_cpp

Users who are interested in nn_cpp are comparing it to the libraries listed below.

afantideng / caffe_comments
Caffe source code annotations
☆15 Updated 7 years ago
zhiqi-0 / RDMA-MXNet-ps-lite
RDMA Optimization on MXNet
☆14 Updated 7 years ago
jiazhihao / sosp19ae
Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions
☆21 Updated 3 years ago
XiuYuLi / flexible-gemm
flexible-gemm conv of deepcore
☆17 Updated 5 years ago
maps-gpu / MAPS
GPU Optimization and Memory Abstraction Framework
☆32 Updated 5 years ago
xldrx / tensorflow-tracer
Runtime Tracing Library for TensorFlow
☆43 Updated 6 years ago
wlzhao22 / tsdg
TSDG: An efficient index graph for graph-based nearest neighbor search
☆9 Updated 3 years ago
NVIDIA / kmeans
kmeans clustering with multi-GPU capabilities
☆119 Updated 2 years ago
CSshengxy / MEC
ICML 2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial)
☆17 Updated 6 years ago
masahi / tvm-winograd
Test winograd convolution written in TVM for CUDA and AMDGPU
☆41 Updated 6 years ago
chaolongzhang / algorithms-cuda
parallel algorithm based on cuda
☆60 Updated 7 years ago
carlushuang / cpu_gemm_opt
how to design cpu gemm on x86 with avx256, that can beat openblas.
☆70 Updated 6 years ago
OpenHero / im2col
image to column
☆30 Updated 11 years ago
adnanozsoy / CUDA_Compression
A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…
☆35 Updated 9 years ago
microsoft / Analysis-Framework-for-TVM
Static analysis framework for analyzing programs written in TVM's Relay IR.
☆28 Updated 5 years ago
wangsiping97 / GPU-Tutorials
Tutorials to GPU programming. Reading notes.
☆17 Updated 2 years ago
iamhankai / mini-dnn-cpp
C++ demo of deep neural networks (MLP, CNN)
☆32 Updated last year
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47 Updated 8 years ago
owensgroup / GpuBTree
Code for paper "Engineering a High-Performance GPU B-Tree" accepted to PPoPP 2019
☆57 Updated 3 years ago
zhangxinqian / example-of-nnvm-in-cpp
An Example of MXNet Models Compilation and Deployment with NNVM in C++
☆16 Updated 7 years ago
lcy-seso / DLFrameworkTest
My tests and experiments with some popular dl frameworks.
☆13 Updated 2 weeks ago
ravi-teja-mullapudi / Halide-NN
CNNs in Halide
☆23 Updated 9 years ago
tobegit3hub / tftvm
TensorFlow and TVM integration
☆37 Updated 5 years ago
ModelTC / pyvlova
Yet another Polyhedra Compiler for DeepLearning
☆19 Updated 2 years ago
HolmesShuan / Caffe-Computation-Graph-Optimization
Caffe Computation Graph Optimization.
☆29 Updated 5 years ago
vazois / TopK
TopK Algorithms Benchmark
☆10 Updated 6 years ago
HPAC / TTC
TTC: A high-performance Compiler for Tensor Transpositions
☆20 Updated 7 years ago
hclhkbu / gcoospdm
Sparse-dense matrix-matrix multiplication on GPUs
☆14 Updated 6 years ago
yester31 / Cutlass_EX
study of cutlass
☆21 Updated 8 months ago
Harry-Chen / InfMoE
Inference framework for MoE layers based on TensorRT with Python binding
☆41 Updated 4 years ago