boxvc / NVIDIA-Jobs

Deep Learning/GPU Architect/Autonomous Driving Positions

☆81

Alternatives and similar repositories for NVIDIA-Jobs

Users who are interested in NVIDIA-Jobs are comparing it to the repositories listed below.

wkcn / mobula
A Lightweight & Flexible Deep Learning (Neural Network) Framework in Python
☆45Updated last year
seetaresearch / dragon
A Computation Graph Virtual Machine based ML Framework
☆108Updated last year
tvmai / tvmai.github.io
Move to https://github.com/apache/incubator-tvm-site
☆27Updated 4 years ago
mli / dlmark
☆18Updated 7 years ago
cwlacewe / netscope
This is a CNN Analyzer tool, based on Netscope by dgschwend/netscope
☆42Updated 7 years ago
AojunZhou / Efficient-Deep-Learning
Related Paper of Efficient Deep Neural Networks
☆86Updated 4 years ago
Caffe-MPI / Caffe-MPI.github.io
☆125Updated 7 years ago
dlsys-course / assignment2-2018
(Spring 2018) Assignment 2: Graph Executor with TVM
☆123Updated 7 years ago
tensorpack / benchmarks
Use TensorFlow efficiently
☆95Updated 4 years ago
dlsys-course / examples
Example code that appears in lectures
☆23Updated 3 years ago
borisgin / nvcaffe
☆45Updated 2 years ago
tvmai / meetup-slides
Place for meetup slides
☆141Updated 4 years ago
zhxfl / CUDA-CNN
CNN accelerated by CUDA. Tested on MNIST, finally reaching 99.76%
☆186Updated 7 years ago
vinx13 / tvm-cuda-int8-benchmark
Benchmark of TVM quantized model on CUDA
☆111Updated 5 years ago
dlsys-course / tinyflow
Tutorial code on how to build your own Deep Learning System in 2k Lines
☆125Updated 8 years ago
HolmesShuan / Caffe-Computation-Graph-Optimization
Caffe Computation Graph Optimization.
☆29Updated 5 years ago
StacyYang / MXNet-Gluon-SyncBN
MXNet Gluon Synchronized Batch Normalization Preview
☆77Updated 7 years ago
strin / gemm-android
Tutorial to optimize GEMM performance on Android
☆51Updated 9 years ago
davidstutz / tensorflow-cpp-op-example
Simple example of implementing a new TensorFlow operation and its gradient in C++.
☆56Updated 6 years ago
DrZhang99 / algorithms-cuda
Parallel algorithms based on CUDA
☆60Updated 7 years ago
wkcn / CaffeSVD
Compresses the fully connected layers of a CIFAR-10 neural network using SVD, K-Means, and reduced weight precision
☆23Updated 8 years ago
zhangxinqian / example-of-nnvm-in-cpp
An Example of MXNet Model Compilation and Deployment with NNVM in C++
☆16Updated 7 years ago
wkcn / MobulaOP
A Simple & Flexible Cross Framework Operators Toolkit
☆164Updated 4 years ago
neopenx / Dragon
Dragon: A Computation Graph Virtual Machine Based Deep Learning Framework.
☆175Updated 7 years ago
zhuwenxi / pytorch-profiling-tool
☆54Updated 7 years ago
LamHoCN / Depth_conv-for-mobileNet
Depth_conv for MobileNet
☆30Updated 5 years ago
zccyman / pytorch-inference
PyTorch 1.0 inference in C++ on Windows 10 platforms
☆89Updated 6 years ago
hyln9 / GCNGEMM
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47Updated 8 years ago
Yangqing / caffe
Caffe: a Fast framework for neural networks. For the most recent version, check out branch dev. For a more stable version, check out bran…
☆196Updated 5 years ago
nicklhy / DLInfBench
CNN model inference benchmarks for some popular deep learning frameworks
☆52Updated 6 years ago