keroro824 / HashingDeepLearningLinks
Codebase for "SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems"
☆1,106Updated 4 years ago
Alternatives and similar repositories for HashingDeepLearning
Users that are interested in HashingDeepLearning are comparing it to the libraries listed below
Sorting:
- ☆470Updated 4 years ago
- Efficient GPU kernels for block-sparse matrix multiplication and convolution☆1,062Updated 2 years ago
- Fast & Simple Resource-Constrained Learning of Deep Network Structure☆1,033Updated 3 weeks ago
- A uniform interface to run deep learning models from multiple frameworks☆941Updated 2 years ago
- Mesh TensorFlow: Model Parallelism Made Easier☆1,626Updated 2 years ago
- nGraph has moved to OpenVINO☆1,345Updated 5 years ago
- PyTorch elastic training☆728Updated 3 years ago
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/☆1,512Updated this week
- "Multi-Level Intermediate Representation" Compiler Infrastructure☆1,761Updated 4 years ago
- A performant and modular runtime for TensorFlow☆756Updated 4 months ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆734Updated 2 years ago
- Library for faster pinned CPU <-> GPU transfer in Pytorch☆683Updated 5 years ago
- Fast Block Sparse Matrices for Pytorch☆550Updated 4 years ago
- common in-memory tensor structure☆1,136Updated last month
- Collective communications library with various primitives for multi-machine training.☆1,384Updated last month
- Enabling PyTorch on XLA Devices (e.g. Google TPU)☆2,737Updated 3 weeks ago
- Reference implementations of MLPerf® training benchmarks☆1,737Updated 3 weeks ago
- A domain specific language to express machine learning workloads.☆1,762Updated 2 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,551Updated 6 years ago
- The Tensor Algebra Compiler (taco) computes sparse tensor expressions on CPUs and GPUs☆1,343Updated 8 months ago
- ATen: A TENsor library for C++11☆713Updated 6 years ago
- Large Model Support in Tensorflow☆202Updated 5 years ago
- A GPipe implementation in PyTorch☆861Updated last year
- TVM integration into PyTorch☆456Updated 5 years ago
- Myia prototyping☆461Updated 2 years ago
- Accelerate your Neural Architecture Search (NAS) through fast, reproducible and modular research.☆483Updated last month
- A GPU performance profiling tool for PyTorch models☆509Updated 4 years ago
- Compiler for Neural Network hardware accelerators☆3,323Updated last year
- GPU implementation of a fast generalized ANS (asymmetric numeral system) entropy encoder and decoder, with extensions for lossless compre…☆369Updated 3 weeks ago
- PyTorch, TensorFlow, JAX and NumPy — all of them natively using the same code☆699Updated 2 years ago