pnnl / TCBNNLinks

☆35

Alternatives and similar repositories for TCBNN

Users that are interested in TCBNN are comparing it to the libraries listed below

Sorting:

BoyuanFeng / APNN-TC
☆19Updated 4 years ago
uuudown / SBNN
Singular Binarized Neural Network based on GPU Bit Operations (see our SC-19 paper)
☆15Updated 4 years ago
ceruleangu / Block-Sparse-Benchmark
Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse.
☆23Updated 5 years ago
jafermarq / WinogradAwareNets
Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)
☆27Updated 2 years ago
uwsampl / SparseTIR
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆141Updated 2 years ago
lixiuhong / batched_gemm
☆39Updated 5 years ago
owensgroup / merge-spmm
Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018
☆73Updated 5 years ago
pku-liang / AMOS
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆117Updated 3 years ago
masahi / tvm-cutlass-eval
☆41Updated 3 years ago
ParCIS / Magicube
Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.
☆90Updated 3 years ago
marsupialtail / gpu-sparsert
☆18Updated 5 years ago
uwsampl / sparsetir-artifact
Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"
☆26Updated 2 years ago
BradMcDanel / column-combine
☆27Updated 5 years ago
apuaaChen / EVT_AE
Artifacts of EVT ASPLOS'24
☆28Updated last year
dgSPARSE / dgSPARSE-Lib
PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity
☆118Updated 2 weeks ago
wangmaolin / niti
Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv
☆86Updated 3 years ago
YashasSamaga / ConvolutionBuildingBlocks
GEMM and Winograd based convolutions using CUTLASS
☆28Updated 5 years ago
pku-liang / FlexTensor
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆180Updated 3 years ago
IntelLabs / FP8-Emulation-Toolkit
PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.
☆112Updated 11 months ago
hsharma35 / bitfusion
Simulator for BitFusion
☆102Updated 5 years ago
esa-tu-darmstadt / spn-compiler
Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.
☆24Updated last year
union-codesign / union
☆14Updated 4 years ago
anony-sub / chameleon
Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation
☆27Updated 6 years ago
escalab / SIMD2
☆31Updated 3 years ago
daadaada / gas
☆47Updated 4 years ago
olcf / NVIDIA-tensor-core-examples
☆20Updated 6 years ago
UofT-EcoSystem / DietCode
DietCode Code Release
☆65Updated 3 years ago
oresths / tSparse
A GPU algorithm for sparse matrix-matrix multiplication
☆73Updated 5 years ago
comaniac / epoi
Benchmark PyTorch Custom Operators
☆14Updated 2 years ago
itayhubara / CalibTIP
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
☆98Updated 4 years ago