sebastienwood / MemSELinks
☆12Updated last year
Alternatives and similar repositories for MemSE
Users that are interested in MemSE are comparing it to the libraries listed below
Sorting:
- Code for SIGGRAPH 2022 paper "Automatic quantization for physics-based simulation"☆64Updated 3 years ago
- ☆67Updated 2 years ago
- BGHT: High-performance static GPU hash tables.☆70Updated last month
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆28Updated 8 years ago
- CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution☆18Updated 2 years ago
- RISC-V-based many-core neuromorphic architecture☆12Updated last week
- A GPU Ray Tracer☆50Updated 4 years ago
- EGGS, a method to speed up sparse matrix operations when the same sparsity is used for multiple times. This repo contains examples that s…☆25Updated 5 years ago
- ☆11Updated 4 years ago
- Efficient SpGEMM on GPU using CUDA and CSR☆57Updated 2 years ago
- ☆52Updated 6 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20)☆27Updated last year
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆27Updated 4 years ago
- This repository contains a non-exponential transmittance operator that can be used with PyTorch☆18Updated 4 years ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks"☆32Updated 3 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- TaichiCon: Taichi Conferences☆73Updated 3 years ago
- ☆14Updated 2 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆29Updated 5 years ago
- Code for "Structured Sparsity Inducing Adaptive Optimizers for Deep Learning" in PyTorch☆18Updated 4 years ago
- Code for SIGGRAPH 2020 paper "Langevin Monte Carlo Rendering with Gradient-based Adaptation"☆74Updated 5 years ago
- Fundamental Sources for Water Wave Animation☆20Updated 2 years ago
- An implementation of parallel exclusive scan in CUDA☆62Updated 7 years ago
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago
- TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights☆17Updated 5 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Updated 7 years ago
- Really Elastic Ray Engine☆28Updated last year
- CamJ: an energy modeling and system-level exploration framework for in-sensor visual computing☆23Updated last year
- ☆23Updated 3 years ago
- Approximate layers - TensorFlow extension☆27Updated 3 months ago