☆18 · Oct 15, 2020 · Updated 5 years ago
Alternatives and similar repositories for gpu-sparsert
Users interested in gpu-sparsert compare it with the libraries listed below.
- ☆24 · May 9, 2025 · Updated 9 months ago
- Fast sparse deep learning on CPUs ☆56 · Sep 28, 2022 · Updated 3 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs ☆16 · Feb 28, 2019 · Updated 7 years ago
- MLPruning, PyTorch, NLP, BERT, Structured Pruning ☆20 · Jun 29, 2021 · Updated 4 years ago
- A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community. ☆18 · Aug 16, 2022 · Updated 3 years ago
- ☆40 · Feb 28, 2020 · Updated 6 years ago
- ☆17 · Apr 1, 2020 · Updated 5 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations" ☆41 · Nov 16, 2021 · Updated 4 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · May 12, 2024 · Updated last year
- ☆84 · Dec 2, 2022 · Updated 3 years ago
- ☆16 · Aug 20, 2021 · Updated 4 years ago
- ☆22 · Feb 18, 2025 · Updated last year
- Manually implemented quantization-aware training ☆23 · Oct 12, 2022 · Updated 3 years ago
- A novel spatial accelerator for horizontal diffusion weather stencil computation, as described in ICS 2023 paper by Singh et al. (https:/… ☆22 · Jul 27, 2023 · Updated 2 years ago
- To deploy Transformer models in CV to mobile devices. ☆18 · Jan 20, 2022 · Updated 4 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning" ☆25 · Feb 24, 2023 · Updated 3 years ago
- CUDA project for uni subject ☆26 · Oct 26, 2020 · Updated 5 years ago
- Muon fsdp 2 ☆54 · Aug 8, 2025 · Updated 6 months ago
- Chameleon: Adaptive Code Optimization for Expedited Deep Neural Network Compilation ☆27 · Nov 7, 2019 · Updated 6 years ago
- Fast CUDA Kernels for ResNet Inference. ☆182 · May 26, 2019 · Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆31 · Jul 7, 2020 · Updated 5 years ago
- A library of GPU kernels for sparse matrix operations. ☆283 · Nov 24, 2020 · Updated 5 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse. ☆23 · Aug 21, 2020 · Updated 5 years ago
- [Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration ☆31 · Feb 11, 2023 · Updated 3 years ago
- ☆87 · Updated this week
- ☆35 · Apr 10, 2024 · Updated last year
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆38 · Dec 10, 2015 · Updated 10 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018 ☆73 · Oct 5, 2020 · Updated 5 years ago
- Kinematic and dynamic models of continuum and articulated soft robots. ☆15 · Nov 22, 2025 · Updated 3 months ago
- ☆36 · Apr 20, 2021 · Updated 4 years ago
- ☆11 · Apr 17, 2021 · Updated 4 years ago
- FakeQuantize with Learned Step Size(LSQ+) as Observer in PyTorch ☆37 · Dec 18, 2021 · Updated 4 years ago
- Reinforcement learning modular with pytorch ☆11 · Jan 18, 2021 · Updated 5 years ago
- [ICLR-2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers. ☆31 · Jan 20, 2020 · Updated 6 years ago
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration ☆199 · Apr 27, 2022 · Updated 3 years ago
- Boost hardware utilization for ML training workloads via Inter-model Horizontal Fusion ☆32 · May 15, 2024 · Updated last year
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset. ☆16 · Sep 8, 2022 · Updated 3 years ago
- An artificial matrix generator in C ☆12 · Feb 16, 2023 · Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations" ☆10 · Oct 23, 2021 · Updated 4 years ago