Fast sparse deep learning on CPUs
☆55Sep 28, 2022Updated 3 years ago
Alternatives and similar repositories for sparsednn
Users that are interested in sparsednn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Oct 15, 2020Updated 5 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆29Jul 23, 2021Updated 4 years ago
- ☆20Aug 26, 2021Updated 4 years ago
- CAKE Library for constant-bandwidth matrix multiplication on CPUs☆14Apr 6, 2024Updated 2 years ago
- ☆38Jun 27, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository contains the results and code for the MLPerf™ Inference v2.1 benchmark.☆18Jul 24, 2025Updated 10 months ago
- Multiple 1-stencil implementations using nvidia cuda.☆12Dec 2, 2017Updated 8 years ago
- Implementation of the ACL Findings paper "OutFlip: Generating Examples for Unknown Intent Detection with Natural Language Attack"☆10May 24, 2021Updated 5 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆111Jun 28, 2025Updated 11 months ago
- ☆17Apr 1, 2020Updated 6 years ago
- ☆24May 9, 2025Updated last year
- A platform to display the carbon neutralization information for researchers, decision-makers, and other participants in the community.☆18Aug 16, 2022Updated 3 years ago
- ☆14May 6, 2021Updated 5 years ago
- ☆16May 11, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Apr 3, 2023Updated 3 years ago
- Muon fsdp 2☆62Aug 8, 2025Updated 10 months ago
- High-performance LLM operator library built on TileLang.☆143Jun 11, 2026Updated last week
- [MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration☆201Apr 27, 2022Updated 4 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 10 months ago
- 第二届云原生编程挑战赛: RocketMQ存储系统设计 第4名 我之渺小 队代码☆11Nov 3, 2021Updated 4 years ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator.☆45Oct 25, 2021Updated 4 years ago
- ☆12Aug 26, 2022Updated 3 years ago
- End to End steps for adding custom ops in PyTorch.☆24Aug 20, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆107May 10, 2026Updated last month
- A baseline repository of Auto-Parallelism in Training Neural Networks☆145Jun 25, 2022Updated 3 years ago
- ☆167Jul 22, 2024Updated last year
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆15May 18, 2022Updated 4 years ago
- An innovative library for efficient LLM inference via low-bit quantization☆353Aug 30, 2024Updated last year
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆12Nov 26, 2019Updated 6 years ago
- ☆13Jun 16, 2021Updated 5 years ago
- A library for creating complex experimental pipelines☆12Jul 25, 2022Updated 3 years ago
- A pytorch implementation of: "Unsupervised Deep Learning for Structured Shape Matching"☆16Jun 9, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- quantize aware training package for NCNN on pytorch☆68Jul 27, 2021Updated 4 years ago
- ☆63Dec 18, 2024Updated last year
- Efficient Deep Learning Survey Paper☆34Feb 14, 2023Updated 3 years ago
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 3 years ago
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU…☆38Dec 10, 2015Updated 10 years ago
- Haskell implementation of Glumpy☆12Jun 21, 2021Updated 4 years ago