vnatesh / CAKE_on_CPU
CAKE Library for constant-bandwidth matrix multiplication on CPUs
☆14 · Apr 6, 2024 · Updated last year
Alternatives and similar repositories for CAKE_on_CPU
Users who are interested in CAKE_on_CPU are comparing it to the libraries listed below.
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · May 12, 2024 · Updated last year
- ☆11 · Apr 3, 2023 · Updated 2 years ago
- ☆10 · Mar 2, 2024 · Updated last year
- [ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation ☆15 · Mar 6, 2025 · Updated 11 months ago
- General Purpose Graphics Processing Unit (GPGPU) IP Core ☆11 · Jul 4, 2014 · Updated 11 years ago
- ☆18 · Apr 8, 2022 · Updated 3 years ago
- ☆15 · Dec 16, 2021 · Updated 4 years ago
- Fundamental Sources for Water Wave Animation ☆20 · Dec 8, 2022 · Updated 3 years ago
- Slides on how to give a technical talk ☆22 · Feb 8, 2021 · Updated 5 years ago
- End to End steps for adding custom ops in PyTorch. ☆24 · Aug 20, 2020 · Updated 5 years ago
- Parsers for CUDA binary files ☆25 · Dec 29, 2023 · Updated 2 years ago
- Fast sparse deep learning on CPUs ☆56 · Sep 28, 2022 · Updated 3 years ago
- ☆42 · Nov 1, 2025 · Updated 3 months ago
- Evaluating different memory managers for dynamic GPU memory ☆26 · Dec 16, 2020 · Updated 5 years ago
- ☆27 · Oct 25, 2021 · Updated 4 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning" ☆25 · Feb 24, 2023 · Updated 2 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper. ☆29 · Jul 23, 2021 · Updated 4 years ago
- [ICCV 2021] Code release for "Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks" ☆32 · Jul 24, 2022 · Updated 3 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs. ☆32 · Apr 2, 2025 · Updated 10 months ago
- The translator that supports translating NVPTX to SPIR-V. This translator is modified from LLVM-SPIR-V Translator. ☆44 · Oct 25, 2021 · Updated 4 years ago
- Various examples for Chisel HDL ☆30 · Mar 20, 2022 · Updated 3 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆31 · Jul 7, 2020 · Updated 5 years ago
- The note of Qualcomm OpenCL SDK ☆37 · Nov 8, 2018 · Updated 7 years ago
- Advanced Integrated Circuits 2025 ☆13 · Nov 1, 2025 · Updated 3 months ago
- Kinematic and dynamic models of continuum and articulated soft robots. ☆15 · Nov 22, 2025 · Updated 2 months ago
- Front-End and Back-End for the Vulkan Hardware Database ☆37 · Feb 1, 2026 · Updated 2 weeks ago
- TLB Benchmarks ☆35 · Sep 11, 2017 · Updated 8 years ago
- A deep learning intermediate representation for multi-platform compilation optimization ☆10 · Oct 28, 2024 · Updated last year
- ☆40 · Feb 28, 2020 · Updated 5 years ago
- MATLAB function to fill an area with hatching ~~or speckling~~ ☆11 · Mar 4, 2018 · Updated 7 years ago
- An artificial matrix generator in C ☆12 · Feb 16, 2023 · Updated 3 years ago
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations" ☆10 · Oct 23, 2021 · Updated 4 years ago
- BERT Sentiment Classification on the IMDb Large Movie Review Dataset. ☆16 · Sep 8, 2022 · Updated 3 years ago
- A simple script to plot the Roofline model for given HW platforms and applications