vnatesh/CAKE_on_CPU

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vnatesh/CAKE_on_CPU)

vnatesh / CAKE_on_CPU

CAKE Library for constant-bandwidth matrix multiplication on CPUs

☆14

Alternatives and similar repositories for CAKE_on_CPU

Users that are interested in CAKE_on_CPU are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SpRegTiling / sparse-register-tiling
View on GitHub
☆10Mar 2, 2024Updated 2 years ago
TiledTensor / TiledKernel
View on GitHub
TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.
☆19May 12, 2024Updated 2 years ago
AnonymousYWL / MYLIB
View on GitHub
☆18Apr 8, 2022Updated 4 years ago
zxytim / arithmetic-encoding-compression
View on GitHub
☆11Apr 3, 2023Updated 3 years ago
ULAFF / LAFF-On-PfHP
View on GitHub
Repository for "LAFF-On Programming for High Performance"
☆45Jul 2, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sandeepkumar-skb / pytorch_custom_op
View on GitHub
End to End steps for adding custom ops in PyTorch.
☆24Aug 20, 2020Updated 5 years ago
LighthouseHPC / lighthouse
View on GitHub
☆11Apr 10, 2019Updated 7 years ago
AnonymousRepo123 / AlphaSparse
View on GitHub
A intelligent matrix format designer for SpMV
☆10Oct 10, 2023Updated 2 years ago
eth-cscs / COSTA
View on GitHub
Distributed Communication-Optimal Shuffle and Transpose Algorithm
☆14Apr 18, 2026Updated 3 months ago
shieldforever / NeuronQuant
View on GitHub
[ASP-DAC 2025] "NeuronQuant: Accurate and Efficient Post-Training Quantization for Spiking Neural Networks" Official Implementation
☆19Mar 6, 2025Updated last year
eth-cscs / Tiled-MM
View on GitHub
Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.
☆33Apr 2, 2025Updated last year
ispras / quix86
View on GitHub
An x86-64 instruction decoder.
☆16Mar 11, 2024Updated 2 years ago
deathwings602 / Unified-IR
View on GitHub
面向多平台编译优化的深度学习中间表示
☆10Oct 28, 2024Updated last year
GPUPeople / spECK
View on GitHub
Efficient SpGEMM on GPU using CUDA and CSR
☆61Jul 18, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ecrc / polar
View on GitHub
Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix
☆14Jun 3, 2020Updated 6 years ago
HPAC / ELAPS
View on GitHub
Experimental Linear Algebra Performance Studies
☆12Feb 24, 2017Updated 9 years ago
ishani / Tether-ISPC
View on GitHub
A comprehensive Visual Studio MSBuild integration of the Intel SPMD Compiler (ISPC), Premake support and a collection of ISPC tests and d…
☆13Apr 7, 2025Updated last year
brandonhamilton / GPGPU
View on GitHub
General Purpose Graphics Processing Unit (GPGPU) IP Core
☆11Jul 4, 2014Updated 12 years ago
jeremypw / gnonograms
View on GitHub
Nonograms puzzle game written in Vala.
☆12Dec 12, 2024Updated last year
FZJ-JSC / jube-configs
View on GitHub
JUBE benchmarking environment configuration files
☆10Oct 1, 2015Updated 10 years ago
nihui / ncnn_on_xr806
View on GitHub
☆15Dec 16, 2021Updated 4 years ago
UofT-EcoSystem / Tempo
View on GitHub
Memory footprint reduction for transformer models
☆11Jan 24, 2023Updated 3 years ago
enp1s0 / cuMpSGEMM
View on GitHub
Fast SGEMM emulation on Tensor Cores
☆17Feb 16, 2025Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
lixiuhong / batched_gemm
View on GitHub
☆40Feb 28, 2020Updated 6 years ago
forhappy / CS61
View on GitHub
CS61 learning schedules and assessments
☆16Dec 6, 2011Updated 14 years ago
ahmedheakl / CASS
View on GitHub
[ACL 2026 🔥] CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark
☆35Apr 20, 2026Updated 3 months ago
96boards-hikey / aosp-device-linaro-hikey
View on GitHub
This repository is based off https://android.googlesource.com/device/linaro/hikey/
☆12Apr 2, 2019Updated 7 years ago
jagot / ThreadedSparseArrays.jl
View on GitHub
☆19Dec 27, 2023Updated 2 years ago
logological / heria
View on GitHub
A LaTeX class for Horizon Europe RIA and IA grant proposals
☆17Aug 17, 2025Updated 11 months ago
nDIRECT / nDIRECT
View on GitHub
A direct convolution library targeting ARM multi-core CPUs.
☆12Nov 27, 2024Updated last year
conanhujinming / How_to_give_a_talk
View on GitHub
如何做技术演讲(how to give a talk)的slide
☆22Feb 8, 2021Updated 5 years ago
ChASE-library / ChASE
View on GitHub
This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…
☆21Jul 8, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
schreckc / FSWW
View on GitHub
Fundamental Sources for Water Wave Animation
☆20Dec 8, 2022Updated 3 years ago
ian-r-rose / buckinghampy
View on GitHub
Teaching tool for the Buckingham Pi theorem. With a terribly obvious name.
☆17Apr 11, 2018Updated 8 years ago
rmsrosa / UnitfulBuckinghamPi.jl
View on GitHub
Solve for the adimensional Pi groups in a list of Unitful parameters, according to the Buckingham-Pi Theorem.
☆18Mar 4, 2025Updated last year
ispras / centos6.9-build-docker
View on GitHub
CentOS 6.9 build Docker environment to distribute portable Linux binaries
☆13Dec 28, 2021Updated 4 years ago
csc-training / hip
View on GitHub
☆14Oct 5, 2022Updated 3 years ago
KAdamek / GPU_Overlap-and-save_convolution
View on GitHub
Shared memory overlap-and-save method for NVIDIA GPUs using CUDA
☆18Aug 21, 2025Updated 11 months ago
JuliaGPU / NCCL.jl
View on GitHub
A Julia wrapper for the NVIDIA Collective Communications Library.
☆31Jul 11, 2026Updated 2 weeks ago