AnonymousYWL/LibShalom

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AnonymousYWL/LibShalom)

AnonymousYWL / LibShalom

☆30

Alternatives and similar repositories for LibShalom

Users that are interested in LibShalom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AnonymousYWL / MYLIB
View on GitHub
☆18Apr 8, 2022Updated 4 years ago
nDIRECT / nDIRECT
View on GitHub
A direct convolution library targeting ARM multi-core CPUs.
☆12Nov 27, 2024Updated last year
dglai / FeatGraph
View on GitHub
Sparse kernels for GNNs based on TVM
☆17Nov 18, 2020Updated 5 years ago
spcl / absinthe
View on GitHub
Absinthe is an optimization framework to fuse and tile stencil codes in one shot
☆14Jul 17, 2019Updated 7 years ago
nullplay / Unified-Convolution-Framework
View on GitHub
☆10Apr 24, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
SYSU-SCC / sysu-scc-spack-repo
View on GitHub
Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.
☆16Aug 20, 2025Updated 11 months ago
yanghaku / tvm-rt-wasm
View on GitHub
A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…
☆13Aug 3, 2023Updated 2 years ago
weifengliu-ssslab / Benchmark_SpMV_using_CSR5
View on GitHub
CSR5-based SpMV on CPUs, GPUs and Xeon Phi
☆111Jun 10, 2024Updated 2 years ago
gty111 / GEMM_WMMA
View on GitHub
GEMM by WMMA (tensor core)
☆15Jul 31, 2022Updated 3 years ago
icl-utk-edu / hpcc
View on GitHub
HPC Challenge Benchmark
☆70Sep 28, 2025Updated 10 months ago
tpoisonooo / chgemm
View on GitHub
symmetric int8 gemm
☆67Jun 7, 2020Updated 6 years ago
CompML / survey-deep-gradient-compression
View on GitHub
☆10Jun 4, 2021Updated 5 years ago
lixiuhong / batched_gemm
View on GitHub
☆40Feb 28, 2020Updated 6 years ago
SpRegTiling / sparse-register-tiling
View on GitHub
☆10Mar 2, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stillbreeze / 3D-reconstruction-Using-SfM-and-Stereo-Matching
View on GitHub
SfMEdu System from Princeton for Dense 3D Reconstruction
☆11Dec 11, 2019Updated 6 years ago
tbd-ai / tbd-tools
View on GitHub
☆12May 3, 2020Updated 6 years ago
SEP-Graph / sep-graph
View on GitHub
This is the repo of "SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU"
☆14Dec 11, 2018Updated 7 years ago
FindHao / drgpu
View on GitHub
A Top-Down Profiler for GPU Applications
☆23Feb 29, 2024Updated 2 years ago
yanghaku / neu_bachelor_thesis_template_2021_for_cs
View on GitHub
东北大学本科毕业设计论文latex模板适应2021届新版书写印制规范针对计算机类专业
☆11Apr 21, 2021Updated 5 years ago
CDECatapult / mlpredict
View on GitHub
Python package to predict deep learning execution time
☆13Jul 26, 2022Updated 4 years ago
petermigi / protoss-tool
View on GitHub
慕课网 thinkphp5.0 微信小程序零食商贩项目小程序令牌测试工具
☆12Dec 13, 2018Updated 7 years ago
cornell-brg / torng-uecgra-scripts-hpca2021
View on GitHub
☆12Aug 4, 2022Updated 3 years ago
ritikraj7 / cpu-centric-agentic-ai
View on GitHub
A comprehensive benchmarking framework for evaluating and optimizing CPU-centric agentic AI systems across multiple workloads, reproducin…
☆49Feb 12, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cat538 / SKVQ
View on GitHub
[COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models
☆24Oct 5, 2024Updated last year
hao-ai-lab / flash-attention-fp4
View on GitHub
NVFP4 Flash-Attention 4 on BlackWell
☆31Jul 23, 2026Updated last week
InnovArul / personreid_sequential_rl
View on GitHub
An attempt to replicate the paper "Multi-shot Pedestrian Re-identification via Sequential Decision Making (CVPR2018)"
☆10Nov 16, 2019Updated 6 years ago
yester31 / Cutlass_EX
View on GitHub
study of cutlass
☆22Nov 10, 2024Updated last year
OpenMathLib / OpenVML
View on GitHub
Vector Math Library
☆87Nov 7, 2025Updated 8 months ago
zhanghb55 / parallel-and-distributed-computing-homework
View on GitHub
中山大学2020年并行与分布式计算作业
☆21Jul 28, 2020Updated 6 years ago
vnatesh / CAKE_on_CPU
View on GitHub
CAKE Library for constant-bandwidth matrix multiplication on CPUs
☆14Apr 6, 2024Updated 2 years ago
libxsmm / libxsmm
View on GitHub
Library for specialized dense and sparse matrix operations, and deep learning primitives.
☆969Updated this week
weishengying / cute_gemm
View on GitHub
☆23Aug 14, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ustc-compiler / 2017fall-student-teamworks
View on GitHub
Student teamworks summary repository for USTC Compiler H lecture in fall, 2017.
☆15Jan 13, 2018Updated 8 years ago
giaf / blasfeo
View on GitHub
Basic linear algebra subroutines for embedded optimization
☆415Jun 30, 2026Updated 3 weeks ago
illinois-impact / klap
View on GitHub
A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches
☆15Jun 21, 2019Updated 7 years ago
flame / how-to-optimize-gemm
View on GitHub
☆2,025Jul 29, 2023Updated 3 years ago
TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
pec27 / hfof
View on GitHub
Friends-of-Friends via spatial hashing
☆15May 26, 2023Updated 3 years ago
791136190 / awesome-qat
View on GitHub
☆21Apr 13, 2022Updated 4 years ago