☆30Apr 18, 2024Updated 2 years ago
Alternatives and similar repositories for LibShalom
Users that are interested in LibShalom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Apr 8, 2022Updated 4 years ago
- A direct convolution library targeting ARM multi-core CPUs.☆12Nov 27, 2024Updated last year
- An automatic test case generator for C source code using Memorized Symbolic Execution☆12May 4, 2023Updated 3 years ago
- Sparse kernels for GNNs based on TVM☆17Nov 18, 2020Updated 5 years ago
- Absinthe is an optimization framework to fuse and tile stencil codes in one shot☆14Jul 17, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆10Apr 24, 2023Updated 3 years ago
- DietCode Code Release☆65Jul 21, 2022Updated 3 years ago
- ☆16Nov 26, 2025Updated 6 months ago
- Stepwise optimizations of DGEMM on CPU, reaching performance faster than Intel MKL eventually, even under multithreading.☆163Feb 3, 2022Updated 4 years ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Aug 20, 2025Updated 9 months ago
- ☆22Aug 14, 2024Updated last year
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- HPC Challenge Benchmark☆70Sep 28, 2025Updated 8 months ago
- GEMM by WMMA (tensor core)☆15Jul 31, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Apr 13, 2022Updated 4 years ago
- SfMEdu System from Princeton for Dense 3D Reconstruction☆11Dec 11, 2019Updated 6 years ago
- symmetric int8 gemm☆67Jun 7, 2020Updated 6 years ago
- ☆10Jun 4, 2021Updated 5 years ago
- ☆10Mar 2, 2024Updated 2 years ago
- 将MNN拆解的简易前向推理框架(for study!)☆24Feb 21, 2021Updated 5 years ago
- This is the repo of "SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU"☆14Dec 11, 2018Updated 7 years ago
- ☆39Feb 28, 2020Updated 6 years ago
- The repository maintains the source code for the article titled "Optimizing Attention by Exploiting Data Reuse on ARM Multi-core CPUs."☆17Dec 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12May 3, 2020Updated 6 years ago
- A Top-Down Profiler for GPU Applications☆22Feb 29, 2024Updated 2 years ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- ☆11Aug 4, 2022Updated 3 years ago
- Python package to predict deep learning execution time☆13Jul 26, 2022Updated 3 years ago
- 慕课网 thinkphp5.0 微信小程序 零食商贩项目 小程 序令牌测试工具☆12Dec 13, 2018Updated 7 years ago
- [COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models☆24Oct 5, 2024Updated last year
- 东北大学本科毕业设计 论文latex模板 2020 针对计算机相关专业☆12Jun 10, 2020Updated 6 years ago
- ☆34Mar 31, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- study of cutlass☆22Nov 10, 2024Updated last year
- Library for specialized dense and sparse matrix operations, and deep learning primitives.☆962Updated this week
- ☆84Apr 29, 2026Updated last month
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆92Nov 23, 2022Updated 3 years ago
- Basic linear algebra subroutines for embedded optimization☆412Jun 12, 2026Updated last week
- ☆17Aug 18, 2025Updated 10 months ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago