summerspringwei/souffle-ae

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/summerspringwei/souffle-ae)

summerspringwei / souffle-ae

☆17

Alternatives and similar repositories for souffle-ae

Users that are interested in souffle-ae are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uiuc-arc / felix
View on GitHub
Optimize tensor program fast with Felix, a gradient descent autotuner.
☆33Mar 5, 2026Updated 4 months ago
FuyuWang / Soter
View on GitHub
☆13Jan 7, 2025Updated last year
tfruan2000 / mlsys-study-note
View on GitHub
My study note for mlsys
☆14Nov 4, 2024Updated last year
microsoft / cusync
View on GitHub
☆27Feb 20, 2024Updated 2 years ago
ZhW-loop / UniCoMo
View on GitHub
☆13Sep 19, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
humuyan / Korch
View on GitHub
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆41Mar 27, 2025Updated last year
yonsei-hpcp / gcom
View on GitHub
☆15May 8, 2025Updated last year
hgl71964 / cuasmrl
View on GitHub
☆19Nov 9, 2024Updated last year
IPRC-ICT / Heron
View on GitHub
Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators
☆23Jan 30, 2024Updated 2 years ago
AlibabaResearch / mononn
View on GitHub
☆32Jul 17, 2024Updated 2 years ago
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Updated this week
zhaiyi000 / tlm
View on GitHub
☆49Jul 13, 2024Updated 2 years ago
alibaba / redfuser
View on GitHub
☆21Mar 17, 2026Updated 4 months ago
nox-410 / Welder
View on GitHub
OSDI 2023 Welder, deeplearning compiler
☆34Nov 24, 2023Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
HPMLL / SpInfer_EuroSys25
View on GitHub
☆35Apr 2, 2025Updated last year
pku-liang / MAGIS
View on GitHub
MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)
☆57May 29, 2024Updated 2 years ago
toyaix / triton-runner
View on GitHub
Multi-Level Triton Runner supporting Python, IR, PTX, AMDGCN, cubin and hasco.
☆98May 8, 2026Updated 2 months ago
apuaaChen / EVT_AE
View on GitHub
Artifacts of EVT ASPLOS'24
☆29Mar 6, 2024Updated 2 years ago
LeiWang1999 / tvm_gpu_gemm
View on GitHub
play gemm with tvm
☆91Jul 22, 2023Updated 3 years ago
pku-liang / TileFlow
View on GitHub
TileFlow is a performance analysis tool based on Timeloop for fusion dataflows
☆72Apr 12, 2024Updated 2 years ago
leoluopy / autotvm_tutorial
View on GitHub
autoTVM神经网络推理代码优化搜索演示，基于tvm编译开源模型centerface，并使用autoTVM搜索最优推理代码，　最终部署编译为c++代码，演示平台是cuda，可以是其他平台，例如树莓派，安卓手机，苹果手机．Thi is a demonstration of …
☆31May 6, 2021Updated 5 years ago
monellz / FlashTensor
View on GitHub
☆19Mar 4, 2025Updated last year
baco-authors / baco
View on GitHub
☆17Dec 8, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
pku-liang / AMOS
View on GitHub
Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators
☆125Oct 26, 2022Updated 3 years ago
nulidangxueshen / CSR2
View on GitHub
A New Format for SIMD-accelerated SpMV
☆22Apr 4, 2022Updated 4 years ago
aoli-al / HFuse
View on GitHub
Horizontal Fusion
☆24Jan 7, 2022Updated 4 years ago
uwsampl / SparseTIR
View on GitHub
SparseTIR: Sparse Tensor Compiler for Deep Learning
☆145Mar 31, 2023Updated 3 years ago
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
hgyhungry / alcop-artifact
View on GitHub
☆25Mar 15, 2023Updated 3 years ago
KnowingNothing / MatmulTutorial
View on GitHub
A Easy-to-understand TensorOp Matmul Tutorial
☆445Mar 5, 2026Updated 4 months ago
caiwanxianhust / flash-attention-opt
View on GitHub
flash attention 优化日志
☆31Jun 4, 2025Updated last year
robcasloz / llvm-discovery
View on GitHub
Discovery of Structured Parallelism In Sequential and Parallel Code
☆10Feb 13, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
CharlieCurry / tvm-learning
View on GitHub
TVM learning and research
☆13Jan 8, 2021Updated 5 years ago
RoySegal / tvmcon23_byoc
View on GitHub
☆11Mar 15, 2023Updated 3 years ago
KJLdefeated / RL.cu
View on GitHub
RLVR training for LLM in CUDA/C++
☆39Jun 8, 2026Updated last month
violetDelia / LLCompiler
View on GitHub
☆25Jun 11, 2025Updated last year
MoZeWei / moTuner
View on GitHub
☆10May 12, 2022Updated 4 years ago
TaKO8Ki / qcc
View on GitHub
[WIP] A toy C compiler written in Rust
☆16Mar 4, 2022Updated 4 years ago
tqchen / ffi-navigator
View on GitHub
☆249Jul 27, 2025Updated 11 months ago