☆17Jan 24, 2024Updated 2 years ago
Alternatives and similar repositories for souffle-ae
Users that are interested in souffle-ae are comparing it to the libraries listed below
Sorting:
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆32Apr 27, 2024Updated last year
- ☆13May 8, 2025Updated 9 months ago
- ☆13Sep 19, 2024Updated last year
- ☆12Jan 7, 2025Updated last year
- ☆25Feb 20, 2024Updated 2 years ago
- ☆18Mar 4, 2025Updated last year
- ☆14Nov 9, 2024Updated last year
- ☆33Jul 17, 2024Updated last year
- My study note for mlsys☆14Nov 4, 2024Updated last year
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 24, 2026Updated last week
- ☆48Jul 13, 2024Updated last year
- ☆23Jun 11, 2025Updated 8 months ago
- A New Format for SIMD-accelerated SpMV☆22Apr 4, 2022Updated 3 years ago
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆56May 29, 2024Updated last year
- play gemm with tvm☆92Jul 22, 2023Updated 2 years ago
- Horizontal Fusion☆24Jan 7, 2022Updated 4 years ago
- autoTVM神经网络推理代码优化搜索演示,基于tvm编译开源模型centerface,并使用autoTVM搜索最优推理代码, 最终部署编译为c++代码,演示平台是cuda,可以是其他平台,例如树莓派,安卓手机,苹果手机.Thi is a demonstration of …☆29May 6, 2021Updated 4 years ago
- Artifacts of EVT ASPLOS'24☆29Mar 6, 2024Updated last year
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆121Oct 26, 2022Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆34Feb 10, 2025Updated last year
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆66Apr 12, 2024Updated last year
- A Easy-to-understand TensorOp Matmul Tutorial☆410Feb 11, 2026Updated 3 weeks ago
- EdgeCortix maintained and extended fork of Apache TVM compiler stack utilized by MERA framework. TVM is an open deep learning compiler st…☆11Dec 22, 2023Updated 2 years ago
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆143Mar 31, 2023Updated 2 years ago
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- RUHMI (Robust Unified Heterogeneous Model Integration) for RZ/V series is a framework for AI model optimization and deployment, powered b…☆55Jan 27, 2026Updated last month
- ☆20May 24, 2025Updated 9 months ago
- ☆289Feb 4, 2026Updated last month
- This is a re-implementation of our KDD 2020 paper "Grammatically Recognizing Images with Tree Convolution."☆13Dec 9, 2020Updated 5 years ago
- PSO , Simulated Annealing PSO , Chaotic SAPSO, Neural network, Nonlinear Function☆11Apr 6, 2022Updated 3 years ago
- DiscreteTom's Blog Boilerplate.☆10Mar 6, 2023Updated 3 years ago
- a vue-demo:vue仿网易新闻m站☆10Jul 26, 2017Updated 8 years ago
- ☆16Jan 14, 2025Updated last year
- a simple API to use CUPTI☆11Aug 19, 2025Updated 6 months ago
- ☆10Mar 2, 2024Updated 2 years ago
- Python implementation of a Genetic Algorithm for the Resource-Constrained Project Scheduling Problem☆14May 29, 2023Updated 2 years ago
- ☆11Apr 2, 2024Updated last year
- ☆12Jun 29, 2024Updated last year
- ☆11Mar 15, 2023Updated 2 years ago