☆74Oct 31, 2024Updated last year
Alternatives and similar repositories for deeplink.framework
Users that are interested in deeplink.framework are comparing it to the libraries listed below
Sorting:
- ☆76Nov 22, 2024Updated last year
- ☆13May 23, 2025Updated 9 months ago
- A benchmark suited especially for deep learning operators☆42Feb 13, 2023Updated 3 years ago
- ☆28Jan 7, 2025Updated last year
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.☆33Aug 31, 2022Updated 3 years ago
- DLSlime: Flexible & Efficient Heterogeneous Transfer Toolkit☆91Jan 26, 2026Updated last month
- [DAC2024] A Holistic Functionalization Approach to Optimizing Imperative Tensor Programs in Deep Learning☆15Jan 13, 2024Updated 2 years ago
- ☆33Mar 13, 2026Updated last week
- ☆74Updated this week
- Distributed IO-aware Attention algorithm☆24Sep 24, 2025Updated 5 months ago
- ☆24Jul 7, 2024Updated last year
- [IJCAI2023] An automated parallel training system that combines the advantages from both data and model parallelism. If you have any inte…☆52May 31, 2023Updated 2 years ago
- Reconstruction from edge image combined with color and gradient difference for industrial surface anomaly detection☆29Aug 15, 2024Updated last year
- Build TVM docker image for production compilation deployments☆12Sep 7, 2021Updated 4 years ago
- 基于InterLM的《黑神话:悟空》AI小助手,了解更多背后的故事--在更新视频中☆35Jan 4, 2025Updated last year
- ☆41Jun 5, 2024Updated last year
- Source code of the paper "OpSparse: a Highly Optimized Framework for Sparse General Matrix Multiplication on GPUs"☆16Aug 23, 2022Updated 3 years ago
- ☆14Aug 3, 2024Updated last year
- ☆23Aug 21, 2025Updated 6 months ago
- DLBlas: clean and efficient kernels☆35Updated this week
- extensible collectives library in triton☆97Mar 31, 2025Updated 11 months ago
- C++ package to store Matrix Market (.mtx) file format sparse matrices in Compressed Row Storage (CSR) format.☆16Oct 16, 2019Updated 6 years ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆43Dec 11, 2023Updated 2 years ago
- Yinghan's Code Sample☆364Jul 25, 2022Updated 3 years ago
- Simple intermediate representation language for learning and research.☆20Mar 27, 2020Updated 5 years ago
- C++ "borrowing" smart pointer.☆11May 13, 2022Updated 3 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆112Sep 10, 2024Updated last year
- Compiler for Dynamic Neural Networks☆45Nov 13, 2023Updated 2 years ago
- ☆192Jun 16, 2024Updated last year
- TORCH_TRACE parser for PT2☆78Updated this week
- InternEvo is an open-sourced lightweight training framework aims to support model pre-training without the need for extensive dependencie…☆419Aug 21, 2025Updated 7 months ago
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆125Jun 23, 2022Updated 3 years ago
- Machine Learning Compiler Road Map☆46Sep 12, 2023Updated 2 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- ☆10Feb 24, 2025Updated last year
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆67Apr 12, 2024Updated last year
- ☆192Mar 28, 2023Updated 2 years ago
- ☆11Apr 29, 2024Updated last year
- ☆14Jun 30, 2021Updated 4 years ago