deathwings602 / Unified-IRView external linksLinks
面向多平台编译优化的深度学习中间表示
☆10Oct 28, 2024Updated last year
Alternatives and similar repositories for Unified-IR
Users that are interested in Unified-IR are comparing it to the libraries listed below
Sorting:
- A torch compile backend for multi-targets☆45Jan 28, 2026Updated 2 weeks ago
- FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …☆32Dec 21, 2024Updated last year
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆31Jun 13, 2025Updated 8 months ago
- Optimize tensor program fast with Felix, a gradient descent autotuner.☆30Apr 27, 2024Updated last year
- CXL remote offloading data movement aware compiler☆72Jan 4, 2026Updated last month
- Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.☆441Feb 4, 2026Updated last week
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Feb 2, 2026Updated 2 weeks ago
- SOTA Learning-augmented Systems☆37May 21, 2022Updated 3 years ago
- Vision Based Inspection tool comprising of retrained Inception V3 network and OpenCV Filters for fracture detection. Published at A2IC 20…☆11Jun 19, 2019Updated 6 years ago
- ☆14May 29, 2025Updated 8 months ago
- 西安电子科技大学本科生毕业设计(论文)LaTeX模板☆11Jun 7, 2016Updated 9 years ago
- libFastMesh - Optimized Finite Volume Computational Aeroacoustics (CAA) Code☆13Mar 28, 2024Updated last year
- A modification for Klei's Oxygen Not Included☆10Jun 9, 2022Updated 3 years ago
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning☆87Nov 29, 2025Updated 2 months ago
- A unified programming framework for high and portable performance across FPGAs and GPUs☆11Mar 23, 2025Updated 10 months ago
- Phase Only Correlation in Python☆11Jan 20, 2021Updated 5 years ago
- A standalone CXL-enabled system simulator.☆18Jan 10, 2026Updated last month
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration☆33Jan 8, 2026Updated last month
- An MLIR-based AI compiler designed for Python frontend to RISC-V DSA☆13Oct 10, 2024Updated last year
- ASKAP Benchmark Packages☆13Nov 3, 2023Updated 2 years ago
- A simple pseudo-spectral solver for the Direct Numerical Simulation (DNS) of the 3D Taylor-Green Vortex in the Julia programming language☆10Jun 6, 2022Updated 3 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated last year
- ☆10Dec 8, 2021Updated 4 years ago
- Setuptools plugin to protobuf stub generation.☆10Mar 5, 2018Updated 7 years ago
- Hindcast Initial Condition Creation Utility/Processor☆11Updated this week
- An unofficial wrapper of Baidu Baike☆12Feb 20, 2014Updated 11 years ago
- ☆14Nov 11, 2024Updated last year
- Exploring Machine Learning methods and workflows in a simplified weather model☆19Jun 6, 2024Updated last year
- 李慧琴《Linux 系统编程》& APUE 笔记☆18Nov 16, 2023Updated 2 years ago
- Fast and memory-efficient exact attention☆18Jan 23, 2026Updated 3 weeks ago
- Fibertree emulator☆16Nov 4, 2024Updated last year
- ☆27Aug 4, 2025Updated 6 months ago
- Python Script to Open SJTU Dormitory Smart Lock☆10Sep 12, 2022Updated 3 years ago
- (elastic) cuckoo hashing☆15Jun 20, 2020Updated 5 years ago
- benchmark for linux server☆13Nov 6, 2016Updated 9 years ago
- A GPU-accelerated differentiable fluid simulator written in JAX.☆11Feb 1, 2021Updated 5 years ago
- A Compiler from "Mx* language" (A C++ & Java like language) to RV32I Assembly, with optimizations on LLVM IR. SJTU CS2966 Project.☆12Feb 12, 2023Updated 3 years ago