a tensor computing compiler based tile programming for gpu, cpu or tpu
☆45Feb 2, 2026Updated last month
Alternatives and similar repositories for galois
Users that are interested in galois are comparing it to the libraries listed below
Sorting:
- a simple general program language☆100Feb 2, 2026Updated last month
- 面向多平台编译优化的深度学习中间表示☆10Oct 28, 2024Updated last year
- Fibertree emulator☆17Nov 4, 2024Updated last year
- My Paper Reading Lists and Notes.☆21Feb 17, 2026Updated 2 weeks ago
- GOCART Aerosol model including process library and framework interfaces (MAPL, NUOPC, and CCPP)☆22Updated this week
- NUOPC Community Mediator for Earth Prediction Systems☆30Mar 1, 2026Updated last week
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆32Jun 13, 2025Updated 8 months ago
- 记录阅读各类paper的想法笔记(关注体系结构,机器学习系统,深度学习,计算机视觉)☆25Oct 25, 2019Updated 6 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency☆114Sep 10, 2024Updated last year
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆31Jun 26, 2024Updated last year
- triton for dsa☆58Updated this week
- Kendryte K210 SBI support using RustSBI, provides privileged spec 1.12 environment by emulating it using 1.9.1☆37Feb 18, 2024Updated 2 years ago
- GUI for GHRepoSearcher. It allows to search online repositories on github.☆10May 20, 2022Updated 3 years ago
- Optimize pipelines for locality☆14Feb 21, 2026Updated 2 weeks ago
- An experimental CPU backend for Triton☆181Feb 25, 2026Updated last week
- DDC is a discrete domain computation library.☆46Updated this week
- This is an implementation of sgemm_kernel on L1d cache.☆233Feb 26, 2024Updated 2 years ago
- MICRO 2023 Evaluation Artifact for TeAAL☆10Oct 26, 2023Updated 2 years ago
- A parser for PTX 6.5☆13Jun 19, 2023Updated 2 years ago
- Performance Monitor library - This library records execution performance of a user code and reports the summary. The PMlib is able to use…☆11Mar 21, 2023Updated 2 years ago
- OpenMP offload playground☆10Nov 16, 2024Updated last year
- ☆15Sep 19, 2021Updated 4 years ago
- High-performance Atmospheric Radiation Package☆10Oct 21, 2018Updated 7 years ago
- Qt/Qml application using Google speech-to-text API to make voice commands☆11Jan 19, 2020Updated 6 years ago
- The Process Watchdog is a Linux-based utility designed to start, monitor and manage processes specified in a configuration file. It ensur…☆11Dec 27, 2025Updated 2 months ago
- Low-level Vision Model Deployment☆10May 27, 2023Updated 2 years ago
- A shared-memory FFT for the Kokkos ecosystem☆48Updated this week
- FlagGems is an operator library for large language models implemented in the Triton Language.☆909Updated this week
- salon for sharing thoughts and ideas☆12Nov 11, 2021Updated 4 years ago
- Implementation of Butler-Portugal algorithm for tensor canonicalization in Rust☆18Feb 12, 2026Updated 3 weeks ago
- C++ parser to read data from MATLAB .mat files☆10Oct 12, 2014Updated 11 years ago
- Export files of iTunes Backup☆11Jul 16, 2022Updated 3 years ago
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- libmpdata++ - a library of parallel MPDATA-based solvers for systems of generalised transport equations☆12Updated this week
- qt page router☆12May 7, 2024Updated last year
- 知识图谱推理 复现论文 https://arxiv.org/pdf/2010.04029.pdf☆11Oct 26, 2022Updated 3 years ago
- ☆10May 12, 2022Updated 3 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated last year
- ASKAP Benchmark Packages☆13Nov 3, 2023Updated 2 years ago