☆16 · Nov 10, 2025 · Updated 4 months ago
Alternatives and similar repositories for tensorcast
Users who are interested in tensorcast compare it to the libraries listed below.
- ☆21 · Mar 12, 2026 · Updated last week
- ☆30 · Mar 2, 2026 · Updated 2 weeks ago
- iOS Swift Realtime video manipulation ☆16 · Oct 1, 2015 · Updated 10 years ago
- An open source branch of AIE API ☆14 · Apr 30, 2025 · Updated 10 months ago
- A lightweight Triton-based General Matrix Multiplication (GEMM) library. ☆51 · Updated this week
- PyTorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training" ☆13 · Jun 7, 2023 · Updated 2 years ago
- ☆10 · Nov 16, 2024 · Updated last year
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min… ☆10 · Aug 13, 2024 · Updated last year
- PyTorch implementation of our paper accepted by NeurIPS 2022 -- "Learning Best Combination for Efficient N:M Sparsity" ☆22 · Jan 13, 2023 · Updated 3 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity ☆94 · Sep 4, 2024 · Updated last year
- The source code of the experimental evaluation of Deprez et al. (nd) ☆12 · Oct 8, 2025 · Updated 5 months ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation ☆14 · Jul 15, 2016 · Updated 9 years ago
- ☆14 · Dec 10, 2022 · Updated 3 years ago
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a … ☆13 · Feb 27, 2026 · Updated 3 weeks ago
- ☆19 · Dec 10, 2021 · Updated 4 years ago
- A small, interactive GUI/visualizer tool for SPS spectra, powered by bagpipes ☆14 · Jul 14, 2025 · Updated 8 months ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆20 · Jul 13, 2025 · Updated 8 months ago
- Scale-out system monitoring ☆21 · Updated this week
- A pure Python halo-model implementation for power spectra of any large-scale structure tracer combination. ☆18 · Apr 26, 2024 · Updated last year
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- ☆38 · Jul 16, 2025 · Updated 8 months ago
- ☆23 · Updated this week
- Python implementation of Efficient Graph-Based Image Segmentation ☆24 · Sep 26, 2020 · Updated 5 years ago
- Data of SDSS DR12 ☆17 · Jun 26, 2025 · Updated 8 months ago
- A GPU FP32 computation method with Tensor Cores. ☆26 · Dec 8, 2025 · Updated 3 months ago
- ☆29 · Updated this week
- ☆128 · Updated this week
- Official code for Class Tokens Infusion for Weakly Supervised Semantic Segmentation, CVPR2024 ☆23 · Oct 26, 2024 · Updated last year
- ☆21 · Aug 3, 2024 · Updated last year
- moderngpu algorithms for C++ shaders ☆16 · Mar 3, 2021 · Updated 5 years ago
- ☆20 · Apr 2, 2023 · Updated 2 years ago
- Verdvana's Blog ☆22 · Feb 3, 2026 · Updated last month
- Python package of rocm-smi-lib ☆24 · Dec 15, 2025 · Updated 3 months ago
- ☆172 · Updated this week
- SPAA'21: Efficient Stepping Algorithms and Implementations for Parallel Shortest Paths ☆21 · Aug 10, 2024 · Updated last year
- AI Tensor Engine for ROCm ☆385 · Updated this week
- ☆11 · Jun 18, 2020 · Updated 5 years ago
- The Riallto Open Source Project from AMD ☆85 · Apr 10, 2025 · Updated 11 months ago
- Final Project of Software_Hardware_Co-Design_24Spring. FPGA-based RISC-V+ Convolutional Acceleration Unit. ☆23 · May 7, 2024 · Updated last year