☆16 · Nov 10, 2025 · Updated 5 months ago
Alternatives and similar repositories for tensorcast
Users who are interested in tensorcast are comparing it to the libraries listed below.
- ☆30 · Updated this week
- ☆107 · Feb 26, 2026 · Updated 2 months ago
- Fibertree emulator ☆17 · Nov 4, 2024 · Updated last year
- An open source branch of AIE API ☆14 · Apr 30, 2025 · Updated last year
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers". ☆11 · Feb 5, 2024 · Updated 2 years ago
- A lightweight triton-based General Matrix Multiplication (GEMM) library. ☆60 · Apr 22, 2026 · Updated last week
- ☆10 · Aug 30, 2024 · Updated last year
- ☆10 · Nov 16, 2024 · Updated last year
- Code for "Structured Sparsity Inducing Adaptive Optimizers for Deep Learning" in PyTorch ☆18 · Feb 11, 2021 · Updated 5 years ago
- Weakly Supervised Object Localization via Class RE-Activation Mapping ☆12 · Sep 19, 2022 · Updated 3 years ago
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu… ☆23 · Apr 2, 2025 · Updated last year
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity ☆22 · Jan 13, 2023 · Updated 3 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity ☆95 · Sep 4, 2024 · Updated last year
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation ☆14 · Jul 15, 2016 · Updated 9 years ago
- Bibtex for Sparsity in Deep Learning paper (https://arxiv.org/abs/2102.00554) - open for pull requests ☆46 · May 4, 2022 · Updated 3 years ago
- ☆13 · Dec 10, 2022 · Updated 3 years ago
- The goal of the OSSCI Fleet is to provide a central mechanism to enable test automation, batch job scheduling, and developer access to a … ☆13 · Mar 24, 2026 · Updated last month
- ☆19 · Dec 10, 2021 · Updated 4 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels ☆20 · Jul 13, 2025 · Updated 9 months ago
- Scale-out system monitoring ☆21 · Updated this week
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib… ☆23 · Mar 29, 2025 · Updated last year
- ☆39 · Jul 16, 2025 · Updated 9 months ago
- ☆19 · Jan 17, 2024 · Updated 2 years ago
- A GPU FP32 computation method with Tensor Cores. ☆26 · Dec 8, 2025 · Updated 4 months ago
- Explore training for quantized models ☆26 · Jul 12, 2025 · Updated 9 months ago
- Official code for Class Tokens Infusion for Weakly Supervised Semantic Segmentation, CVPR2024 ☆23 · Oct 26, 2024 · Updated last year
- ☆136 · Updated this week
- ☆23 · Aug 3, 2024 · Updated last year
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs ☆42 · Dec 9, 2024 · Updated last year
- ☆22 · Apr 2, 2023 · Updated 3 years ago
- Verdvana’s Blog ☆22 · Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆113 · Updated this week
- ☆177 · Updated this week
- SPAA'21: Efficient Stepping Algorithms and Implementations for Parallel Shortest Paths ☆21 · Aug 10, 2024 · Updated last year
- AI Tensor Engine for ROCm ☆420 · Updated this week
- ☆54 · Apr 23, 2026 · Updated last week
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025] ☆67 · Jun 19, 2025 · Updated 10 months ago
- The Riallto Open Source Project from AMD ☆86 · Apr 10, 2025 · Updated last year
- Board: PYNQ-Z2, Vitis version: 2022.1 ☆21 · Sep 2, 2024 · Updated last year