☆17Nov 10, 2025Updated 6 months ago
Alternatives and similar repositories for tensorcast
Users that are interested in tensorcast are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 3 years ago
- ☆30May 13, 2026Updated last week
- ☆112Feb 26, 2026Updated 2 months ago
- Fibertree emulator☆17Nov 4, 2024Updated last year
- An open source branch of AIE API☆15Apr 30, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- IREE plugin repository for the AMD AIE accelerator☆130May 7, 2026Updated 2 weeks ago
- ☆10Aug 30, 2024Updated last year
- ☆10Nov 16, 2024Updated last year
- Weakly Supervised Object Localization via Class RE-Activation Mapping☆12Sep 19, 2022Updated 3 years ago
- Fork of LLVM to support AMD AIEngine processors☆200Updated this week
- [PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…☆10Aug 13, 2024Updated last year
- Attention-Based Guided Structured Sparsity of Deep Neural Networks☆29Mar 22, 2020Updated 6 years ago
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- Boosting 4-bit inference kernels with 2:4 Sparsity☆96Sep 4, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The source code of the experimental evaluation of Deprez et al. (nd)☆12Oct 8, 2025Updated 7 months ago
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation☆14Jul 15, 2016Updated 9 years ago
- ☆13Dec 10, 2022Updated 3 years ago
- ☆19Dec 10, 2021Updated 4 years ago
- LLVM/MLIR based compiler instrumentation of AMD GPU kernels☆20Jul 13, 2025Updated 10 months ago
- Various low power labs using sky130☆13Sep 3, 2021Updated 4 years ago
- ☆39Jul 16, 2025Updated 10 months ago
- ☆27Apr 28, 2026Updated 3 weeks ago
- ☆19Jan 17, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A GPU FP32 computation method with Tensor Cores.☆27Dec 8, 2025Updated 5 months ago
- Explore training for quantized models☆26Jul 12, 2025Updated 10 months ago
- An MLIR-based toolchain for AMD AI Engine-enabled devices.☆635Updated this week
- Official code for Class Tokens Infusion for Weakly Supervised Semantic Segmentation, CVPR2024☆23Oct 26, 2024Updated last year
- RISC-V ISA based 32-bit processor written in HLS☆16Nov 7, 2019Updated 6 years ago
- ☆31May 13, 2026Updated last week
- moderngpu algorithms for C++ shaders☆16Mar 3, 2021Updated 5 years ago
- HyFiSS: A Hybrid Fidelity Stall-Aware Simulator for GPGPUs☆42Dec 9, 2024Updated last year
- Verdvana‘s Blog☆22Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆113Updated this week
- python package of rocm-smi-lib☆25Dec 15, 2025Updated 5 months ago
- ☆179May 14, 2026Updated last week
- SPAA'21: Efficient Stepping Algorithms and Implementations for Parallel Shortest Paths☆21Aug 10, 2024Updated last year
- AI Tensor Engine for ROCm☆440Updated this week
- The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]☆68Jun 19, 2025Updated 11 months ago
- The Riallto Open Source Project from AMD☆86Apr 10, 2025Updated last year