Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication
☆15Mar 25, 2024Updated last year
Alternatives and similar repositories for arrow-matrix
Users that are interested in arrow-matrix are comparing it to the libraries listed below
Sorting:
- FROSTT: the Formidable Repository of Open Sparse Tensors and Tools.☆12Jun 11, 2025Updated 9 months ago
- NeuroSpector: Dataflow and Mapping Optimizer for Deep Neural Network Accelerators☆21Mar 20, 2025Updated last year
- An example model of a Network Processing Unit using the PFPSim framework.☆13Aug 23, 2016Updated 9 years ago
- Official code repository for the papers "Anti-Symmetric DGN: a stable architecture for Deep Graph Networks" accepted at ICLR 2023; "Non-D…☆15Jan 2, 2025Updated last year
- ☆14Mar 7, 2022Updated 4 years ago
- ☆19Oct 7, 2025Updated 5 months ago
- Selected Decomposition Routines☆21Aug 30, 2025Updated 6 months ago
- NeuraChip Accelerator Simulator☆16Apr 26, 2024Updated last year
- MendelMax RepRap Printer☆49Dec 9, 2011Updated 14 years ago
- Hardware-accelerated sorting algorithm☆16May 4, 2020Updated 5 years ago
- ☆14Nov 28, 2023Updated 2 years ago
- A Portable Linux-based Firmware for NVMe Computational Storage Devices☆31Jun 10, 2025Updated 9 months ago
- A set of tools for understanding F2FS usage of ZNS devices, which allow for identifying the on-device locations of files and inodes, mapp…☆20Jan 19, 2025Updated last year
- TRAGEN: A Synthetic Trace Generator for Realistic Cache Simulations☆22Mar 25, 2024Updated last year
- TIDENet is an ASIC written in Verilog for Tiny Image Detection at Edge with neural networks (TIDENet) using DNNWeaver 2.0, the Google Sky…☆17Jan 30, 2023Updated 3 years ago
- ☆11Jun 14, 2024Updated last year
- A Full-System Framework for Simulating NDP devices from Caches to DRAM☆21Jan 12, 2024Updated 2 years ago
- ☆13Jun 18, 2025Updated 9 months ago
- Code for High Performance Unstructured SpMM Computation Using Tensor Cores☆33Nov 3, 2024Updated last year
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…☆46May 22, 2024Updated last year
- [TRETS'23, FPT'20] CHIP-KNN: Configurable and HIgh-Performance K-Nearest Neighbors Accelerator on Cloud FPGAs☆18Apr 9, 2024Updated last year
- LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation ICML_2023☆13Oct 27, 2023Updated 2 years ago
- This repository describes I/O traces of Google storage servers and disks synthesized by Thesios. Thesios synthesizes representative I/O t…☆25Apr 29, 2024Updated last year
- Check your data which may be stolen every time you visit a site. ⚠️☆13Dec 7, 2022Updated 3 years ago
- Open source RTL simulation acceleration on commodity hardware☆34Apr 13, 2023Updated 2 years ago
- Repo for PyChart 1.39, refs http://download.gna.org/pychart/☆10Sep 29, 2014Updated 11 years ago
- HW accelerator mapping optimization framework for in-memory computing☆28Jun 3, 2025Updated 9 months ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube☆23Aug 10, 2018Updated 7 years ago
- Source Code for the paper Titled FASTHash: FPGA-Based High Throughput Parallel Hash Table published in ISC high performance 2020☆27Apr 11, 2022Updated 3 years ago
- Implementation of ICML'24 Paper "Graph Distillation with Eigenbasis Matching"☆15Jul 2, 2024Updated last year
- ReDMArk: Bypassing RDMA Security Mechanisms.☆43Oct 19, 2020Updated 5 years ago
- Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix☆15Jun 3, 2020Updated 5 years ago
- Finite-difference option pricer for GPU☆14Feb 29, 2024Updated 2 years ago
- Gallatin is a general-purpose memory manager for CUDA that allows for threads to quickly malloc and free memory of arbitrary size inside …☆25Feb 4, 2026Updated last month
- Verilog hardware abstraction library☆49Mar 13, 2026Updated last week
- ☆17Oct 21, 2020Updated 5 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy☆92Jan 28, 2026Updated last month
- High-Performance Machine Learning Primitives☆13Apr 17, 2021Updated 4 years ago
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator☆57Jun 12, 2021Updated 4 years ago