Arrow Matrix Decomposition - Communication-Efficient Distributed Sparse Matrix Multiplication
☆15 · Mar 25, 2024 · Updated 2 years ago
Alternatives and similar repositories for arrow-matrix
Users that are interested in arrow-matrix are comparing it to the libraries listed below.
- FROSTT: the Formidable Repository of Open Sparse Tensors and Tools. ☆12 · Jun 11, 2025 · Updated 10 months ago
- NeuroSpector: Dataflow and Mapping Optimizer for Deep Neural Network Accelerators ☆21 · Mar 20, 2025 · Updated last year
- An example model of a Network Processing Unit using the PFPSim framework. ☆13 · Aug 23, 2016 · Updated 9 years ago
- Official code repository for the papers "Anti-Symmetric DGN: a stable architecture for Deep Graph Networks" accepted at ICLR 2023; "Non-D… ☆15 · Jan 2, 2025 · Updated last year
- ☆14 · Mar 7, 2022 · Updated 4 years ago
- ☆19 · Oct 7, 2025 · Updated 6 months ago
- Selected Decomposition Routines ☆22 · Aug 30, 2025 · Updated 7 months ago
- NeuraChip Accelerator Simulator ☆16 · Apr 26, 2024 · Updated last year
- MendelMax RepRap Printer ☆49 · Dec 9, 2011 · Updated 14 years ago
- Hardware-accelerated sorting algorithm ☆16 · May 4, 2020 · Updated 5 years ago
- ☆15 · Nov 28, 2023 · Updated 2 years ago
- A Portable Linux-based Firmware for NVMe Computational Storage Devices ☆31 · Jun 10, 2025 · Updated 10 months ago
- A set of tools for understanding F2FS usage of ZNS devices, which allow for identifying the on-device locations of files and inodes, mapp… ☆20 · Jan 19, 2025 · Updated last year
- TRAGEN: A Synthetic Trace Generator for Realistic Cache Simulations ☆22 · Mar 25, 2024 · Updated 2 years ago
- TIDENet is an ASIC written in Verilog for Tiny Image Detection at Edge with neural networks (TIDENet) using DNNWeaver 2.0, the Google Sky… ☆17 · Jan 30, 2023 · Updated 3 years ago
- ☆11 · Jun 14, 2024 · Updated last year
- A Full-System Framework for Simulating NDP devices from Caches to DRAM ☆21 · Jan 12, 2024 · Updated 2 years ago
- ☆13 · Jun 18, 2025 · Updated 9 months ago
- Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y… ☆46 · May 22, 2024 · Updated last year
- [TRETS'23, FPT'20] CHIP-KNN: Configurable and HIgh-Performance K-Nearest Neighbors Accelerator on Cloud FPGAs ☆18 · Apr 9, 2024 · Updated 2 years ago
- Code for High Performance Unstructured SpMM Computation Using Tensor Cores ☆35 · Nov 3, 2024 · Updated last year
- LazyGNN: Large-Scale Graph Neural Networks via Lazy Propagation ICML_2023 ☆13 · Oct 27, 2023 · Updated 2 years ago
- This repository describes I/O traces of Google storage servers and disks synthesized by Thesios. Thesios synthesizes representative I/O t… ☆26 · Apr 29, 2024 · Updated last year
- Open source RTL simulation acceleration on commodity hardware ☆35 · Apr 13, 2023 · Updated 2 years ago
- Check your data which may be stolen every time you visit a site. ⚠️ ☆13 · Dec 7, 2022 · Updated 3 years ago
- Repo for PyChart 1.39, refs http://download.gna.org/pychart/ ☆10 · Sep 29, 2014 · Updated 11 years ago
- HW accelerator mapping optimization framework for in-memory computing ☆28 · Jun 3, 2025 · Updated 10 months ago
- CasHMC: A Cycle-accurate Simulator for Hybrid Memory Cube ☆23 · Aug 10, 2018 · Updated 7 years ago
- Source Code for the paper Titled FASTHash: FPGA-Based High Throughput Parallel Hash Table published in ISC high performance 2020 ☆27 · Apr 11, 2022 · Updated 4 years ago
- Implementation of ICML'24 Paper "Graph Distillation with Eigenbasis Matching" ☆15 · Jul 2, 2024 · Updated last year
- ReDMArk: Bypassing RDMA Security Mechanisms. ☆44 · Oct 19, 2020 · Updated 5 years ago
- Distributed-memory, double-precision, polar decomposition (QDWH/ZOLO-PD) of a dense matrix, svd (QDWH/ZOLOPD-SVD) of a dense matrix ☆15 · Jun 3, 2020 · Updated 5 years ago
- Finite-difference option pricer for GPU ☆15 · Feb 29, 2024 · Updated 2 years ago
- Gallatin is a general-purpose memory manager for CUDA that allows for threads to quickly malloc and free memory of arbitrary size inside … ☆25 · Mar 27, 2026 · Updated 2 weeks ago
- Verilog hardware abstraction library ☆50 · Apr 1, 2026 · Updated last week
- ☆17 · Oct 21, 2020 · Updated 5 years ago
- NPBench - A Benchmarking Suite for High-Performance NumPy ☆92 · Jan 28, 2026 · Updated 2 months ago
- High-Performance Machine Learning Primitives ☆13 · Apr 17, 2021 · Updated 4 years ago
- Sparse Matrix-Matrix Multiplication Benchmark on Intel Xeon and Xeon Phi (KNC, KNL) from blog post: ☆12 · Sep 25, 2016 · Updated 9 years ago