MLNetwork / rostamLinks
☆24Updated 4 years ago
Alternatives and similar repositories for rostam
Users that are interested in rostam are comparing it to the libraries listed below
Sorting:
- ☆66Updated 4 years ago
- HW/SW co-designed end-host RPC stack☆20Updated 4 years ago
- Repository for MLCommons Chakra schema and tools☆39Updated last year
- FpgaNIC is an FPGA-based Versatile 100Gb SmartNIC for GPUs [ATC 22]☆135Updated 2 years ago
- RPCNIC: A High-Performance and Reconfigurable PCIe-attached RPC Accelerator [HPCA2025]☆12Updated 11 months ago
- ☆15Updated last year
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆59Updated last year
- A Cycle-level simulator for M2NDP☆32Updated 2 months ago
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆22Updated 5 years ago
- C++/MPI proxies for distributed training of deep neural networks.☆15Updated 3 years ago
- Clio, ASPLOS'22.☆78Updated 3 years ago
- A Programmable Hardware Architecture for Network Transport Logic☆35Updated 4 years ago
- MultiPIM: A Detailed and Configurable Multi-Stack Processing-In-Memory Simulator☆56Updated 4 years ago
- ☆79Updated 4 years ago
- A fast, accurate, and easy-to-integrate memory simulator that model memory system performance with bandwidth--latency curves.☆30Updated 3 weeks ago
- Pin based tool for simulation of rack-scale disaggregated memory systems☆30Updated 8 months ago
- ☆20Updated 2 weeks ago
- Heterogenous ML accelerator☆19Updated 6 months ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆28Updated 4 months ago
- Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025☆100Updated 6 months ago
- Synthetic Traffic Models Capturing a Full Range of Cache Coherent Behaviour☆14Updated 6 years ago
- ☆20Updated 4 years ago
- PsPIN: A RISC-V in-network accelerator for flexible high-performance low-power packet processing☆103Updated 2 years ago
- GNNear: Accelerating Full-Batch Training of Graph NeuralNetworks with Near-Memory Processing☆13Updated 3 years ago
- PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…☆24Updated 10 months ago
- Simulator of a memory controller to connect DRAMSim and FlashDIMMSim into one unified memory☆17Updated last year
- An infrastructure for inline acceleration of network applications☆30Updated 4 years ago
- A PIM instrumentation, compilation, execution, simulation, and evaluation repository for BLIMP-style architectures.☆18Updated 3 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization☆33Updated last year
- GPU-accelerated LLM Training Simulator☆14Updated 4 months ago