aladinggit / RDMI
This is the repo for remote direct memory introspection.
☆19Updated last year
Related projects:
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers"☆25Updated 7 months ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆12Updated last year
- Dorylus: Affordable, Scalable, and Accurate GNN Training☆77Updated 3 years ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances.☆46Updated last year
- ☆12Updated 3 months ago
- ☆25Updated 2 months ago
- A rust-based benchmark for BlueField SmartNICs.☆26Updated last year
- Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony☆31Updated 3 months ago
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23)☆8Updated last year
- ☆48Updated 3 years ago
- Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory☆36Updated last year
- ☆14Updated 5 months ago
- ☆37Updated this week
- A collection of tools, code, and documentation to understand the host network on real server hardware.☆17Updated last month
- ☆15Updated 3 weeks ago
- This repository contains code for the paper: Bergsma S., Zeyl T., Senderovich A., and Beck J. C., "Generating Complex, Realistic Cloud Wo…☆42Updated 2 years ago
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap☆21Updated 8 months ago
- This repository contains a list of papers on various topics (that I am working/worked on) in the system and networking area.☆66Updated last month
- A Hybrid Framework to Build High-performance Adaptive Neural Networks for Kernel Datapath☆24Updated last year
- Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC☆29Updated last week
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆31Updated last year
- Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024)☆12Updated 4 months ago
- Justitia provides RDMA isolation between applications with diverse requirements.☆37Updated 2 years ago
- Random collections of my interested research papers / projects☆18Updated 3 years ago
- Aequitas enables RPC-level QoS in datacenter networks.☆16Updated 2 years ago
- ☆45Updated last year
- ☆12Updated 3 months ago
- The accelerometer analytical model published in ASPLOS 2020 (Accelerometer: Understanding Acceleration Opportunities for Data Center Overh…☆15Updated 4 years ago
- Sources and examples for ASPLOS20 paper☆14Updated 4 years ago
- Nu is a new datacenter system that enables developers to build fungible applications that can use datacenter resources wherever they are.☆34Updated 4 months ago