aladinggit / RDMI
Repository for Remote Direct Memory Introspection (RDMI).
☆20 · Updated last year
Related projects
Alternatives and complementary repositories for RDMI
- The prototype for NSDI paper "NetHint: White-Box Networking for Multi-Tenant Data Centers" ☆25 · Updated 9 months ago
- ☆33 · Updated 4 months ago
- ☆10 · Updated 6 months ago
- Website for Artifact Evaluation at EuroSys, SOSP, OSDI, ATC ☆31 · Updated 3 weeks ago
- Nu is a new datacenter system that enables developers to build fungible applications that can use datacenter resources wherever they are. ☆35 · Updated 6 months ago
- This repository contains code for the paper: Bergsma S., Zeyl T., Senderovich A., and Beck J. C., "Generating Complex, Realistic Cloud Wo… ☆42 · Updated 3 years ago
- Aequitas enables RPC-level QoS in datacenter networks. ☆16 · Updated 2 years ago
- Benchmark Test Suite for RDMA Networks ☆49 · Updated last year
- Cupcake: A Compression Scheduler for Scalable Communication-Efficient Distributed Training (MLSys '23) ☆9 · Updated last year
- A Hybrid Framework to Build High-performance Adaptive Neural Networks for Kernel Datapath ☆25 · Updated last year
- ☆22 · Updated 2 months ago
- Benchmark Suite for RDMA Performance Isolation ☆36 · Updated last year
- Herald: Accelerating Neural Recommendation Training with Embedding Scheduling (NSDI 2024) ☆16 · Updated 6 months ago
- ☆15 · Updated 4 months ago
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2… ☆15 · Updated last year
- Random collections of my interested research papers / projects ☆20 · Updated 3 years ago
- Canvas: Isolated and Adaptive Swapping for Multi-Applications on Remote Memory ☆36 · Updated last year
- Managed collective communication service ☆12 · Updated 2 months ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024) ☆11 · Updated 5 months ago
- ☆38 · Updated last month
- A list of network measurement sketch algorithms implemented in eBPF ☆48 · Updated 6 months ago
- ☆16 · Updated last year
- A rust-based benchmark for BlueField SmartNICs. ☆27 · Updated last year
- Cluster Far Mem, framework to execute single job and multi job experiments using fastswap ☆21 · Updated 10 months ago
- ☆14 · Updated 5 months ago
- Bamboo is a system for running large pipeline-parallel DNNs affordably, reliably, and efficiently using spot instances. ☆47 · Updated last year
- This repository contains a list of papers on various topics (that I am working/worked on) in the system and networking area. ☆69 · Updated last month
- Hermit: Low-Latency, High-Throughput, and Transparent Remote Memory via Feedback-Directed Asynchrony ☆32 · Updated 5 months ago
- Sources and examples for ASPLOS20 paper ☆14 · Updated 4 years ago
- Code for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24] ☆16 · Updated last month