howardlau1999 / autograd
A simple demonstration of how PyTorch autograd works
☆16Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for autograd
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Updated 2 years ago
- Code of the paper "Building an Efficient Key-Value Store in a Flexible Address Space", EuroSys '22☆21Updated 5 months ago
- Rebuild YatSenOS On RISC-V 64.☆19Updated 2 years ago
- An efficient concurrent graph processing system☆46Updated 3 years ago
- My paper/code reading notes in Chinese☆45Updated 6 months ago
- ☆19Updated 3 weeks ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆34Updated last year
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆48Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆39Updated 2 years ago
- ☆11Updated 3 years ago
- Codes for MO's Trading☆15Updated 2 years ago
- Lightning In-Memory Object Store☆44Updated 2 years ago
- Cocytus is an efficient and available in-memory K/V-store through hybrid erasure coding and replication☆30Updated 8 years ago
- ☆14Updated 2 years ago
- General system research material (not limited to paper) reading notes.☆20Updated 3 years ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆17Updated 2 years ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆74Updated last year
- An IR for efficiently simulating distributed ML computation.☆25Updated 10 months ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆43Updated 8 months ago
- Source code for DPTree: Differential Indexing for Persistent Memory☆61Updated 3 years ago
- C++ interfaces for RDMA access☆47Updated 3 weeks ago
- DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory (PVLDB 2022, VLDB 2023)☆36Updated last year
- ☆18Updated 2 weeks ago
- Fast RDMA-based Ordered Key-Value Store using Remote Learned Cache☆111Updated 3 years ago
- Thunder Research Group's Collective Communication Library☆26Updated 7 months ago
- website for systems seminar at UIUC☆17Updated this week
- ☆40Updated last month
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆54Updated 3 months ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19Updated 6 months ago
- ☆20Updated 3 years ago