howardlau1999 / autograd
A simple demonstration of how PyTorch autograd works
☆16Updated 3 years ago
Alternatives and similar repositories for autograd:
Users that are interested in autograd are comparing it to the libraries listed below
- An efficient concurrent graph processing system☆46Updated 3 years ago
- ☆20Updated last month
- Rebuild YatSenOS On RISC-V 64.☆19Updated 3 years ago
- PSTensor provides a way to hack the memory management of tensors in TensorFlow and PyTorch by defining your own C++ Tensor Class.☆10Updated 3 years ago
- ☆14Updated 2 years ago
- Code of the paper "Building an Efficient Key-Value Store in a Flexible Address Space", EuroSys '22☆21Updated 9 months ago
- High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph☆51Updated 2 years ago
- My paper/code reading notes in Chinese☆46Updated 9 months ago
- [OSDI 2024] Motor: Enabling Multi-Versioning for Distributed Transactions on Disaggregated Memory☆47Updated last year
- A memory profiler for NVIDIA GPUs to explore memory inefficiencies in GPU-accelerated applications.☆25Updated 5 months ago
- Codes for MO's Trading☆15Updated 2 years ago
- Source code for the FAST '23 paper “MadFS: Per-File Virtualization for Userspace Persistent Memory Filesystems”☆39Updated 2 years ago
- ☆11Updated 3 years ago
- ☆12Updated 9 months ago
- C++ interfaces for RDMA access☆66Updated last month
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆19Updated last month
- Repo for OSDI 2023 paper: "Ship your Critical Section Not Your Data: Enabling Transparent Delegation with TCLocks"☆14Updated 4 months ago
- SOTA Learning-augmented Systems☆35Updated 2 years ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆92Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆42Updated 2 years ago
- RLib is a header-only library for easier usage of RDMA.☆45Updated 3 years ago
- ☆20Updated 3 weeks ago
- Lightning In-Memory Object Store☆45Updated 3 years ago
- ☆32Updated 3 years ago
- Seminar on selected tools in Computer Science☆24Updated 4 years ago
- Ths is a fast RDMA abstraction layer that works both in the kernel and user-space.☆52Updated 4 months ago
- A User-Transparent Block Cache Enabling High-Performance Out-of-Core Processing with In-Memory Programs☆74Updated last year
- ☆20Updated 3 years ago
- PetPS: Supporting Huge Embedding Models with Tiered Memory☆30Updated 9 months ago
- Experimental KV store engine on non-volatile memory☆72Updated 4 years ago