ynsumanth / Parallel-BFS
Implementation of Parallel Breadth-First Search on Distributed Memory Systems
☆11Updated 8 years ago
Related projects: ⓘ
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS☆17Updated 2 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference☆16Updated last year
- A TVM-like CUDA/C code generator.☆8Updated 2 years ago
- My paper/code reading notes in Chinese☆44Updated 4 months ago
- This repo stores a more profound view of Computer Architecture: A Quantitative Approach that tells multi-tenancy, virtualize, fine graine…☆24Updated 7 months ago
- A naive key-value database as the project of Storage Technology Foundations course☆10Updated 5 years ago
- my big project PA for the course Introduction To Computer System in Nanjing University, building an operating system based on qemu called…☆13Updated 4 years ago
- rCore_tutorial_tests☆10Updated 3 years ago
- ☆22Updated 6 months ago
- Using OpenMP to optimize BFS:☆14Updated 3 years ago
- A toy example of database by C++☆22Updated 5 years ago
- General system research material (not limited to paper) reading notes.☆20Updated 3 years ago
- A database management system implemented in Rust from scratch.☆22Updated 3 years ago
- Spring 2022 Course Website for Operating System Course at Peking University☆11Updated last year
- An external memory allocator example for PyTorch.☆13Updated 2 years ago
- A Skew-Resistant Index for Processing-in-Memory☆22Updated 9 months ago
- Yet another toy CPU.☆81Updated 9 months ago
- Rebuild YatSenOS On RISC-V 64.☆19Updated 2 years ago
- Vector search with bounded performance.☆33Updated 7 months ago
- ☆18Updated 3 years ago
- The Verilog implementation of five-stage-pipelined MIPS CPU (Classic RISC pipeline)☆18Updated 2 months ago
- Arya: Arbitrary Graph Pattern Mining with Decomposition-based Sampling☆13Updated 11 months ago
- 上海交通大学软件学院课程计算机系统基础(ICS)笔记☆12Updated 2 years ago
- GHive: Accelerating Analytical Query Processing in Apache Hive via CPU-GPU Heterogeneous Computing.☆14Updated 10 months ago
- Seminar on selected tools in Computer Science☆24Updated 3 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆19Updated 6 months ago
- Final project for course 'Introduction to Databases' of Tsinghua University, Fall 2017☆41Updated 6 years ago
- DGEMM on KNL, achieve 75% MKL☆15Updated 2 years ago
- A GPU-accelerated DNN inference serving system that supports instant kernel preemption and biased concurrent execution in GPU scheduling.☆39Updated 2 years ago
- Some CS notes during Jiawei's undergrad.☆31Updated 2 years ago