☆18 · May 8, 2021 · Updated 4 years ago
Alternatives and similar repositories for MERCI
Users who are interested in MERCI are comparing it to the libraries listed below.
- An Optimizing Compiler for Recommendation Model Inference ☆26 · Jun 5, 2025 · Updated 8 months ago
- ☆12 · Oct 25, 2022 · Updated 3 years ago
- mit-6.824 distributed system labs demo in golang & python ☆11 · Nov 20, 2023 · Updated 2 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs ☆31 · Sep 19, 2024 · Updated last year
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University. ☆16 · Aug 20, 2025 · Updated 6 months ago
- ☆24 · Apr 13, 2025 · Updated 10 months ago
- Rebuild YatSenOS On RISC-V 64. ☆22 · Jan 6, 2022 · Updated 4 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference ☆19 · Mar 5, 2023 · Updated 2 years ago
- ☆26 · Aug 19, 2022 · Updated 3 years ago
- A framework for pipelined computing on GPU ☆30 · Jul 17, 2019 · Updated 6 years ago
- Multi-GPU dynamic scheduler using PGAS style cross-GPU communication ☆29 · Jul 23, 2023 · Updated 2 years ago
- Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core. ☆30 · Feb 12, 2022 · Updated 4 years ago
- ☆33 · Sep 9, 2020 · Updated 5 years ago
- Accelerating Recommender model training by leveraging popular choices -- VLDB 2022 ☆31 · Sep 15, 2024 · Updated last year
- [HPCA 2022] GCoD: Graph Convolutional Network Acceleration via Dedicated Algorithm and Accelerator Co-Design ☆39 · Mar 30, 2022 · Updated 3 years ago
- Code accompanying the NeurIPS 2019 paper AutoAssist: A Framework to Accelerate Training of Deep Neural Networks. ☆14 · Oct 3, 2022 · Updated 3 years ago
- ☆36 · Jun 10, 2024 · Updated last year
- Boosting GPU utilization for LLM serving via dynamic spatial-temporal prefill & decode orchestration ☆36 · Jan 8, 2026 · Updated last month
- ☆11 · Sep 25, 2021 · Updated 4 years ago
- ☆11 · Aug 21, 2023 · Updated 2 years ago
- Rewrite OpenGFW in Rust, with web-ui. ☆17 · Mar 3, 2025 · Updated last year
- LaTeX resume template ☆12 · Mar 29, 2012 · Updated 13 years ago
- Let's count triangles together! ☆10 · Jun 27, 2024 · Updated last year
- ☆11 · Dec 23, 2019 · Updated 6 years ago
- Yat another MySQL storage engine, a database course project. ☆13 · Dec 23, 2022 · Updated 3 years ago
- A standalone CXL-enabled system simulator. ☆18 · Jan 10, 2026 · Updated last month
- An EDM-enabled PHY + a rack-level network simulator ☆14 · Dec 11, 2024 · Updated last year
- INFINEL: An efficient GPU-based processing method for unpredictable large output graph queries [PPoPP'24] ☆10 · Jan 15, 2024 · Updated 2 years ago
- ☆11 · Aug 4, 2020 · Updated 5 years ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE ☆17 · Aug 5, 2022 · Updated 3 years ago
- Generate custom Mac OS folder icons with a desired image as stamp ☆12 · Oct 3, 2023 · Updated 2 years ago
- CV and Deep Learning methods to analyze the data from Traffic Camera ☆13 · Sep 29, 2018 · Updated 7 years ago
- ☆10 · May 12, 2022 · Updated 3 years ago
- Time-based Sequence Model for Personalization and Recommendation Systems ☆49 · Aug 26, 2021 · Updated 4 years ago
- nv-one-logger enables tracking of GPU application progress over time and can help to identify overhead from workload and cluster ineffici… ☆22 · Nov 6, 2025 · Updated 3 months ago
- Parallel Approximate Nearest Neighbor Search ☆14 · Nov 12, 2022 · Updated 3 years ago
- Linear-Time Self Attention with Codeword Histogram for Efficient Recommendation ☆11 · Mar 23, 2021 · Updated 4 years ago
- Transactional memory (mostly Intel® TSX) experiments ☆14 · May 3, 2014 · Updated 11 years ago
- Drawing Comparison Figures in Scientific Research Papers, includes lines and bars. ☆11 · Mar 22, 2024 · Updated last year