ZhangJingrong / gpu_topK_benchmark
GPU TopK Benchmark
☆18 · Dec 19, 2024 · Updated last year
Alternatives and similar repositories for gpu_topK_benchmark
Users who are interested in gpu_topK_benchmark compare it to the repositories listed below.
- ☆15 · Dec 1, 2023 · Updated 2 years ago
- ☆13 · Oct 25, 2024 · Updated last year
- ☆19 · Nov 21, 2022 · Updated 3 years ago
- ☆50 · Jun 27, 2019 · Updated 6 years ago
- ngAP's artifact for ASPLOS'24 · ☆25 · Jul 29, 2025 · Updated 6 months ago
- FlashMob is a shared-memory random walk system. · ☆32 · Jul 7, 2023 · Updated 2 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs. · ☆32 · Jun 25, 2025 · Updated 7 months ago
- ☆32 · Oct 28, 2020 · Updated 5 years ago
- GPU-friendly Subgraph Isomorphism, published in ICDE 2020 · ☆37 · Jul 30, 2025 · Updated 6 months ago
- LonestarGPU: Irregular algorithms parallelized for GPUs · ☆38 · Nov 11, 2019 · Updated 6 years ago
- Image Filtering using CUDA · ☆30 · Mar 22, 2019 · Updated 6 years ago
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations" · ☆40 · Nov 16, 2021 · Updated 4 years ago
- ☆18 · Dec 4, 2025 · Updated 2 months ago
- A platform to evaluate techniques used in multicore graph processing. · ☆37 · Oct 25, 2018 · Updated 7 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores. · ☆91 · Nov 23, 2022 · Updated 3 years ago
- Graph Pattern Mining · ☆95 · Sep 20, 2024 · Updated last year
- Few-shot text classification with meta learning and BERT · ☆11 · Jun 14, 2021 · Updated 4 years ago
- SJTU minimalist Chinese LaTeX report template · ☆10 · Jun 7, 2021 · Updated 4 years ago
- An LLM inference engine, written in C++ · ☆18 · Feb 5, 2026 · Updated last week
- ☆23 · Dec 30, 2025 · Updated last month
- This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation · ☆10 · Nov 22, 2022 · Updated 3 years ago
- ☆13 · Dec 9, 2024 · Updated last year
- New version of pbbs benchmarks · ☆97 · Nov 25, 2025 · Updated 2 months ago
- Code for the SIGMOD 2018 programming contest. Finished at 2nd place. · ☆13 · Jun 6, 2018 · Updated 7 years ago
- ☆10 · May 12, 2022 · Updated 3 years ago
- This repo implements an interface to GTAV for SCENIC language. · ☆11 · Dec 7, 2019 · Updated 6 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation. · ☆10 · Jul 27, 2024 · Updated last year
- A Rust-based Unikernel Enhancing Reliability and Efficiency of Embedded Systems. · ☆11 · Jun 28, 2024 · Updated last year
- CSAPP3e Course Labs Files · ☆10 · Oct 9, 2020 · Updated 5 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework" · ☆11 · Oct 23, 2024 · Updated last year
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators · ☆10 · Sep 7, 2015 · Updated 10 years ago
- ☆15 · Jul 13, 2025 · Updated 7 months ago
- HCC Sample Applications · ☆13 · Jan 3, 2017 · Updated 9 years ago
- Source Code for Partial Interference · ☆10 · Dec 17, 2022 · Updated 3 years ago
- A selective knowledge distillation algorithm for efficient speculative decoders · ☆36 · Nov 27, 2025 · Updated 2 months ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert… · ☆14 · Feb 4, 2025 · Updated last year
- Cluster simulator with far memory · ☆12 · Apr 28, 2020 · Updated 5 years ago
- Graph accelerator on FPGAs and ASICs · ☆11 · Aug 16, 2018 · Updated 7 years ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step. · ☆15 · Apr 11, 2025 · Updated 10 months ago