GPU TopK Benchmark
☆18Dec 19, 2024Updated last year
Alternatives and similar repositories for gpu_topK_benchmark
Users that are interested in gpu_topK_benchmark are comparing it to the libraries listed below
Sorting:
- ☆15Dec 1, 2023Updated 2 years ago
- ☆12Dec 17, 2023Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- ☆19Nov 21, 2022Updated 3 years ago
- ☆50Jun 27, 2019Updated 6 years ago
- ngAP's artifact for ASPLOS'24☆25Jul 29, 2025Updated 7 months ago
- FlashMob is a shared-memory random walk system.☆32Jul 7, 2023Updated 2 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆32Jun 25, 2025Updated 8 months ago
- GPU-friendly Subgraph Isomorphism, published in ICDE 2020☆37Jul 30, 2025Updated 7 months ago
- ☆33Oct 28, 2020Updated 5 years ago
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Nov 11, 2019Updated 6 years ago
- ☆10Dec 29, 2015Updated 10 years ago
- A platform to evaluate techniques used in multicore graph processing.☆37Oct 25, 2018Updated 7 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆91Nov 23, 2022Updated 3 years ago
- ☆13Dec 9, 2024Updated last year
- ☆10Mar 2, 2024Updated 2 years ago
- A Rust-based Unikernel Enhancing Reliability and Efficiency of Embedded Systems.☆11Jun 28, 2024Updated last year
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆10Jul 27, 2024Updated last year
- ☆10May 12, 2022Updated 3 years ago
- New version of pbbs benchmarks☆97Nov 25, 2025Updated 3 months ago
- SJTU 中文简约 LaTeX 报告模板☆10Jun 7, 2021Updated 4 years ago
- Code for the SIGMOD 2018 programming contest. Finished at 2nd place.☆13Jun 6, 2018Updated 7 years ago
- CenterNet3D 部署版本,便于移植不同平台(onnx、tensorRT、rknn、Horizon)。☆13May 24, 2024Updated last year
- ☆14Oct 9, 2022Updated 3 years ago
- PyTorch implementation of delayed-feedback-model (DFM)☆15Feb 7, 2022Updated 4 years ago
- ✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.☆14Nov 28, 2021Updated 4 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert …☆15Feb 4, 2025Updated last year
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- ☆15Jul 13, 2025Updated 7 months ago
- Cluster simulator with far memory☆12Apr 28, 2020Updated 5 years ago
- Source Code for Partial Interference☆10Dec 17, 2022Updated 3 years ago
- CSAPP3e Course Labs Files☆10Oct 9, 2020Updated 5 years ago
- A selective knowledge distillation algorithm for efficient speculative decoders☆36Nov 27, 2025Updated 3 months ago
- ☆48Jan 30, 2026Updated last month
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- A flexible implementation of enhanced suffix arrays in template based C++. Supports single and multi-position wildcard. Fast queries than…☆21Oct 1, 2020Updated 5 years ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step.☆15Apr 11, 2025Updated 10 months ago
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated 10 months ago