GPU TopK Benchmark
☆18Dec 19, 2024Updated last year
Alternatives and similar repositories for gpu_topK_benchmark
Users that are interested in gpu_topK_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Dec 1, 2023Updated 2 years ago
- ☆12Dec 17, 2023Updated 2 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆33Jun 25, 2025Updated 11 months ago
- ☆10May 12, 2022Updated 4 years ago
- A Framework for Graph Sampling and Random Walk on GPUs.☆38Feb 3, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ngAP's artifact for ASPLOS'24☆25Jul 29, 2025Updated 9 months ago
- ☆19Nov 21, 2022Updated 3 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- This repository contains code for the paper RMM: A Recursive Mental Model for Dialog Navigation☆10Nov 22, 2022Updated 3 years ago
- ☆50Jun 27, 2019Updated 6 years ago
- FlashMob is a shared-memory random walk system.☆33Jul 7, 2023Updated 2 years ago
- GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators☆34Apr 3, 2022Updated 4 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- CSAPP3e Course Labs Files☆10Oct 9, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICML‘25] Official code for paper "Occult: Optimizing Collaborative Communication across Experts for Accelerated Parallel MoE Training an…☆13Apr 17, 2025Updated last year
- LonestarGPU: Irregular algorithms parallelized for GPUs☆38Nov 11, 2019Updated 6 years ago
- ☆15Oct 9, 2022Updated 3 years ago
- ☆13Dec 9, 2024Updated last year
- Cluster simulator with far memory☆12Apr 28, 2020Updated 6 years ago
- GPU-friendly Subgraph Isomorphism, published in ICDE 2020☆37Jul 30, 2025Updated 9 months ago
- A platform to evaluate techniques used in multicore graph processing.☆37Oct 25, 2018Updated 7 years ago
- parallel algorithm based on cuda☆60Nov 27, 2017Updated 8 years ago
- The official repository for the experiments included in the paper titled "Patch-level Routing in Mixture-of-Experts is Provably Sample-ef…☆14Feb 12, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"☆42Nov 16, 2021Updated 4 years ago
- SJTU 中文简约 LaTeX 报告模板☆10Jun 7, 2021Updated 4 years ago
- New version of pbbs benchmarks☆97Nov 25, 2025Updated 6 months ago
- Estimate depth from surface normal.☆12Aug 14, 2020Updated 5 years ago
- Graph accelerator on FPGAs and ASICs☆11Aug 16, 2018Updated 7 years ago
- GPU-Accelerated Faster Decoding of Integer Lists☆13Aug 20, 2019Updated 6 years ago
- An LLM inference engine, written in C++☆19Mar 30, 2026Updated last month
- MAFIA: Multiple Application Framework for GPU architectures☆28Jan 21, 2022Updated 4 years ago
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy"☆15Mar 6, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Few-shot text classification with meta learning and BERT☆11Jun 14, 2021Updated 4 years ago
- ☆14Apr 24, 2024Updated 2 years ago
- PyTorch implementation of "Vision-Dialog Navigation by Exploring Cross-modal Memory", CVPR 2020.☆19Nov 22, 2022Updated 3 years ago
- Image Filtering using CUDA☆30Mar 22, 2019Updated 7 years ago
- Compute applications.☆25Dec 12, 2019Updated 6 years ago
- Created a simple neural network using C++17 standard and the Eigen library that supports both forward and backward propagation.☆11Jul 27, 2024Updated last year
- ✂️ EyeLipCropper is a Python tool to crop eyes and mouth ROIs of the given video.☆14Nov 28, 2021Updated 4 years ago