Scalable radix top-k selection on GPUs.
☆23Jan 27, 2025Updated last year
Alternatives and similar repositories for radik
Users that are interested in radik are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Aug 24, 2022Updated 3 years ago
- GPU-accelerated AES encryption project☆11Feb 13, 2015Updated 11 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Oct 15, 2018Updated 7 years ago
- Fast and highly tuned bit vector implementation including space efficient rank and select support having only 3.51% space overhead.☆34Apr 7, 2025Updated last year
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)☆11Aug 12, 2020Updated 5 years ago
- ☆14Nov 7, 2022Updated 3 years ago
- ☆13May 8, 2020Updated 5 years ago
- VASim is a virtual homogeneous non-deterministic finite automata automata simulator and transformation tool. VASim can parse, transform, …☆36May 17, 2024Updated last year
- ☆16Aug 20, 2020Updated 5 years ago
- Memory footprint reduction for transformer models☆11Jan 24, 2023Updated 3 years ago
- Simplify the communication with Unitree Robots.☆16Nov 29, 2025Updated 5 months ago
- MLPerf Mobile benchmarks☆15Updated this week
- Auto-differentiation library for C++☆12Jan 16, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Machine Learning solution for Kaggle.com's "Partly Sunny with a Chance of Hashtags"☆27Dec 6, 2013Updated 12 years ago
- this is a binpicking code base on PCL☆11May 25, 2018Updated 7 years ago
- 模型加速/模型压缩(已完成所有Lab)☆11Dec 24, 2023Updated 2 years ago
- Dataset tools for converting the InteriorNet dataset raw sequence data to a ROS bag.☆17Aug 24, 2020Updated 5 years ago
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- ☆50Jun 27, 2019Updated 6 years ago
- Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)☆12Jun 20, 2025Updated 10 months ago
- ☆23Apr 8, 2026Updated 3 weeks ago
- AutodiffEngine☆13Apr 1, 2019Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- New batched algorithm for sparse matrix-matrix multiplication (SpMM)☆16May 7, 2019Updated 6 years ago
- ☆10May 12, 2022Updated 3 years ago
- antkillerfarm's crazy magic☆17Oct 3, 2024Updated last year
- Simple library for manipulating strings using OpenFST☆12Sep 26, 2021Updated 4 years ago
- ☆10Mar 2, 2024Updated 2 years ago
- Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.☆57Oct 16, 2023Updated 2 years ago
- experimental python CFFI interface to NVIDIA's cuSOLVER and cuSPARSE libraries.☆13Jul 16, 2020Updated 5 years ago
- Artifact for OSDI'23: MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Mult…☆40Mar 17, 2024Updated 2 years ago
- A High performance and tiny TVM graph executor library written in C which can compile to WebAssembly and use CUDA/WebGPU as the accelerat…☆12Aug 3, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆19Mar 4, 2025Updated last year
- This GitHub repo contains the artifact for CPElide, which appears at MICRO '24☆15Sep 7, 2024Updated last year
- ☆13Oct 25, 2024Updated last year
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis☆17Jan 12, 2024Updated 2 years ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"☆52Mar 20, 2025Updated last year
- The only known (by 2022) open-source, easy-to-understand basic algorithm implementations in TD-CEM. (Please star and fork this project if…☆15Mar 1, 2022Updated 4 years ago
- ☆12Sep 11, 2020Updated 5 years ago