Official Implementation of "RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs"
☆28 · Jul 23, 2025 · Updated 10 months ago
Alternatives and similar repositories for RTopK
Users that are interested in RTopK are comparing it to the libraries listed below.
- ☆12 · Aug 22, 2023 · Updated 2 years ago
- Official Implementation of "LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference" ☆25 · Nov 12, 2023 · Updated 2 years ago
- Automatic ReLU Reduction ☆15 · Dec 20, 2023 · Updated 2 years ago
- Sparse Backpropagation for Mixture-of-Expert Training ☆30 · Jul 2, 2024 · Updated last year
- An HBM FPGA based SpMV Accelerator ☆18 · Aug 29, 2024 · Updated last year
- ☆12 · Apr 27, 2013 · Updated 13 years ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c… ☆27 · Dec 10, 2022 · Updated 3 years ago
- Distributed SDDMM Kernel ☆12 · Jul 8, 2022 · Updated 3 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators ☆10 · Sep 7, 2015 · Updated 10 years ago
- [NeurIPS 2025 MechInterp Workshop - Spotlight] Official implementation of the paper "RelP: Faithful and Efficient Circuit Discovery in La… ☆29 · Nov 3, 2025 · Updated 7 months ago
- Matlab mex wrappers to cuSPARSE (NVIDIA) ☆11 · Dec 10, 2025 · Updated 5 months ago
- ☆12 · Aug 26, 2025 · Updated 9 months ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces ☆12 · Apr 19, 2023 · Updated 3 years ago
- A synthetic graph generator on spark for the LDBC Financial Benchmark, featured as temporal graph ☆14 · Apr 12, 2026 · Updated last month
- Mamba support for transformer lens ☆20 · Sep 17, 2024 · Updated last year
- MLPerf Mobile benchmarks ☆15 · Apr 28, 2026 · Updated last month
- The official implementation of the paper "MLP Memory: A Retriever-Pretrained Memory for Large Language Models". (ICLR 2026) ☆66 · Jan 28, 2026 · Updated 4 months ago
- Proof of concept implementation of Sigmabus https://eprint.iacr.org/2023/1406 ☆10 · Dec 20, 2023 · Updated 2 years ago
- A tiny easily hackable implementation of a feature dashboard. ☆16 · Oct 21, 2025 · Updated 7 months ago
- Connect a Ublox NEO-6M/NE0-M8N gps module to a WiPy2.0/3.0 ☆10 · Apr 29, 2018 · Updated 8 years ago
- Compare Bloxroute and Fiber transaction streams ☆10 · Nov 22, 2024 · Updated last year
- Tendermint implementation of the blockchain of Aleo verifiable computing model built by LambdaClass ☆15 · Feb 8, 2023 · Updated 3 years ago
- CDLS: Proving Knowledge of Committed Discrete Logarithms with Soundness ☆12 · Apr 30, 2026 · Updated last month
- This is the official repository for "Explanatory Learning: Beyond Empiricism in Neural Networks". ☆15 · May 17, 2022 · Updated 4 years ago
- Minimum Description Length probing for neural network representations ☆20 · Jan 28, 2025 · Updated last year
- ☆24 · Jan 28, 2025 · Updated last year
- The entry point for Rust projects to be run on Valida ☆10 · Mar 14, 2025 · Updated last year
- Mapping out the "memory" of neural nets with data attribution ☆58 · Updated this week
- A Benchmark for Multi-Stage Legal Case Documents Generation ☆19 · Feb 24, 2025 · Updated last year
- ☆14 · Mar 1, 2021 · Updated 5 years ago
- Gamora: Graph Learning based Symbolic Reasoning for Large-Scale Boolean Networks (DAC'23) ☆58 · Jan 8, 2025 · Updated last year
- [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node … ☆56 · Oct 6, 2023 · Updated 2 years ago
- ☆12 · Jun 5, 2025 · Updated last year
- 🦎 Prototypes on polymorphic, metamorphic and poly-metamorphic malwares in Rust 🦎 ☆14 · Oct 8, 2023 · Updated 2 years ago
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores. ☆92 · Nov 23, 2022 · Updated 3 years ago
- New batched algorithm for sparse matrix-matrix multiplication (SpMM) ☆16 · May 7, 2019 · Updated 7 years ago
- Composable numerical solvers for unconstrained and simple-bounds constrained convex optimization problems in Rust. WASM compatible ☆16 · Jul 10, 2025 · Updated 10 months ago
- ☆12 · Oct 4, 2023 · Updated 2 years ago
- ☆12 · Sep 11, 2024 · Updated last year