Official Implementation of "RTop-K: Ultra-Fast Row-Wise Top-K Selection for Neural Network Acceleration on GPUs"
☆28Jul 23, 2025Updated 9 months ago
Alternatives and similar repositories for RTopK
Users that are interested in RTopK are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Aug 22, 2023Updated 2 years ago
- Official Implementation of "LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference"☆25Nov 12, 2023Updated 2 years ago
- Automatic ReLU Reduction☆15Dec 20, 2023Updated 2 years ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"☆52Mar 20, 2025Updated last year
- Sparse Backpropagation for Mixture-of-Expert Training☆30Jul 2, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An HBM FPGA based SpMV Accelerator☆18Aug 29, 2024Updated last year
- ☆48Jan 3, 2026Updated 4 months ago
- Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k c…☆27Dec 10, 2022Updated 3 years ago
- Fun project to run your own LLM chat bot using llama.cpp☆11Jun 9, 2023Updated 2 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Sep 7, 2015Updated 10 years ago
- The official implementation of "Optimal Stochastic Trace Estimation in Generative Modeling (AISTATS 2025)"☆20Mar 2, 2025Updated last year
- A toy Inspect implementation of the Bliss Attractor eval from Claude 4 System Card Welfare Assessment☆38Jun 5, 2025Updated 11 months ago
- ☆166May 1, 2026Updated 2 weeks ago
- Matlab mex wrappers to cuSPARSE (NVIDIA)☆11Dec 10, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆12Aug 26, 2025Updated 8 months ago
- Debate interface, experiments, etc.☆10Mar 12, 2024Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- A synthetic graph generator on spark for the LDBC Financial Benchmark, featured as temporal graph☆14Apr 12, 2026Updated last month
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- A tiny easily hackable implementation of a feature dashboard.☆16Oct 21, 2025Updated 6 months ago
- Connect a Ublox NEO-6M/NE0-M8N gps module to a WiPy2.0/3.0☆10Apr 29, 2018Updated 8 years ago
- A test library for computing modular exponentiation in parallel using AVX-512 vector arithmetic☆12Dec 18, 2023Updated 2 years ago
- A repo for learning how to parallelize computations in the GPU using Apple's Metal, in Rust.☆10Mar 17, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Compare Bloxroute and Fiber transaction streams☆10Nov 22, 2024Updated last year
- Tendermint implementation of the blockchain of Aleo verifiable computing model built by LambdaClass☆15Feb 8, 2023Updated 3 years ago
- CDLS: Proving Knowledge of Committed Discrete Logarithms with Soundness☆11Apr 30, 2026Updated 2 weeks ago
- The entry point for Rust projects to be run on Valida☆10Mar 14, 2025Updated last year
- Mapping out the "memory" of neural nets with data attribution☆57May 9, 2026Updated last week
- Scalable radix top-k selection on GPUs.☆23Jan 27, 2025Updated last year
- A Benchmark for Multi-Stage Legal Case Documents Generation☆17Feb 24, 2025Updated last year
- ☆14Mar 1, 2021Updated 5 years ago
- Gamora: Graph Learning based Symbolic Reasoning for Large-Scale Boolean Networks (DAC'23)☆58Jan 8, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Pytorch-based tools for constructing a vocabulary of visual concepts in a GAN.☆17Feb 25, 2022Updated 4 years ago
- [MLSys 2022] "BNS-GCN: Efficient Full-Graph Training of Graph Convolutional Networks with Partition-Parallelism and Random Boundary Node …☆56Oct 6, 2023Updated 2 years ago
- ☆12Jun 5, 2025Updated 11 months ago
- 🦎 Prototypes on polymorphic, metamorphic and poly-metamorphic malwares in Rust 🦎☆14Oct 8, 2023Updated 2 years ago
- Some utility functions to help myself (and perhaps others) go faster with ML/AI work☆49Updated this week
- Bayesian multiple logistic regression for GWAS meta-analysis☆17Aug 20, 2025Updated 8 months ago
- Composable numerical solvers for unconstrained and simple-bounds constrained convex optimization problems in Rust. WASM compatible☆15Jul 10, 2025Updated 10 months ago