☆14Apr 24, 2024Updated 2 years ago
Alternatives and similar repositories for fasten
Users that are interested in fasten are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Feb 26, 2020Updated 6 years ago
- ☆42Dec 19, 2025Updated 5 months ago
- ☆12Oct 25, 2022Updated 3 years ago
- A pattern-based algorithmic autotuner for graph processing on GPUs.☆33Jun 25, 2025Updated 10 months ago
- Spack package repository maintained by Student Cluster Competition Team @ Sun Yat-sen University.☆16Aug 20, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A sparse BLAS lib supporting multiple backends☆51Mar 18, 2026Updated 2 months ago
- GenDP: A Dynamic Programming Framework for Genome Sequencing Analysis☆17Jan 12, 2024Updated 2 years ago
- ☆21Aug 21, 2023Updated 2 years ago
- An efficient concurrent graph processing system☆46Oct 27, 2021Updated 4 years ago
- VASim is a virtual homogeneous non-deterministic finite automata automata simulator and transformation tool. VASim can parse, transform, …☆36May 17, 2024Updated 2 years ago
- ☆19Nov 21, 2022Updated 3 years ago
- ANT-ACE: Advanced Compiler Ecosystem for Fully Homomorphic Encryption and Domain Specific Computing☆57May 6, 2026Updated 2 weeks ago
- Automata Benchmark Suite☆23Oct 23, 2023Updated 2 years ago
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Graphiler is a compiler stack built on top of DGL and TorchScript which compiles GNNs defined using user-defined functions (UDFs) into ef…☆59Oct 3, 2022Updated 3 years ago
- Race detector for NVIDIA GPUs, published in SOSP 2021.☆19Feb 22, 2025Updated last year
- Horizontal Fusion☆24Jan 7, 2022Updated 4 years ago
- SNIG: Accelerated Large Sparse Neural Network Inference using Task Graph Parallelism☆34Nov 12, 2021Updated 4 years ago
- CUDAAdvisor: a GPU profiling tool☆53Aug 24, 2018Updated 7 years ago
- Source code for the evaluated benchmarks and proposed cache management technique, GRASP, in [Faldu et al., HPCA'20].☆18Jan 23, 2020Updated 6 years ago
- JPStream: JSONPath Stream Processing in Parallel☆25Nov 15, 2022Updated 3 years ago
- ☆23Oct 31, 2023Updated 2 years ago
- Falconn++ is a locality-sensitive filtering (LSF) approach, built on top of cross-polytope LSH (FalconnLib) to answer approximate nearest…☆13Aug 5, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FLOWMATRIX: GPU-Assisted Information-Flow Analysis through Matrix-Based Representation, USENIX Security'22☆28Apr 17, 2023Updated 3 years ago
- Automatic differentiation for Triton Kernels☆29Aug 12, 2025Updated 9 months ago
- Solutions to introductory distributed computing exercises☆14Apr 9, 2023Updated 3 years ago
- ☆105May 31, 2025Updated 11 months ago
- A framework for pipelined computing on GPU☆30Jul 17, 2019Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆31Jul 7, 2020Updated 5 years ago
- Download/merge conference papers, given a URL of conference webpage☆17Sep 27, 2021Updated 4 years ago
- Parallel Stable Sort☆15Oct 11, 2015Updated 10 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66May 11, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This code helps to retrieve all papers from conferences and rank them by the number of (Google Scholar) citations.☆12Dec 12, 2021Updated 4 years ago
- RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures☆12Jun 5, 2024Updated last year
- FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swa…☆39Oct 5, 2025Updated 7 months ago
- ☆12Aug 17, 2022Updated 3 years ago
- cuJSON: A Highly Parallel JSON Parser for GPUs☆46Dec 12, 2025Updated 5 months ago
- Supplemental materials for The ASPLOS 2025 / EuroSys 2025 Contest on Intra-Operator Parallelism for Distributed Deep Learning☆25May 12, 2025Updated last year
- ☆33Sep 9, 2020Updated 5 years ago