dian-lun-lin / SNIG
SNIG: Accelerated Large Sparse Neural Network Inference using Task Graph Parallelism
☆34Nov 12, 2021Updated 4 years ago
Alternatives and similar repositories for SNIG
Users who are interested in SNIG are comparing it to the repositories listed below
- Object-Oriented Programming☆12Aug 26, 2021Updated 4 years ago
- Advanced Programming for Computer Design Problems☆17Aug 28, 2021Updated 4 years ago
- A decentralized work-stealing scheduler that dynamically schedules fixed-priority tasks in a non-preemptive manner.☆19May 31, 2015Updated 10 years ago
- Heterogeneous Programming☆17Apr 24, 2023Updated 2 years ago
- Profiling Taskflow Programs through Visualization☆51Mar 14, 2023Updated 2 years ago
- Scheduling examples using C++20 coroutines☆29May 13, 2023Updated 2 years ago
- [ICCAD 22] DeePEB: A neural network based PEB solver☆11Feb 17, 2023Updated 2 years ago
- ☆14Feb 14, 2022Updated 4 years ago
- ☆14Aug 27, 2020Updated 5 years ago
- A CUDA-based multi-GPU vertex-centric graph processing framework based on Warp Segmentation and Vertex Refinement techniques.☆12Mar 20, 2017Updated 8 years ago
- repo with information useful for the course ME759 - High Performance Computing for Applications in Engineering☆27Jan 22, 2026Updated 3 weeks ago
- C++ Workflow with kubernetes automated deployment.☆31Aug 13, 2021Updated 4 years ago
- ☆44Jan 26, 2020Updated 6 years ago
- repo with information useful for the Fall 2024 offering of ECE 759 - High Performance Computing for Applications in Engineering☆26Oct 18, 2024Updated last year
- A thread pool that supports C++20 coroutines.☆23Mar 3, 2022Updated 3 years ago
- ☆21Apr 17, 2025Updated 9 months ago
- ☆26Dec 8, 2025Updated 2 months ago
- Tunnel is a Pipeline Execution Engine based on C++20 coroutine☆30Aug 17, 2023Updated 2 years ago
- GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators☆34Apr 3, 2022Updated 3 years ago
- ☆31Jun 2, 2021Updated 4 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Sep 19, 2024Updated last year
- autonomous driving contest reference kit☆10Dec 2, 2021Updated 4 years ago
- Languages, Tools, and Techniques for Accelerator Design☆33Nov 2, 2021Updated 4 years ago
- ☆33Jan 24, 2020Updated 6 years ago
- A High-performance Cluster Computing Engine☆148May 4, 2019Updated 6 years ago
- C++20 coroutine net, based on epoll, with convenient use of the await syntax☆28Sep 14, 2023Updated 2 years ago
- This is the repository containing the implementation of sparse dense matrix multiplication for the matrix dimension of 560 x 560.☆10Jul 7, 2021Updated 4 years ago
- Documentation project☆12Jan 22, 2026Updated 3 weeks ago
- An open multiple patterning framework☆82May 17, 2024Updated last year
- A hierarchical collective communications library with portable optimizations☆37Dec 8, 2024Updated last year
- VASim is a virtual homogeneous non-deterministic finite automata simulator and transformation tool. VASim can parse, transform, …☆36May 17, 2024Updated last year
- ☆42Jun 13, 2025Updated 8 months ago
- ☆43Nov 28, 2022Updated 3 years ago
- ☆12Jan 21, 2026Updated 3 weeks ago
- Algorithm animation demos, basic ACM algorithms, and demonstrations of simple algorithmic ideas☆10Jul 22, 2020Updated 5 years ago
- TBD☆12Updated this week
- Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.☆91Nov 23, 2022Updated 3 years ago
- Task graph-based asynchronous programming system using C++ coroutine☆100Feb 18, 2024Updated last year
- ☆12Apr 12, 2023Updated 2 years ago