SNIG: Accelerated Large Sparse Neural Network Inference using Task Graph Parallelism
☆34Nov 12, 2021Updated 4 years ago
Alternatives and similar repositories for SNIG
Users that are interested in SNIG are comparing it to the libraries listed below
Sorting:
- Object-Oriented Programming☆12Aug 26, 2021Updated 4 years ago
- Advanced Programming for Computer Design Problems☆17Aug 28, 2021Updated 4 years ago
- A decentralized work-stealing scheduler that dynamically schedules fixed-priority tasks in a non-preemptive manner.☆19May 31, 2015Updated 10 years ago
- Heterogeneous Programming☆18Apr 24, 2023Updated 2 years ago
- Profiling Taskflow Programs through Visualization☆51Mar 14, 2023Updated 2 years ago
- Scheduling examples using C++20 coroutines☆29May 13, 2023Updated 2 years ago
- [ICCAD 22]DeePEB: A neural network based PEB solver☆11Feb 17, 2023Updated 3 years ago
- ☆14Feb 14, 2022Updated 4 years ago
- ☆14Apr 24, 2024Updated last year
- ☆14Aug 27, 2020Updated 5 years ago
- A CUDA-based multi-GPU vertex-centric graph processing framework based on Warp Segmentation and Vertex Refinement techniques.☆12Mar 20, 2017Updated 8 years ago
- repo with information useful for the course ME759 - High Performance Computing for Applications in Engineering☆28Feb 28, 2026Updated last week
- ☆44Jan 26, 2020Updated 6 years ago
- About repo with information useful for the Fall 2024 offering of ECE 759 - High Performance Computing for Applications in Engineering☆26Oct 18, 2024Updated last year
- Thread pool which supports c++20 coroutine. 一个支持c++20协程的线程池。☆23Mar 3, 2022Updated 4 years ago
- ☆25Dec 8, 2025Updated 3 months ago
- ☆21Apr 17, 2025Updated 10 months ago
- Tunnel is a Pipeline Execution Engine based on C++20 coroutine☆30Aug 17, 2023Updated 2 years ago
- GARDENIA: Graph Analytics Repository for Designing Efficient Next-generation Accelerators☆34Apr 3, 2022Updated 3 years ago
- autonomous driving contest reference kit☆10Dec 2, 2021Updated 4 years ago
- ☆32Jun 2, 2021Updated 4 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Sep 19, 2024Updated last year
- ☆33Jan 24, 2020Updated 6 years ago
- Languages, Tools, and Techniques for Accelerator Design☆33Nov 2, 2021Updated 4 years ago
- A High-performance Cluster Computing Engine☆148May 4, 2019Updated 6 years ago
- C++20协程net,基于epoll,可以方便地使用await语法☆28Sep 14, 2023Updated 2 years ago
- This is the repository containing the implementation of sparse dense matrix multiplication for the matrix dimension of 560 x 560.☆10Jul 7, 2021Updated 4 years ago
- An open multiple patterning framework☆83May 17, 2024Updated last year
- A hierarchical collective communications library with portable optimizations☆37Dec 8, 2024Updated last year
- ☆42Jun 13, 2025Updated 8 months ago
- VASim is a virtual homogeneous non-deterministic finite automata automata simulator and transformation tool. VASim can parse, transform, …☆36May 17, 2024Updated last year
- ☆43Nov 28, 2022Updated 3 years ago
- ☆12Jan 21, 2026Updated last month
- 算法动画演示、ACM基础算法、简单算法思想演示☆10Jul 22, 2020Updated 5 years ago
- Task graph-based asynchronous programming system using C++ coroutine☆99Feb 18, 2024Updated 2 years ago
- ☆10Mar 2, 2024Updated 2 years ago
- ☆12Feb 12, 2026Updated 3 weeks ago
- FbxPipeline (https://github.com/VladSerhiienko/FbxPipeline) accompaning project, WIP.☆12Jul 4, 2019Updated 6 years ago
- The first large scale formally verified reasoning dataset for Verilog☆20May 16, 2025Updated 9 months ago