spectre900 / Parallel-Strassen-Algorithm
Parallelizing Strassen’s matrix multiplication using OpenMP, MPI and CUDA.
☆15 · Updated 3 years ago
Alternatives and similar repositories for Parallel-Strassen-Algorithm:
Users who are interested in Parallel-Strassen-Algorithm are comparing it to the repositories listed below.
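- Coursework for the Advanced Computer Architecture course at the University of Chinese Academy of Sciences: a complete RTL-to-GDS flow using OpenROAD-flow ☆26 · Updated 4 years ago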
- SpV8 is a SpMV kernel written in AVX-512. Artifact for our SpV8 paper @ DAC '21. ☆29 · Updated 4 years ago
- A Homework for Computer Architecture at SJTU ☆14 · Updated 5 years ago
- The official MIPS source code released by Loongson, together with a version whose file structure I reorganized ☆14 · Updated 5 years ago
- ☆16 · Updated 3 years ago
- Our repository for NSCSCC ☆19 · Updated 2 months ago
- ☆30 · Updated 2 years ago
- Files accompanying the book 《自己动手写CPU》 ("Write Your Own CPU") ☆80 · Updated 7 years ago
- National Student Computer System Capability Challenge ☆9 · Updated 6 years ago
- ☆14 · Updated 3 years ago
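- University of Chinese Academy of Sciences, Principles of Computer Organization FPGA lab course - "Five projects to better understand key principles of computer systems", UCAS Spring 2017 Session ☆32 · Updated 7 years ago
- Sun Yat-sen University 2020 Parallel and Distributed Computing coursework ☆21 · Updated 4 years ago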
- Case studies constitute a modern interdisciplinary and valuable teaching practice which plays a critical and fundamental role in the deve… ☆13 · Updated 6 years ago
- ☆14 · Updated 5 years ago
- RISC-V Proxy Kernel for Education ☆29 · Updated last year
- Learning materials for Stanford CS149: Parallel Computing ☆218 · Updated 3 years ago
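- Advanced Computer Architecture 2020, taught by Wu Junmin (吴俊敏), a graduate course at the University of Science and Technology of China (USTC) ☆67 · Updated last year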
- Code base and slides for ECE408: Applied Parallel Programming on GPU. ☆122 · Updated 3 years ago
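- [Original work, since included in the official textbook] Three-level storage subsystem (SD + DDR2 SDRAM + Cache), based on the Nexys4 FPGA board. Course project for Computer System Architecture at Tongji University. ☆111 · Updated 4 years ago
- University of Chinese Academy of Sciences (UCAS) 2020 course materials ☆29 · Updated 4 years ago
- Operating Systems 2019 ucore labs ☆46 · Updated 5 years ago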
- A MIPS CPU with dual-issue, out-of-order, and 5-stage pipelines ☆11 · Updated 5 years ago
- Chongqing University 2020 NSCSCC ☆28 · Updated 4 years ago
- Solutions and analyses for past CCF CSP problems (MkDocs version) ☆17 · Updated 3 years ago
- Computer System Project for Loongson FPGA Board in 2017 ☆52 · Updated 6 years ago
- PKU computer organization and architecture RISC-V Simulator LAB ☆35 · Updated 6 years ago
- Implementation of parallel Breadth First Algorithm for graph traversal using CUDA and C++ language. ☆33 · Updated 5 years ago
- A simple SAT solver that implements the DPLL algorithm with unit resolution ☆45 · Updated 5 years ago
- A toy compiler written in C++17 that translates SysY (a C-like toy language) into ARM-v7a assembly. ☆138 · Updated 3 years ago
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure. ☆19 · Updated 11 months ago