Soappyooo / pointnet_cuda_evalLinks
UCAS国科大2024课程《GPU架构与编程》大作业1,编写pointnet的cuda推理程序。
☆11Updated 9 months ago
Alternatives and similar repositories for pointnet_cuda_eval
Users that are interested in pointnet_cuda_eval are comparing it to the libraries listed below
Sorting:
- 计算机体系结构研讨课 2020秋季 UCAS 《CPU设计实战》 工程环境及 RTL 代码合集☆18Updated 4 years ago
- 计算机体系结构研讨课 2020年秋季 UCAS 《CPU 设计实战》 Lab3-Lab9☆29Updated 4 years ago
- This is my hobby project with System Verilog to accelerate LeViT Network which contain CNN and Attention layer.☆22Updated last year
- 中国科学院大学高级计算机体系结构课程作业:使用OpenROAD-flow完成RTL到GDS全流程☆29Updated 5 years ago
- 关于移植模型至gemmini的文档☆29Updated 3 years ago
- 中国科学院大学-C语言编程-五子棋☆13Updated last year
- 一个单发射五级静态流水CPU,采用龙芯32位精简版指令集,支持异常和中断处理,使用AXI总线接口,已集成TLB模块☆15Updated 2 years ago
- This is a series of quick start guide of Vitis HLS tool in Chinese. It explains the basic concepts and the most important optimize techni…☆23Updated 2 years ago
- Model LLM inference on single-core dataflow accelerators☆14Updated last month
- Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…☆17Updated 5 months ago
- GPGPU-Sim 中文注释版代码,包含 GPGPU-Sim 模拟器的最新版代码,经过中文注释,以帮助中文用户更好地理解和使用该模拟器。☆23Updated 9 months ago
- ☆12Updated last year
- Accelerate multihead attention transformer model using HLS for FPGA☆12Updated last year
- 中国科学院大学2022秋季学期智能计算系统实验-陈云霁☆10Updated 2 years ago
- all kind of notes, I maybe sort this in the future☆13Updated 3 weeks ago
- Vision Transformer Accelerator implemented in Vivado HLS for Xilinx FPGAs.☆16Updated 8 months ago
- R2MDC FFT/IFFT processor adaptive to 64/128/256/512 point☆15Updated 2 months ago
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference☆62Updated 4 months ago
- A LoongArch pipeline CPU. Project of Computer Architecture Lab @UCAS.☆27Updated last year
- ☆10Updated 3 years ago
- A fork of Xiangshan for AI☆28Updated this week
- 我设计了一些数字集成电路的教学实验,供大家学习~☆29Updated 7 months ago
- 基于FPGA的FFT算法并行优化☆12Updated last year
- Open-source of MSD framework☆16Updated 2 years ago
- 中国科学院大学(UCAS)2020年春季学期计算机组成原理实验课作业☆16Updated 3 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆12Updated 3 years ago
- 基于Xilinx FPGA的通用型 CNN卷积神经网络加速器,本设计基于KV260板卡,MpSoC架构均可移植☆13Updated 9 months ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences☆29Updated last year
- ☆18Updated last year
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆51Updated last year