yhinai / TensorGPGPU
RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and generative AI with configurable precision (FP32/16/BF16/INT8).
☆19 · Updated 9 months ago
Alternatives and similar repositories for TensorGPGPU
Users who are interested in TensorGPGPU are comparing it to the repositories listed below.
- Open-source AI Accelerator Stack integrating compute, memory, and software, from RTL to PyTorch. ☆23 · Updated this week
- Linux on RISC-V on FPGA (LOROF): RV64GC Sv39 Quad-Core Superscalar Out-of-Order Virtual Memory CPU ☆15 · Updated this week
- The official NaplesPU hardware code repository ☆22 · Updated 6 years ago
- DUTH RISC-V Superscalar Microprocessor ☆33 · Updated last year
- ☆17 · Updated 3 months ago
- Open-source AMBA CHI infrastructures (supporting Issue B, E.b) ☆33 · Updated last week
- A DDR3 Controller that uses the Xilinx MIG-7 PHY to interface with DDR3 devices. ☆11 · Updated 4 years ago
- Chisel implementation of Neural Processing Unit for System on the Chip ☆26 · Updated 2 weeks ago
- ☆20 · Updated last month
- Cycle-accurate C++ & SystemC simulator for the RISC-V GPGPU Ventus ☆31 · Updated last month
- A small Neural Network Processor for Edge devices. ☆15 · Updated 3 years ago
- NPUsim: Full-Model, Cycle-Level, and Value-Aware Simulator for DNN Accelerators ☆46 · Updated last year
- A Heterogeneous GPU Platform for Chipyard SoC ☆42 · Updated this week
- matrix-coprocessor for RISC-V ☆29 · Updated last month
- ☆14 · Updated 11 months ago
- RTL code for AXI4 Interconnect (Verilog). Supports weighted round-robin arbitration, n-channel master, 4Kb splitting, reorder transaction… ☆23 · Updated 10 months ago
- ☆15 · Updated 3 years ago
- ☆15 · Updated 8 months ago
- ☆33 · Updated 2 months ago
- OpenExSys_NoC, a mesh-based network on chip IP. ☆20 · Updated 2 years ago
- Parametric floating-point unit with support for standard RISC-V formats and operations as well as transprecision formats. ☆18 · Updated 2 months ago
- A simple cycle accurate template model for ASIC/FPGA hardware design. Including a cycle accurate FIFO design example. More designs are co… ☆17 · Updated 6 years ago
- SystemVerilog overhaul of ESP L2 and LLC caches with directory based protocol ☆18 · Updated 11 months ago
- ☆11 · Updated 3 years ago
- 2-8bit weights, 8-bit activations flexible Neural Processing Engine for PULP clusters ☆28 · Updated 3 weeks ago
- Input / Output Physical Memory Protection Unit for RISC-V ☆15 · Updated 2 years ago
- RISCV-VP++ is an extended and improved successor of the RISC-V based Virtual Prototype (VP) RISC-V VP. It is maintained at the Institute f… ☆48 · Updated last month
- Matrix Accelerator Generator for GeMM Operations based on SIGMA Architecture in CHISEL HDL ☆15 · Updated last year
- The ParaNut Processor - Highly Parallel and More Than Just a CPU Core ☆36 · Updated 2 years ago
- CS533 Course Project (ongoing) - Exploring Parallel Architectures for Neural Processing Unit Implementations ☆20 · Updated 8 years ago