Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator
☆216Dec 10, 2023Updated 2 years ago
Alternatives and similar repositories for halutmatmul
Users that are interested in halutmatmul are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Jul 24, 2023Updated 2 years ago
- 10x faster matrix and vector operations☆2,517Oct 12, 2022Updated 3 years ago
- Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024☆186Apr 16, 2024Updated 2 years ago
- Algebraic enhancements for GEMM & AI accelerators☆292Feb 28, 2025Updated last year
- BARVINN: A Barrel RISC-V Neural Network Accelerator: https://barvinn.readthedocs.io/en/latest/☆96Jan 5, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Submission template for Tiny Tapeout 04☆17Jun 15, 2024Updated last year
- ☆32Mar 31, 2025Updated last year
- Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".☆280Nov 3, 2023Updated 2 years ago
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Aug 17, 2022Updated 3 years ago
- A configurable RTL to bitstream FPGA toolchain☆60Apr 24, 2026Updated 2 weeks ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Feb 3, 2025Updated last year
- Rust-based Scheme Compiler, written in the Nanopass style☆12Jun 12, 2018Updated 7 years ago
- C++ library for graph ordering☆15Mar 20, 2020Updated 6 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Dec 3, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PolyMage is a domain-specific language and optimizing code generator for auto-parallelisation☆14Jul 15, 2016Updated 9 years ago
- Minimax: a Compressed-First, Microcoded RISC-V CPU☆225Feb 19, 2026Updated 2 months ago
- Package manager and build abstraction tool for FPGA/ASIC development☆1,413Feb 13, 2026Updated 2 months ago
- Heterogeneous Accelerated Computed Cluster (HACC) Resources Page☆22Apr 27, 2026Updated last week
- A simple example of VAEs with KANs☆12May 17, 2024Updated last year
- Code for the paper "Faster Neural Network Training with Approximate Tensor Operations"☆10Oct 23, 2021Updated 4 years ago
- Successfully training approximations to full-rank matrices for efficiency in deep learning.☆16Jan 5, 2021Updated 5 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…☆55Feb 9, 2024Updated 2 years ago
- A Rust library to deconstruct DNS SPF records☆17Apr 20, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Rust-like traits implementation in C++☆20Nov 8, 2023Updated 2 years ago
- Low Precision Arithmetic Simulation in PyTorch☆290May 20, 2024Updated last year
- ☆45Jul 14, 2021Updated 4 years ago
- Train and deploy LUT-based neural networks on FPGAs☆112Jun 12, 2024Updated last year
- A safe and efficient target language for functional compilers☆20May 5, 2018Updated 8 years ago
- This is mainly a simulation library of xilinx primitives that are verilator compatible.☆34Jul 15, 2024Updated last year
- Fun with wgpu: Simulating slime mold☆24Aug 22, 2024Updated last year
- CVA6 softcore contest☆24Apr 17, 2026Updated 3 weeks ago
- Object-Oriented Programming☆12Aug 26, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆11Jun 29, 2021Updated 4 years ago
- PB-LLM: Partially Binarized Large Language Models☆155Nov 20, 2023Updated 2 years ago
- ☆103Mar 5, 2026Updated 2 months ago
- A tracing JIT compiler for PyTorch☆14Dec 11, 2021Updated 4 years ago
- Linear algebra accelerators for RISC-V (published in ICCD 17)☆66Oct 5, 2017Updated 8 years ago
- muSYCL, the SYCL musical!☆13Aug 25, 2024Updated last year
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆50Feb 26, 2025Updated last year