graphcore-research / gfloatLinks
Generic floating-point types in Python
☆12Updated 2 months ago
Alternatives and similar repositories for gfloat
Users that are interested in gfloat are comparing it to the libraries listed below
Sorting:
- The Riallto Open Source Project from AMD☆80Updated last month
- Floating-Point Optimized On-Device Learning Library for the PULP Platform.☆34Updated 3 weeks ago
- A library to train and deploy quantised Deep Neural Networks☆24Updated 5 months ago
- A Data-Centric Compiler for Machine Learning☆83Updated last year
- A python library for fractional fixed-point (base 2) arithmetic and binary manipulation with Numpy compatibility.☆192Updated last year
- Customized matrix multiplication kernels☆54Updated 3 years ago
- ☆19Updated this week
- ☆52Updated 9 months ago
- A Python-embedded DSL that makes it easy to write fast, scalable ML kernels with minimal boilerplate.☆153Updated this week
- Torch2Chip (MLSys, 2024)☆51Updated 2 months ago
- Model zoo for the Quantized ONNX (QONNX) model format☆12Updated last week
- Explore training for quantized models☆18Updated last week
- Adaptive floating-point based numerical format for resilient deep learning☆14Updated 3 years ago
- A high-efficiency system-on-chip for floating-point compute workloads.☆36Updated 4 months ago
- An AI accelerator implementation with Xilinx FPGA☆46Updated 4 months ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆149Updated last week
- ☆9Updated 2 years ago
- DNN Compiler for Heterogeneous SoCs☆39Updated this week
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆87Updated 2 years ago
- ☆100Updated this week
- A Deep Learning Framework for the Posit Number System☆28Updated 10 months ago
- A polyhedral compiler for hardware accelerators☆59Updated 10 months ago
- A lightweight, Pythonic, frontend for MLIR☆81Updated last year
- ☆16Updated 3 weeks ago
- General Matrix Multiplication using NVIDIA Tensor Cores☆17Updated 4 months ago
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Updated 2 years ago
- MLIR-based partitioning system☆87Updated this week
- SAMO: Streaming Architecture Mapping Optimisation☆33Updated last year
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆270Updated this week
- FPGA-based hardware acceleration for dropout-based Bayesian Neural Networks.☆24Updated last year