GreenWaves-Technologies / bfloat16Links
bfloat16 dtype for numpy
☆20Updated 2 years ago
Alternatives and similar repositories for bfloat16
Users that are interested in bfloat16 are comparing it to the libraries listed below
Sorting:
- ☆168Updated 2 years ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆168Updated last week
- ☆159Updated 2 years ago
- Sandbox for TVM and playing around!☆22Updated 3 years ago
- Customized matrix multiplication kernels☆57Updated 3 years ago
- ☆68Updated 2 years ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆26Updated 3 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators☆88Updated 2 years ago
- Fast sparse deep learning on CPUs☆56Updated 3 years ago
- A Python library transfers PyTorch tensors between CPU and NVMe☆123Updated last year
- torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…☆182Updated this week
- Prototype routines for GPU quantization written using PyTorch.☆21Updated 4 months ago
- An experimental CPU backend for Triton (https//github.com/openai/triton)☆47Updated 4 months ago
- This project contains a code generator that produces static C NN inference deployment code targeting tiny micro-controllers (TinyML) as r…☆29Updated 4 years ago
- ☆71Updated 8 months ago
- A stand-alone implementation of several NumPy dtype extensions used in machine learning.☆320Updated this week
- A tiny FP8 multiplication unit written in Verilog. TinyTapeout 2 submission.☆14Updated 3 years ago
- Tool for the deployment and analysis of TinyML applications on TFLM and MicroTVM backends☆33Updated this week
- The official, proof-of-concept C++ implementation of PocketNN.☆35Updated 2 months ago
- ☆39Updated last year
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- Parse TFLite models (*.tflite) EASILY with Python. Check the API at https://zhenhuaw.me/tflite/docs/☆103Updated 10 months ago
- A Deep Learning Framework for the Posit Number System☆31Updated last year
- A lightweight, Pythonic, frontend for MLIR☆80Updated 2 years ago
- ☆33Updated 2 years ago
- Benchmark scripts for TVM☆74Updated 3 years ago
- A collection of research papers on efficient training of DNNs☆70Updated 3 years ago
- A Data-Centric Compiler for Machine Learning☆85Updated last week
- Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset☆28Updated 7 years ago
- An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).☆276Updated 5 months ago