graphcore-research / gfloat

Generic floating-point types in Python

☆13

Alternatives and similar repositories for gfloat

Users interested in gfloat are comparing it to the libraries listed below.

gau-nernst / quantized-training
Explore training for quantized models
☆20Updated this week
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆79Updated last year
lianakoleva / no-libtorch-compile
☆21Updated 4 months ago
google / drjax
☆13Updated 2 weeks ago
iree-org / iree-jax
☆52Updated 11 months ago
lucidrains / assoc-scan
Associative scan package for DRYing some code between repos
☆13Updated 2 months ago
alexzhang13 / Triton-Puzzles-Solutions
Personal solutions to the Triton Puzzles
☆19Updated last year
graphcore-research / out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆46Updated last year
FrancescoSaverioZuppichini / pytorch-2.0-benchmark
Benchmarking PyTorch 2.0 different models
☆21Updated 2 years ago
Jokeren / triton-samples
☆28Updated 6 months ago
facebookexperimental / protoquant
Prototype routines for GPU quantization written using PyTorch.
☆21Updated 5 months ago
spcl / daceml
A Data-Centric Compiler for Machine Learning
☆84Updated last year
johnryan465 / pscan
☆40Updated last year
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆19Updated 11 months ago
pytorch / tlparse
TORCH_LOGS parser for PT2
☆46Updated this week
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆26Updated 9 months ago
topal-team / rockmate
☆36Updated 7 months ago
brightlaboratory / polydl
☆12Updated 4 years ago
graphcore-research / unit-scaling-demo
Unit Scaling demo and experimentation code
☆16Updated last year
pytorch-labs / triton-cpu
An experimental CPU backend for Triton (https://github.com/openai/triton)
☆43Updated 4 months ago
koyeb / tenstorrent-examples
☆13Updated last month
groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆39Updated 2 months ago
sustcsonglin / gated_linear_attention_layer
☆32Updated last year
CLAIRE-Labo / StructuredFFN
The official code of "Building on Efficient Foundations: Effectively Training LLMs with Structured Feedforward Layers"
☆19Updated 11 months ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆44Updated 2 years ago
EleutherAI / rnngineering
Engineering the state of RNN language models (Mamba, RWKV, etc.)
☆32Updated last year
Chillee / lit-llama
Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama code
☆11Updated last year
jaewoosong / pocketnn
The official, proof-of-concept C++ implementation of PocketNN.
☆34Updated last year
ruslangrimov / mnist-minimal-model
Trying to find out what is the minimal model that can achieve 99% accuracy on MNIST dataset
☆25Updated 6 years ago
AMDResearch / Riallto
The Riallto Open Source Project from AMD
☆81Updated 3 months ago