graphcore-research/out-of-the-box-fp8-training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/graphcore-research/out-of-the-box-fp8-training)

graphcore-research / out-of-the-box-fp8-training

Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.

☆46

Alternatives and similar repositories for out-of-the-box-fp8-training

Users that are interested in out-of-the-box-fp8-training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

graphcore-research / unit-scaling
View on GitHub
A library for unit scaling in PyTorch
☆134Jul 11, 2025Updated last year
kyegomez / MultiQueryAttention
View on GitHub
This is a simple torch implementation of the high performance Multi-Query Attention
☆16Aug 23, 2023Updated 2 years ago
f-dangel / vivit
View on GitHub
[TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…
☆17Jul 19, 2023Updated 3 years ago
kyegomez / SelfExtend
View on GitHub
Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta
☆13Nov 11, 2024Updated last year
huggingface / m4-logs
View on GitHub
M4 experiment logbook
☆59Aug 21, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
graphcore-research / gfloat
View on GitHub
Generic floating-point types in Python
☆19Apr 18, 2026Updated 3 months ago
asu-idi / ZNS-Cache
View on GitHub
[HotStorage '24] Can ZNS SSDs be Better Storage Devices for Persistent Cache?
☆13Jun 14, 2024Updated 2 years ago
KindXiaoming / physics_of_skill_learning
View on GitHub
We study toy models of skill learning.
☆34Feb 3, 2026Updated 5 months ago
kyegomez / MLXTransformer
View on GitHub
Simple Implementation of a Transformer in the new framework MLX by Apple
☆19Nov 18, 2024Updated last year
theophilusx / icsorg
View on GitHub
Import an ICS file into org to include events in your org agenda
☆18Mar 30, 2026Updated 3 months ago
kyegomez / Qwen-VL
View on GitHub
My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…
☆13Jan 29, 2024Updated 2 years ago
applicaai / CCpdf
View on GitHub
Index of URLs to pdf files all over the internet and scripts
☆25May 2, 2023Updated 3 years ago
mgmalek / efficient_cross_entropy
View on GitHub
☆124May 28, 2024Updated 2 years ago
Azure / MS-AMP
View on GitHub
Microsoft Automatic Mixed Precision Library
☆636Dec 1, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
AlxSp / t-jepa
View on GitHub
☆12Apr 26, 2024Updated 2 years ago
julian-8897 / hyperbolic-latent-vae
View on GitHub
Variational Autoencoder with non-euclidean (hyperbolic) latent space
☆14Nov 25, 2022Updated 3 years ago
ZhimingZhou / LGANs-for-reproduce
View on GitHub
This repo is for reproducing our results in “Lipschitz Generative Adversarial Nets”.
☆11Sep 26, 2020Updated 5 years ago
OpenBioML / project-proposal-template
View on GitHub
The project proposal template for OpenBioML community projects.
☆18Feb 9, 2023Updated 3 years ago
Harry-Chen / fp4_sm120
View on GitHub
Make FP4 on 5090 Great Again
☆17Updated this week
Deep-Learning-Profiling-Tools / triton-samples
View on GitHub
☆14Mar 8, 2025Updated last year
Tom-CaoZH / CXL-101
View on GitHub
Contain some materials about CXL.
☆20Feb 29, 2024Updated 2 years ago
secure-foundations / veribetrkv-osdi2020
View on GitHub
VeriBetrKV OSDI'20 artifact
☆13Sep 5, 2020Updated 5 years ago
PierrickPochelu / JaxDecompiler
View on GitHub
Jax Decompiler
☆16Apr 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
luohongyin / EntST
View on GitHub
Entailment self-training
☆27May 30, 2023Updated 3 years ago
kyegomez / HRTX
View on GitHub
Multi-Modal Multi-Embodied Hivemind-like Iteration of RTX-2
☆15Jun 27, 2025Updated last year
takahiro-hirofuchi / mesmeric-emulator
View on GitHub
MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies
☆10Oct 1, 2020Updated 5 years ago
graphcore-research / jax-scalify
View on GitHub
JAX Scalify: end-to-end scaled arithmetics
☆18Oct 30, 2024Updated last year
GallagherCommaJack / pdf2md
View on GitHub
☆16Dec 30, 2024Updated last year
LemuelKL / CIFAR10-HOG-PCA-SVM
View on GitHub
Classifier for CIFAR-10. Grayscaling, HOG, PCA, and RBF SVM. 62% test accuracy. Walkthrough on YouTube: https://youtu.be/gmTweV0eHhk
☆14Nov 24, 2024Updated last year
andrewargatkiny / dense-attention
View on GitHub
This is the repo for DenseAttention and DANet - fast and conceptually simple modification of standard attention and Transformer
☆20Apr 6, 2026Updated 3 months ago
XiangpengHao / VeryPM
View on GitHub
Persistent Memory Tool Box
☆12Mar 4, 2024Updated 2 years ago
graphcore-research / jax-experimental
View on GitHub
JAX for Graphcore IPU (experimental)
☆21Mar 12, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yasumasaonoe / ecbd
View on GitHub
☆11Apr 23, 2023Updated 3 years ago
vict0rsch / pytorch-fid-wrapper
View on GitHub
A simple wrapper around @mseitzer's great pytorch-fid work to compute Fréchet Inception Distance in-memory from batches of images, using …
☆17Nov 8, 2020Updated 5 years ago
smearle / autoverse
View on GitHub
Generative cellular automaton-like learning environments for RL.
☆20Jan 30, 2025Updated last year
NousResearch / StripedHyenaTrainer
View on GitHub
☆67Dec 8, 2023Updated 2 years ago
tlcui / cloth_simulation_demo
View on GitHub
☆14May 28, 2023Updated 3 years ago
Erkaman / erkaman.github.io
View on GitHub
The source code of my website.
☆14Dec 8, 2021Updated 4 years ago
kyegomez / dev-swarm
View on GitHub
A swarm of LLM agents that will help you test, document, and productionize your code!
☆19Updated this week