AI-Hypercomputer / ray-tpuLinks

☆15

Alternatives and similar repositories for ray-tpu

Users that are interested in ray-tpu are comparing it to the libraries listed below

Sorting:

AI-Hypercomputer / torchprime
torchprime is a reference model implementation for PyTorch on TPU.
☆41Updated last month
AI-Hypercomputer / kithara
☆16Updated 6 months ago
cat-state / tinypar
☆20Updated 2 years ago
yixiaoer / tpu-training-example
☆16Updated last year
erogol / BlaGPT
Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…
☆87Updated 2 weeks ago
mgmalek / efficient_cross_entropy
☆121Updated last year
lianakoleva / no-libtorch-compile
☆21Updated 8 months ago
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆54Updated 2 months ago
young-geng / mlxu
Machine Learning eXperiment Utilities
☆46Updated 4 months ago
npuichigo / blazing-fast-io-tutorial
Blazing fast data loading with HuggingFace Dataset and Ray Data
☆16Updated last year
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆87Updated 3 years ago
lessw2020 / transformer_central
Various transformers for FSDP research
☆38Updated 3 years ago
drisspg / transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
☆62Updated last week
sholtodouglas / scalingExperiments
☆62Updated 3 years ago
warner-benjamin / optimi
Fast, Modern, and Low Precision PyTorch Optimizers
☆116Updated 2 months ago
frankxwang / dpo-prefix-sharing
DPO, but faster 🚀
☆46Updated 11 months ago
proger / hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
☆89Updated last year
NX-AI / mlstm_kernels
Tiled Flash Linear Attention library for fast and efficient mLSTM Kernels.
☆74Updated this week
lucidrains / light-recurrent-unit-pytorch
Implementation of a Light Recurrent Unit in Pytorch
☆49Updated last year
thecharlieblake / lovely-llama
An implementation of the Llama architecture, to instruct and delight
☆21Updated 6 months ago
fattorib / ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
☆14Updated 2 years ago
apple / ml-ademamix
☆68Updated last year
google-deepmind / randomized_positional_encodings
Randomized Positional Encodings Boost Length Generalization of Transformers
☆83Updated last year
irhum / hyena
JAX/Flax implementation of the Hyena Hierarchy
☆34Updated 2 years ago
Joluck / MiSS
MiSS is a novel PEFT method that features a low-rank structure but introduces a new update mechanism distinct from LoRA, achieving an exc…
☆25Updated last month
sustcsonglin / mamba-triton
☆50Updated last year
erfanzar / eformer
(EasyDel Former) is a utility library designed to simplify and enhance the development in JAX
☆28Updated this week
PythonNut / superbpe
Official code release for "SuperBPE: Space Travel for Language Models"
☆76Updated last week
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆80Updated last year
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆18Updated 4 months ago