AI-Hypercomputer / kitharaLinks

☆16

Alternatives and similar repositories for kithara

Users that are interested in kithara are comparing it to the libraries listed below

Sorting:

AI-Hypercomputer / ray-tpu
☆15Updated 6 months ago
AI-Hypercomputer / torchprime
torchprime is a reference model implementation for PyTorch on TPU.
☆41Updated last month
AI-Hypercomputer / cloud-accelerator-diagnostics
☆24Updated 2 weeks ago
AI-Hypercomputer / tpu-recipes
☆54Updated this week
google / saxml
☆148Updated last month
mgmalek / efficient_cross_entropy
☆121Updated last year
catie-aq / flashT5
A fast implementation of T5/UL2 in PyTorch using Flash Attention
☆112Updated last month
AI-Hypercomputer / jetstream-pytorch
PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"
☆78Updated 2 months ago
google / praxis
☆190Updated 2 weeks ago
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆87Updated 3 years ago
AI-Hypercomputer / xpk
xpk (Accelerated Processing Kit, pronounced x-p-k,) is a software tool to help Cloud developers to orchestrate training jobs on accelerat…
☆154Updated this week
google-deepmind / randomized_positional_encodings
Randomized Positional Encodings Boost Length Generalization of Transformers
☆83Updated last year
yixiaoer / tpux
A set of Python scripts that makes your experience on TPU better
☆54Updated 2 months ago
warner-benjamin / optimi
Fast, Modern, and Low Precision PyTorch Optimizers
☆116Updated 3 months ago
cat-state / tinypar
☆20Updated 2 years ago
thecharlieblake / lovely-llama
An implementation of the Llama architecture, to instruct and delight
☆21Updated 6 months ago
microsoft / encoder-decoder-slm
Efficient encoder-decoder architecture for small language models (≤1B parameters) with cross-architecture knowledge distillation and visi…
☆33Updated 10 months ago
pytorch-tpu / transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
☆17Updated 6 months ago
erfanzar / jax-flash-attn2
A flexible and efficient implementation of Flash Attention 2.0 for JAX, supporting multiple backends (GPU/TPU/CPU) and platforms (Triton/…
☆30Updated 9 months ago
young-geng / mlxu
Machine Learning eXperiment Utilities
☆46Updated 4 months ago
kyo-takano / chinchilla
A toolkit for scaling law research ⚖
☆53Updated 10 months ago
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆123Updated 10 months ago
fattorib / ZeRO-transformer
Two implementations of ZeRO-1 optimizer sharding in JAX
☆14Updated 2 years ago
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆132Updated 4 months ago
facebookresearch / spdl
Scalable and Performant Data Loading
☆349Updated this week
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆392Updated 5 months ago
lessw2020 / transformer_central
Various transformers for FSDP research
☆38Updated 3 years ago
kjslag / spacebyte
A byte-level decoder architecture that matches the performance of tokenized Transformers.
☆66Updated last year
TobiasNorlund / retro
Official repo to On the Generalization Ability of Retrieval-Enhanced Transformers
☆44Updated last year
HeegyuKim / torch-xla-SPMD
Pytorch/XLA SPMD Test code in Google TPU
☆23Updated last year