huggingface / optimum-graphcoreLinks

Blazing fast training of 🤗 Transformers on Graphcore IPUs

☆85

Alternatives and similar repositories for optimum-graphcore

Users that are interested in optimum-graphcore are comparing it to the libraries listed below

Sorting:

huggingface / bloom-jax-inference
☆66Updated 3 years ago
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆188Updated 3 years ago
EleutherAI / DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
☆169Updated last month
google / praxis
☆190Updated this week
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆216Updated last year
huggingface / optimum-habana
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
☆199Updated this week
lucidrains / flash-attention-jax
Implementation of Flash Attention in Jax
☆219Updated last year
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆87Updated last year
sholtodouglas / scalingExperiments
☆62Updated 3 years ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆119Updated last year
google / flaxformer
☆362Updated last year
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆276Updated 3 years ago
microsoft / varuna
☆252Updated last year
lessw2020 / transformer_central
Various transformers for FSDP research
☆38Updated 2 years ago
Sea-Snell / JAXSeq
Train very large language models in Jax.
☆209Updated 2 years ago
SeanNaren / min-LLM
Minimal code to train a Large Language Model (LLM).
☆172Updated 3 years ago
graphcore / tutorials
Training material for IPU users: tutorials, feature examples, simple applications
☆87Updated 2 years ago
r-three / git-theta
git extension for {collaborative, communal, continual} model development
☆215Updated 11 months ago
yandex-research / DeDLOC
Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)
☆117Updated 3 years ago
huggingface / doc-builder
The package used to build the documentation of our Hugging Face repos
☆130Updated this week
huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆178Updated 2 years ago
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆121Updated 9 months ago
huggingface / transformers_bloom_parallel
Techniques used to run BLOOM at inference in parallel
☆37Updated 3 years ago
huggingface / optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of O…
☆318Updated last month
google / aqt
☆335Updated last month
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆242Updated 2 years ago
rom1504 / cc2dataset
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
☆319Updated last year
sgugger / torchdynamo-tests
☆19Updated 2 years ago
salesforce / jaxformer
Minimal library to train LLMs on TPU in JAX with pjit().
☆298Updated last year
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago