tunib-ai / osloLinks

OSLO: Open Source framework for Large-scale model Optimization

☆309

Alternatives and similar repositories for oslo

Users that are interested in oslo are comparing it to the libraries listed below

Sorting:

EleutherAI / oslo
OSLO: Open Source for Large-scale Optimization
☆175Updated last year
tunib-ai / parallelformers
Parallelformers: An Efficient Model Parallelization Toolkit for Deployment
☆790Updated 2 years ago
tunib-ai / large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
☆290Updated 3 years ago
lassl / lassl
Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets
☆129Updated 2 years ago
EleutherAI / dps
Data processing system for polyglot
☆91Updated last year
friendliai / FAI-Model
FriendliAI Model Hub
☆91Updated 3 years ago
kakaobrain / trident
A performance library for machine learning applications.
☆184Updated last year
tunib-ai / tunib-electra
Korean-English Bilingual Electra Models
☆110Updated 3 years ago
yandex-research / DeDLOC
Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)
☆115Updated 3 years ago
lucidrains / triton-transformer
Implementation of a Transformer, but completely in Triton
☆269Updated 3 years ago
microsoft / varuna
☆250Updated 11 months ago
SeanNaren / minGPT
A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!
☆111Updated 2 years ago
AminRezaei0x443 / memory-efficient-attention
Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch
☆184Updated 2 years ago
kakaobrain / kortok
The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)
☆118Updated 4 years ago
huggingface / nn_pruning
Prune a model while finetuning or training.
☆403Updated 3 years ago
huggingface / optimum-graphcore
Blazing fast training of 🤗 Transformers on Graphcore IPUs
☆85Updated last year
EleutherAI / polyglot
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
☆483Updated last year
EleutherAI / polyglot-data
data related codebase for polyglot project
☆19Updated 2 years ago
tunib-ai / KMWP
Korean Math Word Problems
☆59Updated 3 years ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆118Updated last year
hpcaitech / PaLM-colossalai
Scalable PaLM implementation of PyTorch
☆190Updated 2 years ago
huggingface / bloom-jax-inference
☆67Updated 2 years ago
gyunggyung / MLLMArxivTalk
[Google Meet] MLLM Arxiv Casual Talk
☆52Updated 2 years ago
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆187Updated 3 years ago
tunib-ai / transformers
🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed
☆31Updated 3 years ago
huggingface / olm-training
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆93Updated 2 years ago
KLUE-benchmark / KLUE-baseline
Finetuning Pipeline
☆90Updated 3 years ago
google / flaxformer
☆356Updated last year
graykode / matorage
Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)
☆73Updated 2 years ago
facebookresearch / bitsandbytes
Library for 8-bit optimizers and quantization routines.
☆715Updated 2 years ago