tunib-ai / osloLinks
OSLO: Open Source framework for Large-scale model Optimization
☆309Updated 2 years ago
Alternatives and similar repositories for oslo
Users that are interested in oslo are comparing it to the libraries listed below
Sorting:
- OSLO: Open Source for Large-scale Optimization☆175Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆790Updated 2 years ago
- Large-scale language modeling tutorials with PyTorch☆290Updated 3 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆129Updated 2 years ago
- Data processing system for polyglot☆91Updated last year
- FriendliAI Model Hub☆91Updated 3 years ago
- A performance library for machine learning applications.☆184Updated last year
- Korean-English Bilingual Electra Models☆110Updated 3 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆115Updated 3 years ago
- Implementation of a Transformer, but completely in Triton☆269Updated 3 years ago
- ☆250Updated 11 months ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆111Updated 2 years ago
- Memory Efficient Attention (O(sqrt(n)) for Jax and PyTorch☆184Updated 2 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆118Updated 4 years ago
- Prune a model while finetuning or training.☆403Updated 3 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆85Updated last year
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆483Updated last year
- data related codebase for polyglot project☆19Updated 2 years ago
- Korean Math Word Problems☆59Updated 3 years ago
- Inference code for LLaMA models in JAX☆118Updated last year
- Scalable PaLM implementation of PyTorch☆190Updated 2 years ago
- ☆67Updated 2 years ago
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆187Updated 3 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.☆93Updated 2 years ago
- Finetuning Pipeline☆90Updated 3 years ago
- ☆356Updated last year
- Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)☆73Updated 2 years ago
- Library for 8-bit optimizers and quantization routines.☆715Updated 2 years ago