tunib-ai / oslo
OSLO: Open Source framework for Large-scale model Optimization
☆308Updated 2 years ago
Alternatives and similar repositories for oslo:
Users that are interested in oslo are comparing it to the libraries listed below
- OSLO: Open Source for Large-scale Optimization☆175Updated last year
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆786Updated 2 years ago
- Large-scale language modeling tutorials with PyTorch☆290Updated 3 years ago
- FriendliAI Model Hub☆92Updated 2 years ago
- Data processing system for polyglot☆91Updated last year
- A performance library for machine learning applications.☆184Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆129Updated 2 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆116Updated 3 years ago
- Korean-English Bilingual Electra Models☆109Updated 3 years ago
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆111Updated last year
- ☆96Updated 2 years ago
- ☆251Updated 9 months ago
- Lightweight and Parallel Deep Learning Framework☆261Updated 2 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆480Updated last year
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Updated 3 years ago
- Natural Language Processing Tasks and Examples.☆62Updated 2 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper☆313Updated last year
- Implementation of a Transformer, but completely in Triton☆263Updated 3 years ago
- Matorage is tensor(multidimensional matrix) object storage manager for deep learning framework(Pytorch, Tensorflow V2, Keras)☆73Updated 2 years ago
- Korean Math Word Problems☆58Updated 3 years ago
- data related codebase for polyglot project☆19Updated 2 years ago
- Implementation of Flash Attention in Jax☆206Updated last year
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- This project shows how to serve an TF based image classification model as a web service with TFServing, Docker, and Kubernetes(GKE).☆122Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆101Updated 4 years ago
- Dataset of Korean Threatening Conversations☆71Updated 2 years ago
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆118Updated 4 years ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆85Updated last year
- Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.☆719Updated 3 months ago
- KoCLIP: Korean port of OpenAI CLIP, in Flax☆149Updated last year