tunib-ai / osloLinks
OSLO: Open Source framework for Large-scale model Optimization
☆309Updated 2 years ago
Alternatives and similar repositories for oslo
Users that are interested in oslo are comparing it to the libraries listed below
Sorting:
- OSLO: Open Source for Large-scale Optimization☆175Updated last year
- FriendliAI Model Hub☆91Updated 3 years ago
- Large-scale language modeling tutorials with PyTorch☆291Updated 3 years ago
- Parallelformers: An Efficient Model Parallelization Toolkit for Deployment☆791Updated 2 years ago
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆130Updated 2 years ago
- Data processing system for polyglot☆91Updated last year
- A performance library for machine learning applications.☆184Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!☆112Updated 2 years ago
- Korean-English Bilingual Electra Models☆110Updated 3 years ago
- Polyglot: Large Language Models of Well-balanced Competence in Multi-languages☆486Updated last year
- The code and models for "An Empirical Study of Tokenization Strategies for Various Korean NLP Tasks" (AACL-IJCNLP 2020)☆119Updated 4 years ago
- Official code for "Distributed Deep Learning in Open Collaborations" (NeurIPS 2021)☆117Updated 3 years ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆60Updated 3 years ago
- data related codebase for polyglot project☆19Updated 2 years ago
- [Google Meet] MLLM Arxiv Casual Talk☆52Updated 2 years ago
- ☆19Updated 2 years ago
- Korean Math Word Problems☆59Updated 3 years ago
- Natural Language Processing Tasks and Examples.☆62Updated 2 years ago
- Adversarial Test Dataset for Korean Multi-turn Response Selection☆35Updated 3 years ago
- 🤗 최소한의 세팅으로 LM을 학습하기 위한 샘플코드☆58Updated 2 years ago
- ☆96Updated 3 years ago
- ☆107Updated 2 years ago
- Prune a model while finetuning or training.☆403Updated 3 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Updated 2 years ago
- Implementation of a Transformer, but completely in Triton☆273Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)☆187Updated 3 years ago
- ☆47Updated last year
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Updated 3 years ago
- A PyTorch Implementation of the Luna: Linear Unified Nested Attention☆41Updated 4 years ago
- Create paraphrasing korean sentence with GPT-3☆34Updated 2 years ago