EleutherAI / oslo
OSLO: Open Source for Large-scale Optimization
☆174Updated last year
Related projects ⓘ
Alternatives and complementary repositories for oslo
- Data processing system for polyglot☆90Updated last year
- Inference code for LLaMA models in JAX☆113Updated 6 months ago
- OSLO: Open Source framework for Large-scale model Optimization☆306Updated 2 years ago
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆80Updated 11 months ago
- evolve llm training instruction, from english instruction to any language.☆113Updated last year
- some common Huggingface transformers in maximal update parametrization (µP)☆77Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- FriendliAI Model Hub☆89Updated 2 years ago
- data related codebase for polyglot project☆19Updated last year
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆59Updated 2 years ago
- Multipack distributed sampler for fast padding-free training of LLMs☆178Updated 3 months ago
- [Google Meet] MLLM Arxiv Casual Talk☆55Updated last year
- Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets☆127Updated 2 years ago
- A performance library for machine learning applications.☆180Updated last year
- JAX implementation of the Llama 2 model☆210Updated 9 months ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated last year
- manage histories of LLM applied applications☆86Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆68Updated 3 months ago
- Anh - LAION's multilingual assistant datasets and models☆27Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆49Updated 3 months ago
- Train very large language models in Jax.☆195Updated last year
- Large-scale language modeling tutorials with PyTorch☆287Updated 3 years ago
- ☆57Updated 2 years ago
- ☆64Updated 2 years ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated 3 months ago
- ☆73Updated 4 months ago
- ☆18Updated 4 months ago
- ☆77Updated 5 months ago
- ☆122Updated 10 months ago