EleutherAI / osloLinks

OSLO: Open Source for Large-scale Optimization

☆175

Alternatives and similar repositories for oslo

Users that are interested in oslo are comparing it to the libraries listed below

Sorting:

tunib-ai / oslo
OSLO: Open Source framework for Large-scale model Optimization
☆309Updated 2 years ago
Sea-Snell / JAX_llama
Inference code for LLaMA models in JAX
☆118Updated last year
EleutherAI / dps
Data processing system for polyglot
☆91Updated last year
microsoft / mutransformers
some common Huggingface transformers in maximal update parametrization (µP)
☆81Updated 3 years ago
xrsrke / pipegoose
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
☆84Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆115Updated 2 years ago
huggingface / bloom-jax-inference
☆67Updated 2 years ago
ayaka14732 / llama-2-jax
JAX implementation of the Llama 2 model
☆219Updated last year
mgmalek / efficient_cross_entropy
☆109Updated last year
Rallio67 / language-model-agents
Experiments with generating opensource language model assistants
☆97Updated 2 years ago
lassl / lassl
Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets
☆129Updated 2 years ago
lcw99 / evolve-instruct
evolve llm training instruction, from english instruction to any language.
☆118Updated last year
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
huggingface / olm-datasets
Pipeline for pulling and processing online language model pretraining data from the web
☆178Updated last year
sholtodouglas / scalingExperiments
☆60Updated 3 years ago
deep-diver / PingPong
manage histories of LLM applied applications
☆90Updated last year
EleutherAI / polyglot-data
data related codebase for polyglot project
☆19Updated 2 years ago
friendliai / FAI-Model
FriendliAI Model Hub
☆91Updated 3 years ago
tunib-ai / large-scale-lm-tutorials
Large-scale language modeling tutorials with PyTorch
☆290Updated 3 years ago
kakaobrain / trident
A performance library for machine learning applications.
☆184Updated last year
imoneoi / multipack_sampler
Multipack distributed sampler for fast padding-free training of LLMs
☆191Updated 10 months ago
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
goddoe / simple-r1
☆15Updated 2 weeks ago
SeanNaren / minGPT
A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training!
☆111Updated 2 years ago
HanGuo97 / lq-lora
☆126Updated last year
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆187Updated 3 years ago
srush / do-we-need-attention
☆166Updated last year
jaymody / speculative-sampling
Simple implementation of Speculative Sampling in NumPy for GPT-2.
☆95Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆106Updated last year
cloneofsimo / ezmup
Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam
☆80Updated 11 months ago