openai / weak-to-strongLinks

☆2,552

Alternatives and similar repositories for weak-to-strong

Users that are interested in weak-to-strong are comparing it to the libraries listed below

Sorting:

uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,220Updated last year
myshell-ai / JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
☆987Updated last year
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,405Updated last year
openai / transformer-debugger
☆4,110Updated last year
meta-pytorch / gpt-fast
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
☆6,156Updated 3 months ago
openai / prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
☆2,073Updated 2 years ago
jiaweizzhao / GaLore
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
☆1,623Updated last year
microsoft / ToRA
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…
☆1,104Updated last year
XueFuzhao / OpenMoE
A family of open-sourced Mixture-of-Experts (MoE) Large Language Models
☆1,635Updated last year
SakanaAI / evolutionary-model-merge
Official repository of Evolutionary Optimization of Model Merging Recipes
☆1,382Updated 11 months ago
openai / simple-evals
☆4,181Updated 3 months ago
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,531Updated last year
allenai / open-instruct
AllenAI's post-training codebase
☆3,317Updated last week
google-deepmind / funsearch
☆961Updated last year
allenai / OLMo
Modeling, training, eval, and inference code for OLMo
☆6,168Updated last month
dvlab-research / LongLoRA
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
☆2,693Updated last year
JShollaj / awesome-llm-interpretability
A curated list of Large Language Model (LLM) Interpretability resources.
☆1,444Updated 5 months ago
facebookresearch / llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. …
☆849Updated 11 months ago
tatsu-lab / alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
☆1,906Updated 3 months ago
anthropics / hh-rlhf
Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
☆1,799Updated 5 months ago
huggingface / optimum-nvidia
☆1,011Updated 9 months ago
openai / automated-interpretability
☆1,056Updated last year
huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆5,427Updated 2 months ago
AI-Hypercomputer / maxtext
A simple, performant and scalable Jax LLM!
☆1,994Updated last week
dvmazur / mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
☆2,324Updated last year
pytorch / torchtitan
A PyTorch native platform for training generative AI models
☆4,754Updated this week
Xwin-LM / Xwin-LM
Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment
☆1,042Updated last year
open-compass / MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
☆773Updated last year
Alpha-VLLM / LLaMA2-Accessory
An Open-source Toolkit for LLM Development
☆2,794Updated 10 months ago
facebookresearch / chameleon
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
☆2,068Updated last year