openai / weak-to-strong
☆2,483 · Updated 4 months ago
Related projects:
- Reaching LLaMA2 Performance with 0.1M Dollars ☆957 · Updated last month
- A native PyTorch Library for large model training ☆1,727 · Updated this week
- Modeling, training, eval, and inference code for OLMo ☆4,406 · Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,309 · Updated 5 months ago
- ☆1,522 · Updated last week
- A unified evaluation framework for large language models ☆2,375 · Updated last week
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆958 · Updated 4 months ago
- Training LLMs with QLoRA + FSDP ☆1,385 · Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video. ☆2,614 · Updated last month
- A Native-PyTorch Library for LLM Fine-tuning ☆3,954 · Updated this week
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python. ☆5,521 · Updated this week
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling ☆1,385 · Updated 2 months ago
- Tools for merging pretrained large language models. ☆4,501 · Updated this week
- A framework for few-shot evaluation of language models. ☆6,426 · Updated this week
- Robust recipes to align language models with human and AI preferences ☆4,481 · Updated last month
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,354 · Updated last week
- A simple, performant and scalable Jax LLM! ☆1,450 · Updated this week
- Run Mixtral-8x7B models in Colab or consumer desktops ☆2,288 · Updated 5 months ago
- ☆876 · Updated this week
- A curated list of Large Language Model (LLM) Interpretability resources. ☆1,070 · Updated last month
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit… ☆931 · Updated 6 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes ☆1,192 · Updated 5 months ago
- ☆4,006 · Updated 3 months ago
- Set of tools to assess and improve LLM security. ☆2,515 · Updated last week
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,436 · Updated this week
- SGLang is a fast serving framework for large language models and vision language models. ☆5,162 · Updated this week
- An Open-source Toolkit for LLM Development ☆2,687 · Updated 3 months ago
- The hub for EleutherAI's work on interpretability and learning dynamics ☆2,210 · Updated last month
- [ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues? ☆1,766 · Updated 2 weeks ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,120 · Updated 3 weeks ago