openai / weak-to-strong
☆2,521Updated 9 months ago
Alternatives and similar repositories for weak-to-strong:
Users that are interested in weak-to-strong are comparing it to the libraries listed below
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,117Updated 9 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,361Updated 10 months ago
- A unified evaluation framework for large language models☆2,532Updated last week
- ☆2,332Updated last week
- A curated list of Large Language Model (LLM) Interpretability resources.☆1,239Updated 2 months ago
- Training LLMs with QLoRA + FSDP☆1,451Updated 3 months ago
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,790Updated 2 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,446Updated 11 months ago
- [ICML'24] Magicoder: Empowering Code Generation with OSS-Instruct☆1,992Updated 3 months ago
- ☆930Updated 2 weeks ago
- Mixture-of-Experts for Large Vision-Language Models☆2,082Updated 2 months ago
- PyTorch native post-training library☆4,856Updated this week
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,654Updated last month
- ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting wit…☆1,031Updated 11 months ago
- PyTorch code and models for V-JEPA self-supervised learning from video.☆2,785Updated 6 months ago
- An Open-source Toolkit for LLM Development☆2,758Updated last month
- ☆4,058Updated 8 months ago
- A simple, performant and scalable Jax LLM!☆1,623Updated this week
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,374Updated 3 weeks ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,498Updated 3 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆973Updated 6 months ago
- Measuring Massive Multitask Language Understanding | ICLR 2021☆1,305Updated last year
- A PyTorch native library for large model training☆3,326Updated this week
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting☆2,668Updated 6 months ago
- Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09…☆2,059Updated this week
- Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"☆2,282Updated 2 months ago
- 【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection☆3,153Updated 2 months ago
- [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling☆1,609Updated 7 months ago
- Representation Engineering: A Top-Down Approach to AI Transparency☆789Updated 6 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,083Updated last year