az1326 / advisor-models
How to Train Your Advisor: Steering Black-Box LLMs with Advisor Models
☆48 Updated 2 months ago
Alternatives and similar repositories for advisor-models
Users who are interested in advisor-models are comparing it to the libraries listed below.
- Official Repo for InSTA: Towards Internet-Scale Training For Agents ☆55 Updated 5 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆189 Updated 9 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆62 Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language ☆152 Updated 10 months ago
- ☆88 Updated last week
- ☆69 Updated last year
- ☆144 Updated 3 months ago
- ☆115 Updated 2 weeks ago
- PyTorch library for Active Fine-Tuning ☆95 Updated 2 months ago
- Discovering Data-driven Hypotheses in the Wild ☆122 Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl… ☆76 Updated last year
- ☆124 Updated 2 months ago
- ☆77 Updated 2 months ago
- Storing long contexts in tiny caches with self-study ☆220 Updated 2 weeks ago
- A reading list of relevant papers and projects on foundation model annotation ☆28 Updated 9 months ago
- Official implementation of "BERTs are Generative In-Context Learners" ☆32 Updated 9 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆221 Updated this week
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆63 Updated 7 months ago
- Inverse Scaling in Test-Time Compute ☆23 Updated 2 weeks ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning" ☆277 Updated 3 weeks ago
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆49 Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆121 Updated 2 months ago
- Simple repository for training small reasoning models ☆47 Updated 10 months ago
- ReLM is a Regular Expression engine for Language Models ☆107 Updated 2 years ago
- ☆72 Updated last year
- Extract full next-token probabilities via language model APIs ☆248 Updated last year
- Optimize Any User-defined Compound AI Systems ☆63 Updated 4 months ago
- ☆33 Updated 11 months ago
- Chrome extension for renaming tabs showing paper PDFs from common providers ☆97 Updated 11 months ago
- ☆234 Updated 5 months ago