nec-research / agentquest
☆25 Updated 5 months ago
Alternatives and similar repositories for agentquest:
Users who are interested in agentquest are comparing it to the libraries listed below.
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated last year
- ☆50 Updated 4 months ago
- ☆49 Updated 8 months ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models" ☆26 Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated this week
- ☆20 Updated 3 weeks ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 Updated last month
- ☆48 Updated 4 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆15 Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge" ☆32 Updated 9 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆39 Updated last month
- Simple GRPO scripts and configurations. ☆58 Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆75 Updated 6 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- ☆24 Updated last year
- ☆41 Updated 3 months ago
- ☆15 Updated 5 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆21 Updated 2 weeks ago
- Automatic Prompt Optimization ☆27 Updated 10 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆41 Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated 10 months ago
- ☆45 Updated 6 months ago
- Aioli: A unified optimization framework for language model data mixing ☆22 Updated 2 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆25 Updated 3 months ago
- ☆48 Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer ☆41 Updated 11 months ago
- Based on the tree of thoughts paper ☆46 Updated last year
- Mixing Language Models with Self-Verification and Meta-Verification ☆102 Updated 3 months ago