nec-research / agentquest
☆22Updated last month
Related projects ⓘ
Alternatives and complementary repositories for agentquest
- SCREWS: A Modular Framework for Reasoning with Revisions☆26Updated last year
- ☆41Updated 2 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- ☆24Updated last year
- ☆42Updated 4 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 4 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- PyTorch implementation for MRL☆18Updated 9 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆63Updated last year
- ☆48Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆13Updated 8 months ago
- Embedding Recycling for Language models☆38Updated last year
- ☆41Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆23Updated 8 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 8 months ago
- Writing Blog Posts with Generative Feedback Loops!☆43Updated 8 months ago
- ☆46Updated this week
- Based on the tree of thoughts paper☆45Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆21Updated last week
- Track the progress of LLM context utilisation☆53Updated 4 months ago
- ☆74Updated 3 weeks ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆61Updated 4 months ago
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 5 months ago
- Understanding the correlation between different LLM benchmarks☆29Updated 10 months ago
- LLMs as Collaboratively Edited Knowledge Bases☆43Updated 9 months ago
- ☆37Updated this week