dsam99 / QueRELinks
Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".
☆12Updated last year
Alternatives and similar repositories for QueRE
Users that are interested in QueRE are comparing it to the libraries listed below
Sorting:
- LLM reads a paper and produce a working prototype☆60Updated 9 months ago
- Lottery Ticket Adaptation☆39Updated last year
- ☆25Updated 8 months ago
- ☆54Updated 2 weeks ago
- ☆55Updated last year
- accompanying material for sleep-time compute paper☆119Updated 9 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- ☆28Updated 9 months ago
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- ☆95Updated last week
- Explore the use of DSPy for extracting features from PDFs 🔎☆52Updated last year
- A method for steering llms to better follow instructions☆76Updated 5 months ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Updated 10 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆112Updated 9 months ago
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆33Updated last year
- ☆39Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆128Updated 11 months ago
- ☆105Updated 10 months ago
- ☆23Updated last year
- ☆43Updated 2 months ago
- ☆67Updated 10 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning☆37Updated 8 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Updated 5 months ago
- ☆61Updated 7 months ago
- Codebase accompanying the Summary of a Haystack paper.☆80Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆260Updated last week
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆46Updated last year
- Simple GRPO scripts and configurations.☆59Updated 11 months ago