dsam99 / QueRE
Code repository for the paper "Predicting the Performance of Black-Box LLMs through Self-Queries".
☆11Updated 9 months ago
Alternatives and similar repositories for QueRE
Users that are interested in QueRE are comparing it to the libraries listed below
- LLM reads a paper and produces a working prototype☆57Updated 6 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated 2 years ago
- ☆25Updated 4 months ago
- Lottery Ticket Adaptation☆40Updated 10 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 10 months ago
- accompanying material for sleep-time compute paper☆117Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆35Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated last week
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆21Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆46Updated last year
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- ☆50Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆102Updated 6 months ago
- ☆28Updated 6 months ago
- ☆23Updated last year
- ☆55Updated 11 months ago
- Verifiers for LLM Reinforcement Learning☆76Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 6 months ago
- ☆40Updated 10 months ago
- ☆95Updated 6 months ago
- ☆48Updated last year
- Automatic Prompt Optimization☆45Updated last year
- ☆56Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- A method for steering LLMs to better follow instructions☆54Updated 2 months ago
- ☆119Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆50Updated last year
- ☆21Updated 4 months ago
- ☆68Updated last year
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆39Updated 6 months ago