dsam99 / QueRELinks
Code repository for the paper on "Predicting the Performance of Black-Box LLMs through Self-Queries".
β11Updated 7 months ago
Alternatives and similar repositories for QueRE
Users that are interested in QueRE are comparing it to the libraries listed below
Sorting:
- LLM reads a paper and produce a working prototypeβ57Updated 4 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" π€β73Updated 8 months ago
- accompanying material for sleep-time compute paperβ102Updated 3 months ago
- β23Updated 2 months ago
- Lottery Ticket Adaptationβ39Updated 8 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.β100Updated 3 months ago
- β54Updated 9 months ago
- SCREWS: A Modular Framework for Reasoning with Revisionsβ27Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM datasetβ18Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β82Updated this week
- Verifiers for LLM Reinforcement Learningβ69Updated 4 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoningβ61Updated last month
- Simple GRPO scripts and configurations.β59Updated 6 months ago
- Multi-Granularity LLM Debuggerβ88Updated last month
- β48Updated last year
- β24Updated 10 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agentsβ53Updated last month
- Python package for generating datasets to evaluate reasoning and retrieval of large language modelsβ19Updated last week
- β66Updated 4 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows!β74Updated 2 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)β29Updated last year
- β48Updated 10 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.β34Updated last year
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorerβ43Updated last year
- Understanding the correlation between different LLM benchmarksβ29Updated last year
- β92Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignmentβ60Updated 11 months ago
- The original Shared Recurrent Memory Transformer implementationβ30Updated last month
- Automatic Prompt Optimizationβ40Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)β118Updated 6 months ago