csinva / tree-promptLinks
Tree prompting: easy-to-use scikit-learn interface for improved prompting.
☆37Updated last year
Alternatives and similar repositories for tree-prompt
Users that are interested in tree-prompt are comparing it to the libraries listed below
Sorting:
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆56Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- ☆33Updated 6 months ago
- ☆47Updated 5 months ago
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆66Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 5 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆26Updated 3 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Exploration of automated dataset selection approaches at large scales.☆47Updated 4 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆27Updated 4 months ago
- Lottery Ticket Adaptation☆39Updated 7 months ago
- ☆19Updated 4 months ago
- On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆39Updated last week
- ☆66Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆35Updated 9 months ago
- NeurIPS 2024 tutorial on LLM Inference☆45Updated 7 months ago
- ☆23Updated 3 months ago
- ☆16Updated 11 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆20Updated 2 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆39Updated 4 months ago
- ☆19Updated 3 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆86Updated last year
- ☆41Updated 8 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆107Updated 9 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated 5 months ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 4 months ago
- ☆24Updated 6 months ago