swarnaHub / ExplanationIntervention
[NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind
☆66Updated last year
Alternatives and similar repositories for ExplanationIntervention
Users who are interested in ExplanationIntervention are comparing it to the libraries listed below
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆88Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."☆64Updated 2 years ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 6 months ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- SILO Language Models code repository☆81Updated last year
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 2 months ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆114Updated 2 years ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆74Updated last year
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆118Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆36Updated last year
- ☆45Updated 4 months ago
- ☆135Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- ☆94Updated 8 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 8 months ago
- [NAACL 2024] Struc-Bench: Are Large Language Models Good at Generating Complex Structured Tabular Data? https://aclanthology.org/2024.naa…☆54Updated 3 weeks ago
- Mixing Language Models with Self-Verification and Meta-Verification☆105Updated 8 months ago
- ☆74Updated last year
- ☆29Updated last year
- ☆127Updated 10 months ago
- A set of utilities for running few-shot prompting experiments on large-language models☆122Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆29Updated last year
- Official repo of Rephrase-and-Respond: data, code, and evaluation☆103Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆81Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago