rosewang2008 / bridge
NAACL 2024. Code & Dataset for "๐ Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"
โ32Updated 7 months ago
Alternatives and similar repositories for bridge:
Users that are interested in bridge are comparing it to the libraries listed below
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Dataโ82Updated 6 months ago
- โ65Updated 10 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answersโ125Updated 11 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"โ83Updated 6 months ago
- ๐งฎ MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023โ47Updated 11 months ago
- Code/data for MARG (multi-agent review generation)โ38Updated 3 months ago
- โ32Updated last year
- Backtracing: Retrieving the Cause of the Query, EACL 2024 Long Paper, Findings.โ88Updated 7 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"โ54Updated 11 months ago
- โ104Updated 9 months ago
- โ20Updated 8 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"โ34Updated 2 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messagesโ42Updated 2 months ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice Deโฆโ15Updated 10 months ago
- โ117Updated 4 months ago
- โ90Updated 8 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"โ64Updated 8 months ago
- โ42Updated 8 months ago
- The Synthetic-Persona-Chat dataset is a synthetically generated persona-based dialogue dataset. It extends the original Persona-Chat dataโฆโ82Updated last year
- Codebase accompanying the Summary of a Haystack paper.โ74Updated 5 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"โ71Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructionsโ42Updated 7 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award โฆโ38Updated 3 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)โ43Updated last month
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)โ35Updated last month
- Code accompanying "How I learned to start worrying about prompt formatting".โ102Updated 4 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptionsโ69Updated last year
- โ57Updated 4 months ago
- โ33Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).โ79Updated 11 months ago