rosewang2008 / bridge
NAACL 2024. Code & Dataset for "Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"
★29 · Updated 4 months ago
Related projects
Alternatives and complementary repositories for bridge
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ★77 · Updated 3 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ★45 · Updated 8 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ★122 · Updated 8 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ★78 · Updated 3 months ago
- ★86 · Updated 5 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ★62 · Updated last year
- ★29 · Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets ★211 · Updated 10 months ago
- ★63 · Updated 7 months ago
- ★94 · Updated 6 months ago
- ★199 · Updated this week
- Benchmarking library for RAG ★123 · Updated this week
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De… ★13 · Updated 7 months ago
- Governance of the Commons Simulation (GovSim) ★21 · Updated 4 months ago
- [ICLR 2024 & NeurIPS 2023 WS] An Evaluator LM that is open-source, offers reproducible evaluation, and inexpensive to use. Specifically d… ★287 · Updated last year
- Wrapper to easily generate the chat template for Llama2 ★63 · Updated 8 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets ★191 · Updated this week
- Datasets collection and preprocessings framework for NLP extreme multitask learning ★149 · Updated 4 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ★37 · Updated last month
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ★68 · Updated last year
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty ★68 · Updated 7 months ago
- The Prism Alignment Project ★37 · Updated 6 months ago
- Evaluating LLMs with CommonGen-Lite ★85 · Updated 8 months ago
- Evaluating LLMs with fewer examples ★134 · Updated 7 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models" ★115 · Updated last month
- This repository hosts the paper "LLM Based Math Tutoring: Challenges and Dataset", along with the accompanying dataset. It explores the p… ★28 · Updated 2 months ago
- ★95 · Updated last week
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ★49 · Updated 8 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ★31 · Updated 11 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ★62 · Updated 5 months ago