rosewang2008 / bridgeLinks
NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"
☆41Updated 11 months ago
Alternatives and similar repositories for bridge
Users that are interested in bridge are comparing it to the libraries listed below
Sorting:
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆55Updated 3 months ago
- ☆69Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆95Updated 2 months ago
- The Prism Alignment Project☆77Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆130Updated last year
- ☆106Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 5 months ago
- ☆25Updated last year
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆18Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆106Updated 8 months ago
- Can Large Language Models Be an Alternative to Human Evaluations?☆9Updated last year
- ☆33Updated 2 years ago
- ☆51Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆81Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 6 months ago
- ☆43Updated 10 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆84Updated 10 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors☆12Updated 2 weeks ago
- ☆42Updated last year
- ☆37Updated 8 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆105Updated 2 weeks ago
- ☆95Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆100Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆44Updated 11 months ago
- PRODIGy is a collection of dialogues in which each conversation is aligned with speaker profile representations.☆19Updated 5 months ago
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms"☆28Updated last year
- ☆86Updated 7 months ago
- ☆11Updated last year