rosewang2008 / bridge
NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes"
☆38Updated 9 months ago
Alternatives and similar repositories for bridge
Users that are interested in bridge are comparing it to the libraries listed below
Sorting:
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆92Updated 3 weeks ago
- ☆69Updated last year
- ☆93Updated 11 months ago
- ☆35Updated 6 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆75Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆83Updated 9 months ago
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023☆52Updated 2 months ago
- ☆106Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆105Updated 7 months ago
- ☆33Updated 2 years ago
- Resources for cultural NLP research☆94Updated 2 weeks ago
- The repository contains the code and dataset for the Socratic Debugging task which is a novel task for Socratically Questioning Novice De…☆17Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- ☆23Updated 11 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆104Updated 7 months ago
- Learning to route instances for Human vs AI Feedback☆23Updated 3 months ago
- ☆57Updated 7 months ago
- ☆20Updated 2 months ago
- Code/data for MARG (multi-agent review generation)☆43Updated 5 months ago
- ☆42Updated 9 months ago
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆120Updated 8 months ago
- A package dedicated for running benchmark agreement testing☆16Updated this week
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆84Updated 5 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆36Updated 4 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆47Updated 11 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆69Updated 2 years ago
- Functional Benchmarks and the Reasoning Gap☆85Updated 7 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated 11 months ago